Re: [PR] [HUDI-7432] Fix excessive object creation in KeyGenUtils [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10721: URL: https://github.com/apache/hudi/pull/10721#issuecomment-1956044243 ## CI report: * d801e283ca4ab295e5e8d2624302dcb0d3649a71 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10722: URL: https://github.com/apache/hudi/pull/10722#issuecomment-1956044292 ## CI report: * 6d19db1591569a511e4d379c08252fdb9b3db4ee Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [MINOR] Re-containerize 4th Azure CI module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10720: URL: https://github.com/apache/hudi/pull/10720#issuecomment-1956044197 ## CI report: * 50f9c5f47b90873b6f8d2326c694c05c8db8dd4b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10722: URL: https://github.com/apache/hudi/pull/10722#issuecomment-1956035150 ## CI report: * 6d19db1591569a511e4d379c08252fdb9b3db4ee UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [MINOR] Re-containerize 4th Azure CI module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10720: URL: https://github.com/apache/hudi/pull/10720#issuecomment-1956035086 ## CI report: * 50f9c5f47b90873b6f8d2326c694c05c8db8dd4b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic [hudi]

2024-02-20 Thread via GitHub
usberkeley commented on PR #10722: URL: https://github.com/apache/hudi/pull/10722#issuecomment-1956032866 @yihua bro, please review the code -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

Re: [PR] Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic [hudi]

2024-02-20 Thread via GitHub
usberkeley commented on PR #10722: URL: https://github.com/apache/hudi/pull/10722#issuecomment-1956031804 @yihua bro, please help me take a look -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to th

[jira] [Closed] (HUDI-7434) Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic

2024-02-20 Thread bradley (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] bradley closed HUDI-7434. - Reviewers: (was: Y Ethan Guo) Resolution: Duplicate > Fix a bug in the HoodieBaseListData.isEmpty() empty-c

Re: [PR] [HUDI-7432] Fix excessive object creation in KeyGenUtils [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10721: URL: https://github.com/apache/hudi/pull/10721#issuecomment-1956026758 ## CI report: * d801e283ca4ab295e5e8d2624302dcb0d3649a71 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

[jira] [Created] (HUDI-7434) Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic

2024-02-20 Thread bradley (Jira)
bradley created HUDI-7434: - Summary: Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic Key: HUDI-7434 URL: https://issues.apache.org/jira/browse/HUDI-7434 Project: Apache Hudi Issue Ty

[jira] [Created] (HUDI-7433) Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic

2024-02-20 Thread bradley (Jira)
bradley created HUDI-7433: - Summary: Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic Key: HUDI-7433 URL: https://issues.apache.org/jira/browse/HUDI-7433 Project: Apache Hudi Issue Ty

[PR] Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic [hudi]

2024-02-20 Thread via GitHub
usberkeley opened a new pull request, #10722: URL: https://github.com/apache/hudi/pull/10722 ### Change Logs There is an obvious logical error in HoodieBaseListData.isEmpty(). When the internal data structure is Stream, there is a logical error in determining whether it is empty.

Re: [PR] [HUDI-7432] Fix excessive object creation in KeyGenUtils [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10721: URL: https://github.com/apache/hudi/pull/10721#issuecomment-1955979902 ## CI report: * d801e283ca4ab295e5e8d2624302dcb0d3649a71 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [MINOR] Re-containerize 4th Azure CI module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10720: URL: https://github.com/apache/hudi/pull/10720#issuecomment-1955979862 ## CI report: * 50f9c5f47b90873b6f8d2326c694c05c8db8dd4b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955972192 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * b9a8282209bfeb916001ce28661e8713e62ed8b5 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [MINOR] Re-containerize 4th Azure CI module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10720: URL: https://github.com/apache/hudi/pull/10720#issuecomment-1955972584 ## CI report: * 50f9c5f47b90873b6f8d2326c694c05c8db8dd4b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[jira] [Updated] (HUDI-7432) Fix excessive object creation in KeyGenUtils

2024-02-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7432: - Labels: pull-request-available (was: ) > Fix excessive object creation in KeyGenUtils > -

[PR] [HUDI-7432] Fix excessive object creation in KeyGenUtils [hudi]

2024-02-20 Thread via GitHub
wombatu-kun opened a new pull request, #10721: URL: https://github.com/apache/hudi/pull/10721 ### Change Logs Some Key generators are creating excessive copies of objects during recordKey/partitionPath computation to cut off last character from them (as methods getRecordKey, getRecor

[PR] [MINOR] Re-containerize 4th Azure CI module [hudi]

2024-02-20 Thread via GitHub
linliu-code opened a new pull request, #10720: URL: https://github.com/apache/hudi/pull/10720 ### Change Logs We have tested it for 20+ times without failures. We decided to reenable it to see how it works in prod. ### Impact More reliable. ### Risk level (write

Re: [I] [SUPPORT] Process killed with no additional info when loading large parquet files in Spark [hudi]

2024-02-20 Thread via GitHub
ad1happy2go commented on issue #10697: URL: https://github.com/apache/hudi/issues/10697#issuecomment-1955943929 Had working session with @alberttwong . The issue was docker instance was not given enough memory. We were able to read it using 8 GB memory. Closing this out. Thanks again @alber

Re: [I] [SUPPORT] Process killed with no additional info when loading large parquet files in Spark [hudi]

2024-02-20 Thread via GitHub
alberttwong closed issue #10697: [SUPPORT] Process killed with no additional info when loading large parquet files in Spark URL: https://github.com/apache/hudi/issues/10697 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

Re: [I] [SUPPORT] Process killed with no additional info when loading large parquet files in Spark [hudi]

2024-02-20 Thread via GitHub
alberttwong commented on issue #10697: URL: https://github.com/apache/hudi/issues/10697#issuecomment-1955943191 I had to allocate more memory for docker desktop. That fixed the issue. ![Screenshot 2024-02-20 at 9 50 49  PM](https://github.com/apache/hudi/assets/749093/e9ea642f-eb7c

Re: [I] Partitioning data into two keys is taking more time (10x) than partitioning into one key. [hudi]

2024-02-20 Thread via GitHub
maheshguptags commented on issue #10456: URL: https://github.com/apache/hudi/issues/10456#issuecomment-1955928240 @danny0405 still waiting for your response. can you please take look on this plz? -- This is an automated message from the Apache Git Service. To respond to the message, p

Re: [I] RLI Spark Hudi Error occurs when executing map [hudi]

2024-02-20 Thread via GitHub
maheshguptags commented on issue #10609: URL: https://github.com/apache/hudi/issues/10609#issuecomment-1955927720 @ad1happy2go and @yihua any update on this? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abov

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10687: URL: https://github.com/apache/hudi/pull/10687#issuecomment-1955925406 ## CI report: * 78c5f6e801b1c7428326456360dbc4282a9f9703 UNKNOWN * 2d33c841bb67b947626bd4df724df54ea4522c1b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

[jira] [Updated] (HUDI-7432) Fix excessive object creation in KeyGenUtils

2024-02-20 Thread Vova Kolmakov (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vova Kolmakov updated HUDI-7432: Description: Some Key generators are creating excessive copies of objects during recordKey/partitio

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10687: URL: https://github.com/apache/hudi/pull/10687#issuecomment-1955918898 ## CI report: * 78c5f6e801b1c7428326456360dbc4282a9f9703 UNKNOWN * 2d33c841bb67b947626bd4df724df54ea4522c1b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6860] Move createNewInstantTime out of HoodieActiveTimeline [hudi]

2024-02-20 Thread via GitHub
wombatu-kun commented on PR #10714: URL: https://github.com/apache/hudi/pull/10714#issuecomment-1955917467 I don't know exactly, I'm not the reporter of this improvement in ASF (it is Hui An). I suppose his purpose was to extract such a common utility code from HoodieActiveTimeline class to

Re: [PR] [HUDI-6860] Move createNewInstantTime out of HoodieActiveTimeline [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10714: URL: https://github.com/apache/hudi/pull/10714#issuecomment-1955911711 ## CI report: * 237c5cdda3c87fec59247bb403c202d39bc5cdca Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [I] [SUPPORT]compaction plan generate too fast [hudi]

2024-02-20 Thread via GitHub
lqbFFF commented on issue #10713: URL: https://github.com/apache/hudi/issues/10713#issuecomment-1955910881 > Hmm, 0.11 is a very old release, can you show some logs in JM to enlight some clues of the scheduling process? I cannot find any related logs and clue about scheduling process

Re: [I] [SUPPORT] Setting hoodie.datasource.insert.dup.policy to drop still upserts the record in 0.14 [hudi]

2024-02-20 Thread via GitHub
ad1happy2go commented on issue #10650: URL: https://github.com/apache/hudi/issues/10650#issuecomment-1955898747 Thanks @jmnatzaganian . We were made aware of that recently and we are working on document update. For datasource writer we still need to use the old config and this only works fo

[jira] [Updated] (HUDI-7432) Fix excessive object creation in KeyGenUtils

2024-02-20 Thread Vova Kolmakov (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vova Kolmakov updated HUDI-7432: Status: In Progress (was: Open) > Fix excessive object creation in KeyGenUtils > --

[jira] [Created] (HUDI-7432) Fix excessive object creation in KeyGenUtils

2024-02-20 Thread Vova Kolmakov (Jira)
Vova Kolmakov created HUDI-7432: --- Summary: Fix excessive object creation in KeyGenUtils Key: HUDI-7432 URL: https://issues.apache.org/jira/browse/HUDI-7432 Project: Apache Hudi Issue Type: Impr

Re: [I] [SUPPORT]compaction plan generate too fast [hudi]

2024-02-20 Thread via GitHub
danny0405 commented on issue #10713: URL: https://github.com/apache/hudi/issues/10713#issuecomment-1955867139 Hmm, 0.11 is a very old release, can you show some logs in JM to enlight some clues of the scheduling process? -- This is an automated message from the Apache Git Service. To resp

[jira] [Updated] (HUDI-6774) Prefix HiveConf properties to Hoodie catalog properties map with '.hadoop'

2024-02-20 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-6774: - Fix Version/s: 0.14.2 > Prefix HiveConf properties to Hoodie catalog properties map with '.hadoop' > -

[jira] [Resolved] (HUDI-6774) Prefix HiveConf properties to Hoodie catalog properties map with '.hadoop'

2024-02-20 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-6774. -- > Prefix HiveConf properties to Hoodie catalog properties map with '.hadoop' > -

[jira] [Closed] (HUDI-6774) Prefix HiveConf properties to Hoodie catalog properties map with '.hadoop'

2024-02-20 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-6774. Resolution: Fixed Fixed via master branch: be0a852dab0e2865ac3119a9eb190ab9d2d76ec8 > Prefix HiveConf prope

(hudi) branch master updated: [HUDI-6774] Prefix HiveConf properties to Hoodie catalog properties map with '.hadoop' (#10686)

2024-02-20 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new be0a852dab0 [HUDI-6774] Prefix HiveConf propert

Re: [PR] [HUDI-6774] Prefix HiveConf props to Hoodie catalog properties map with '.hadoop' [hudi]

2024-02-20 Thread via GitHub
danny0405 merged PR #10686: URL: https://github.com/apache/hudi/pull/10686 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

Re: [PR] [HUDI-6860] Move createNewInstantTime out of HoodieActiveTimeline [hudi]

2024-02-20 Thread via GitHub
danny0405 commented on PR #10714: URL: https://github.com/apache/hudi/pull/10714#issuecomment-1955860732 Can you elaborate what is the purpose of this change? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955858682 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * 968e1ad1799199c24368f05b01762a5d3b8f7629 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-20 Thread via GitHub
vinishjail97 commented on code in PR #10687: URL: https://github.com/apache/hudi/pull/10687#discussion_r1496874320 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/streamer/StreamProfileSupplier.java: ## @@ -0,0 +1,28 @@ +/* + * Licensed to the Apache Software Foundatio

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955815105 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * 968e1ad1799199c24368f05b01762a5d3b8f7629 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6860] Move createNewInstantTime out of HoodieActiveTimeline [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10714: URL: https://github.com/apache/hudi/pull/10714#issuecomment-1955757868 ## CI report: * 8acd57d8a2270eda79fb69b3219d2861e1b96f30 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1955751559 ## CI report: * 0fcfe358f651975c5276f7030ebb81b0011e5d5f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-6860] Move createNewInstantTime out of HoodieActiveTimeline [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10714: URL: https://github.com/apache/hudi/pull/10714#issuecomment-1955751508 ## CI report: * 8acd57d8a2270eda79fb69b3219d2861e1b96f30 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

[PR] initial commit for hudi blogs [hudi]

2024-02-20 Thread via GitHub
nfarah86 opened a new pull request, #10719: URL: https://github.com/apache/hudi/pull/10719 updated hudi blogs for new content @bhasudha @xushiyan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955745031 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * 968e1ad1799199c24368f05b01762a5d3b8f7629 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [MINOR] StreamerUtil prints wrong table path [hudi]

2024-02-20 Thread via GitHub
zhuanshenbsj1 commented on code in PR #10706: URL: https://github.com/apache/hudi/pull/10706#discussion_r1496780492 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/util/StreamerUtil.java: ## @@ -265,7 +265,7 @@ public static HoodieTableMetaClient initTableIfNot

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1955685207 ## CI report: * 0fcfe358f651975c5276f7030ebb81b0011e5d5f Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1955662561 ## CI report: * 0fcfe358f651975c5276f7030ebb81b0011e5d5f UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955661512 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * a285a0602386d073e4893d975d19d2e6bce41193 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7431] Add replication and block size to StoragePathInfo to be backwards compatible [hudi]

2024-02-20 Thread via GitHub
vinothchandar commented on code in PR #10717: URL: https://github.com/apache/hudi/pull/10717#discussion_r1496747921 ## hudi-io/src/main/java/org/apache/hudi/storage/StoragePathInfo.java: ## @@ -35,15 +35,23 @@ public class StoragePathInfo implements Serializable { private fin

Re: [PR] [HUDI-7431] Add replication and block size to StoragePathInfo to be backwards compatible [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10717: URL: https://github.com/apache/hudi/pull/10717#issuecomment-1955641683 ## CI report: * 8fa6388dcc89a6cafabfe00fe350d5af9b823ed3 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955640736 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * a285a0602386d073e4893d975d19d2e6bce41193 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-9424]Support using local timezone when writing flink TIMESTAMP data [hudi]

2024-02-20 Thread via GitHub
cmmp6 commented on code in PR #10594: URL: https://github.com/apache/hudi/pull/10594#discussion_r1495303542 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/utils/TestRowDataToAvroConverters.java: ## @@ -0,0 +1,124 @@ +/* + * Licensed to the Apache Software Foun

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-20 Thread via GitHub
rmahindra123 commented on code in PR #10687: URL: https://github.com/apache/hudi/pull/10687#discussion_r1496727416 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/streamer/StreamContext.java: ## @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-20 Thread via GitHub
linliu-code opened a new pull request, #10718: URL: https://github.com/apache/hudi/pull/10718 ### Change Logs The input schema string is empty. We try to get it form schema file or instants. ### Impact Fixed the bug. ### Risk level (write none, low medium or hig

[jira] [Updated] (HUDI-7430) Fix the empty schema for compactor

2024-02-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7430: - Labels: pull-request-available (was: ) > Fix the empty schema for compactor > ---

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955513830 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * f2d9666248d466987c440f56188042d5bf4292a4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955491868 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * f2d9666248d466987c440f56188042d5bf4292a4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955469039 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * d43032d71a56ba1ccae0592943671f72b4d73229 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7431] Add replication and block size to StoragePathInfo to be backwards compatible [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10717: URL: https://github.com/apache/hudi/pull/10717#issuecomment-1955342428 ## CI report: * 8fa6388dcc89a6cafabfe00fe350d5af9b823ed3 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7431] Add replication and block size to StoragePathInfo to be backwards compatible [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10717: URL: https://github.com/apache/hudi/pull/10717#issuecomment-1955321110 ## CI report: * 8fa6388dcc89a6cafabfe00fe350d5af9b823ed3 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [HUDI-7406] Rename classes to be readable in storage abstraction [hudi]

2024-02-20 Thread via GitHub
yihua commented on code in PR #10672: URL: https://github.com/apache/hudi/pull/10672#discussion_r1496650067 ## hudi-hadoop-common/src/main/java/org/apache/hudi/storage/hadoop/HoodieHadoopStorage.java: ## @@ -60,108 +60,108 @@ public URI getUri() { } @Override - public O

[jira] [Updated] (HUDI-7431) Add replication and block size to StoragePathInfo to be backwards compatible

2024-02-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7431: - Labels: pull-request-available (was: ) > Add replication and block size to StoragePathInfo to be

[PR] [HUDI-7431] Add replication and block size to StoragePathInfo to be backwards compatible [hudi]

2024-02-20 Thread via GitHub
yihua opened a new pull request, #10717: URL: https://github.com/apache/hudi/pull/10717 ### Change Logs This PR adds the replication and block size information to `StoragePathInfo` so that it is backward compatible for generating `FileStatus` and Hive's `FileInputFormat` to properly

[I] [SUPPORT] Spark Write into MoR type hudi table small parquets issue + Athena Internal Error [hudi]

2024-02-20 Thread via GitHub
yizhenglu opened a new issue, #10716: URL: https://github.com/apache/hudi/issues/10716 **_Tips before filing an issue_** **Describe the problem you faced** Background: Currently, I have around 100 mb data for each day (batch process), so I am using the delete operation with

Re: [I] [SUPPORT] Process killed with no additional info when loading large parquet files in Spark [hudi]

2024-02-20 Thread via GitHub
alberttwong commented on issue #10697: URL: https://github.com/apache/hudi/issues/10697#issuecomment-1955184344 I have confirmed that ingestion of the 1.1 GB file works fine on onehouse.ai. -- This is an automated message from the Apache Git Service. To respond to the message, please log o

[jira] [Updated] (HUDI-7431) Add replication and block size to StoragePathInfo to be backwards compatible

2024-02-20 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7431: Fix Version/s: 0.15.0 1.0.0 > Add replication and block size to StoragePathInfo to be bac

[jira] [Assigned] (HUDI-7431) Add replication and block size to StoragePathInfo to be backwards compatible

2024-02-20 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7431: --- Assignee: Ethan Guo > Add replication and block size to StoragePathInfo to be backwards compatible >

[jira] [Created] (HUDI-7431) Add replication and block size to StoragePathInfo to be backwards compatible

2024-02-20 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7431: --- Summary: Add replication and block size to StoragePathInfo to be backwards compatible Key: HUDI-7431 URL: https://issues.apache.org/jira/browse/HUDI-7431 Project: Apache Hudi

Re: [PR] [MINOR] Enable code coverage in Azure [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10646: URL: https://github.com/apache/hudi/pull/10646#issuecomment-1955144011 ## CI report: * 8c2f326afe803101c4314d9637f756392bc231d4 UNKNOWN * b4700ebffe772028ab1bf66484efef6e77087a8c UNKNOWN * e33deb49ab93589767822e842fe2a6f0106785b3 Azure: [SUCC

Re: [PR] [MINOR] Enable code coverage in Azure [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10646: URL: https://github.com/apache/hudi/pull/10646#issuecomment-1955032606 ## CI report: * 8c2f326afe803101c4314d9637f756392bc231d4 UNKNOWN * b4700ebffe772028ab1bf66484efef6e77087a8c UNKNOWN * fda9e0a5e5403efef42bccbf4d0e9c987cd00f9c Azure: [SUCC

[jira] [Created] (HUDI-7430) Fix the empty schema for compactor

2024-02-20 Thread Lin Liu (Jira)
Lin Liu created HUDI-7430: - Summary: Fix the empty schema for compactor Key: HUDI-7430 URL: https://issues.apache.org/jira/browse/HUDI-7430 Project: Apache Hudi Issue Type: Bug Reporter:

Re: [PR] [MINOR] Enable code coverage in Azure [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10646: URL: https://github.com/apache/hudi/pull/10646#issuecomment-1955009767 ## CI report: * 8c2f326afe803101c4314d9637f756392bc231d4 UNKNOWN * b4700ebffe772028ab1bf66484efef6e77087a8c UNKNOWN * fda9e0a5e5403efef42bccbf4d0e9c987cd00f9c Azure: [SUCC

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1955009285 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * d43032d71a56ba1ccae0592943671f72b4d73229 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1954986497 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * 726f5c1bffbfd0e4710b83515b353e572aa3f7aa Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [MINOR] Enable code coverage in Azure [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10646: URL: https://github.com/apache/hudi/pull/10646#issuecomment-1954910568 ## CI report: * 8c2f326afe803101c4314d9637f756392bc231d4 UNKNOWN * b4700ebffe772028ab1bf66484efef6e77087a8c UNKNOWN * fda9e0a5e5403efef42bccbf4d0e9c987cd00f9c Azure: [SUCC

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1954910294 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * 12afa66ed0b62dff350a10e46e1bc8dee88acb4e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6497] Replace FileSystem, Path, and FileStatus usage in `hudi-common` module [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10591: URL: https://github.com/apache/hudi/pull/10591#issuecomment-1954899157 ## CI report: * 8207558e8c8714386cf2f71929d6fb08db10617b UNKNOWN * 12afa66ed0b62dff350a10e46e1bc8dee88acb4e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [MINOR] Enable code coverage in Azure [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10646: URL: https://github.com/apache/hudi/pull/10646#issuecomment-1954818473 ## CI report: * 8c2f326afe803101c4314d9637f756392bc231d4 UNKNOWN * b4700ebffe772028ab1bf66484efef6e77087a8c UNKNOWN * 9159eb2301cc84a033bba51885d39a82aa07237c Azure: [SUCC

Re: [PR] [MINOR] Enable code coverage in Azure [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10646: URL: https://github.com/apache/hudi/pull/10646#issuecomment-1954804494 ## CI report: * 8c2f326afe803101c4314d9637f756392bc231d4 UNKNOWN * b4700ebffe772028ab1bf66484efef6e77087a8c UNKNOWN * 9159eb2301cc84a033bba51885d39a82aa07237c Azure: [SUCC

Re: [I] [SUPPORT] Setting hoodie.datasource.insert.dup.policy to drop still upserts the record in 0.14 [hudi]

2024-02-20 Thread via GitHub
jmnatzaganian commented on issue #10650: URL: https://github.com/apache/hudi/issues/10650#issuecomment-1954799508 Despite the initial report being with `upsert`, I can confirm that the new `hoodie.datasource.insert.dup.policy` option does not drop dupes as expected with the `insert` write o

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-20 Thread via GitHub
yihua commented on code in PR #10687: URL: https://github.com/apache/hudi/pull/10687#discussion_r1496284240 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/streamer/StreamSync.java: ## @@ -285,7 +285,7 @@ public StreamSync(HoodieStreamer.Config cfg, SparkSession spark

Re: [PR] [HUDI-7384] [HUDI-7405] [secondary-index] Secondary index support [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10625: URL: https://github.com/apache/hudi/pull/10625#issuecomment-1954792878 ## CI report: * 1a69c6a5dec0d8ba1e0d465eda0cf4e488a22b30 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-20 Thread via GitHub
yihua commented on code in PR #10687: URL: https://github.com/apache/hudi/pull/10687#discussion_r1496269281 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/streamer/StreamProfileSupplier.java: ## @@ -0,0 +1,28 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

(hudi) branch master updated: [HUDI-7424] Throw conversion error of Avro record properly for error table (#10705)

2024-02-20 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 655298911f0 [HUDI-7424] Throw conversion error of A

Re: [PR] [HUDI-7424] Throw conversion error of Avro record properly for error table [hudi]

2024-02-20 Thread via GitHub
yihua merged PR #10705: URL: https://github.com/apache/hudi/pull/10705 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.o

Re: [PR] [HUDI-7373] Make schema evolution doc and config correct [hudi]

2024-02-20 Thread via GitHub
jonvex merged PR #10612: URL: https://github.com/apache/hudi/pull/10612 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

Re: [PR] [HUDI-7384] [HUDI-7405] [secondary-index] Secondary index support [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10625: URL: https://github.com/apache/hudi/pull/10625#issuecomment-1954586379 ## CI report: * 804d73922a136f6fed0fdcc559bfb697bda4942e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

(hudi) branch asf-site updated: make docs correct (#10612)

2024-02-20 Thread jonvex
This is an automated email from the ASF dual-hosted git repository. jonvex pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 32830e351ad make docs correct (#10612) 32830e3

Re: [PR] [HUDI-7384] [HUDI-7405] [secondary-index] Secondary index support [hudi]

2024-02-20 Thread via GitHub
hudi-bot commented on PR #10625: URL: https://github.com/apache/hudi/pull/10625#issuecomment-1954569975 ## CI report: * 804d73922a136f6fed0fdcc559bfb697bda4942e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [WIP] Secondary index read [hudi]

2024-02-20 Thread via GitHub
bhat-vinay closed pull request #10657: [WIP] Secondary index read URL: https://github.com/apache/hudi/pull/10657 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

Re: [PR] [WIP] Secondary index read [hudi]

2024-02-20 Thread via GitHub
bhat-vinay commented on PR #10657: URL: https://github.com/apache/hudi/pull/10657#issuecomment-1954533765 Closing this in favor of the merged PR (https://github.com/apache/hudi/pull/10625) having both reader and writer support -- This is an automated message from the Apache Git Service.

Re: [PR] [HUDI-7373] Make schema evolution doc and config correct [hudi]

2024-02-20 Thread via GitHub
jonvex commented on code in PR #10612: URL: https://github.com/apache/hudi/pull/10612#discussion_r1496076240 ## website/docs/configurations.md: ## @@ -127,59 +127,59 @@ Options useful for writing tables via `write.format.option(...)` [**Advanced Configs**](#Write-Options-advan

Re: [PR] [WIP] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-02-20 Thread via GitHub
jonvex commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1954508175 > Overall looks good to me. @jonvex : What Hive versions are we targeting/testing ? @bvaradar I used the docker demo to test. I think that is using Hive 2. We would like this to repl

Re: [PR] [WIP] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-02-20 Thread via GitHub
jonvex commented on code in PR #10422: URL: https://github.com/apache/hudi/pull/10422#discussion_r1496057861 ## hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HiveHoodieReaderContext.java: ## @@ -0,0 +1,259 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under o

Re: [PR] [WIP] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-02-20 Thread via GitHub
jonvex commented on code in PR #10422: URL: https://github.com/apache/hudi/pull/10422#discussion_r1496053647 ## hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HiveHoodieReaderContext.java: ## @@ -0,0 +1,259 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under o

Re: [PR] [WIP] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-02-20 Thread via GitHub
jonvex commented on code in PR #10422: URL: https://github.com/apache/hudi/pull/10422#discussion_r1496049414 ## hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HiveHoodieReaderContext.java: ## @@ -0,0 +1,259 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under o

  1   2   >