Re: [PR] [HUDI-7438][Test][DNM] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10743: URL: https://github.com/apache/hudi/pull/10743#issuecomment-1962288861 ## CI report: * 0b1be7eef7f597cc3cb8899160700b38601a5c4d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438][DNM] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962288844 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * 0b1be7eef7f597cc3cb8899160700b38601a5c4d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

(hudi) branch HUDI-7438-fix-issue-comment-processing updated (e3182b43f7a -> 69086bc3a84)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch HUDI-7438-fix-issue-comment-processing in repository https://gitbox.apache.org/repos/asf/hudi.git discard e3182b43f7a [HUDI-7438] Fix Azure CI report check with new issue comments add 69086

Re: [PR] [MINOR][TESTING] Test PR [hudi]

2024-02-23 Thread via GitHub
yihua closed pull request #10737: [MINOR][TESTING] Test PR URL: https://github.com/apache/hudi/pull/10737 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1962287364 ## CI report: * dbe9cea4f203fe6f056b1f1e1f639e7ad775736c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438][Test] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10743: URL: https://github.com/apache/hudi/pull/10743#issuecomment-1962287404 ## CI report: * 0b1be7eef7f597cc3cb8899160700b38601a5c4d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

(hudi) branch HUDI-7438-fix-issue-comment-processing updated (3d09199fa55 -> e3182b43f7a)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch HUDI-7438-fix-issue-comment-processing in repository https://gitbox.apache.org/repos/asf/hudi.git discard 3d09199fa55 [HUDI-7438] Fix Azure CI report check with new issue comments add e3182

Re: [PR] [HUDI-7438][Test] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10743: URL: https://github.com/apache/hudi/pull/10743#issuecomment-1962286022 ## CI report: * 0b1be7eef7f597cc3cb8899160700b38601a5c4d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962286011 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * 1734e2500da3e94e7bf3bd2740f9eed513e4b566 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1962285984 ## CI report: * dbe9cea4f203fe6f056b1f1e1f639e7ad775736c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

(hudi) branch HUDI-7438-fix-issue-comment-processing updated (98efc813fec -> 3d09199fa55)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch HUDI-7438-fix-issue-comment-processing in repository https://gitbox.apache.org/repos/asf/hudi.git discard 98efc813fec [HUDI-7438] Fix Azure CI report check with new issue comments add 3d091

(hudi) branch HUDI-7438-fix-issue-comment-processing updated (0b1be7eef7f -> 98efc813fec)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch HUDI-7438-fix-issue-comment-processing in repository https://gitbox.apache.org/repos/asf/hudi.git omit 0b1be7eef7f [HUDI-7438] Fix Azure CI report check with new issue comments add 98efc

Re: [PR] [HUDI-7438][Test] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10743: URL: https://github.com/apache/hudi/pull/10743#issuecomment-1962277214 ## CI report: * 0b1be7eef7f597cc3cb8899160700b38601a5c4d UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [MINOR] Add permissions to the PR size labeler [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10478: URL: https://github.com/apache/hudi/pull/10478#issuecomment-1962277121 ## CI report: * 3acf3f7f5de88cc1c770644a3a04de93742a1fd9 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [MINOR] Add permissions to the PR size labeler [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10478: URL: https://github.com/apache/hudi/pull/10478#issuecomment-1962275778 ## CI report: * 3acf3f7f5de88cc1c770644a3a04de93742a1fd9 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

[PR] [HUDI-7438][Test] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
yihua opened a new pull request, #10743: URL: https://github.com/apache/hudi/pull/10743 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performance

(hudi) branch HUDI-7438-fix-issue-comment-processing created (now 0b1be7eef7f)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch HUDI-7438-fix-issue-comment-processing in repository https://gitbox.apache.org/repos/asf/hudi.git at 0b1be7eef7f [HUDI-7438] Fix Azure CI report check with new issue comments No new revisi

(hudi) branch fix-size-labeler deleted (was 0b90ccf97e8)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch fix-size-labeler in repository https://gitbox.apache.org/repos/asf/hudi.git was 0b90ccf97e8 Add permissions to the PR size labeler The revisions that were on this branch are still contained

Re: [PR] [MINOR][Test] Add permissions to the PR size labeler [hudi]

2024-02-23 Thread via GitHub
yihua closed pull request #10742: [MINOR][Test] Add permissions to the PR size labeler URL: https://github.com/apache/hudi/pull/10742 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

(hudi) branch fix-size-labeler updated (39975ef29a2 -> 0b90ccf97e8)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch fix-size-labeler in repository https://gitbox.apache.org/repos/asf/hudi.git omit 39975ef29a2 Add permissions to the PR size labeler add 0b90ccf97e8 Add permissions to the PR size labeler

[PR] [MINOR][Test] Add permissions to the PR size labeler [hudi]

2024-02-23 Thread via GitHub
yihua opened a new pull request, #10742: URL: https://github.com/apache/hudi/pull/10742 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performance

(hudi) branch fix-size-labeler created (now 39975ef29a2)

2024-02-23 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch fix-size-labeler in repository https://gitbox.apache.org/repos/asf/hudi.git at 39975ef29a2 Add permissions to the PR size labeler No new revisions were added by this update.

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962266190 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * 392c0624e5b0e9ab8883781d0e7ef4c11dc87319 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962264492 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * 392c0624e5b0e9ab8883781d0e7ef4c11dc87319 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962263014 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * 392c0624e5b0e9ab8883781d0e7ef4c11dc87319 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [I] [SUPPORT] org.apache.avro.SchemaParseException: Can't redefine: array When there are Top level variables , Struct and Array[struct] (no complex datatype within array[struct]) [hudi]

2024-02-23 Thread via GitHub
Jonathanrodrigr12 commented on issue #7717: URL: https://github.com/apache/hudi/issues/7717#issuecomment-1962262653 Hi, i have the same problem but i am use the HoodieMultiTableStreamer **Description** I have a lot parquet files, all of them have this struct ![image](https://github

Re: [PR] [HUDI-4444] Refactor DataSourceInternalWriterHelper [hudi]

2024-02-23 Thread via GitHub
wombatu-kun commented on code in PR #10715: URL: https://github.com/apache/hudi/pull/10715#discussion_r1501346077 ## hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/internal/DataSourceInternalWriterHelper.java: ## @@ -66,13 +66,11 @@ public DataSourceIntern

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
wombatu-kun commented on code in PR #10728: URL: https://github.com/apache/hudi/pull/10728#discussion_r1501343613 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java: ## @@ -562,7 +562,7 @@ public class HoodieWriteConfig extends HoodieCo

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
wombatu-kun commented on code in PR #10728: URL: https://github.com/apache/hudi/pull/10728#discussion_r1501343700 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java: ## @@ -562,7 +562,7 @@ public class HoodieWriteConfig extends HoodieCo

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
wombatu-kun commented on code in PR #10728: URL: https://github.com/apache/hudi/pull/10728#discussion_r1501343613 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java: ## @@ -562,7 +562,7 @@ public class HoodieWriteConfig extends HoodieCo

[jira] [Closed] (HUDI-7433) Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic

2024-02-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-7433. Resolution: Fixed Fixed via master branch: 22e2063261ceded17a12d5443ca58910bd6a471b > Fix a bug in the Hood

(hudi) branch master updated (b8b6917f8b0 -> 22e2063261c)

2024-02-23 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from b8b6917f8b0 [HUDI-7440] Verify field exist in schema before fetching the value (#10733) add 22e2063261c [HUDI-7

Re: [PR] Fix a bug in the HoodieBaseListData.isEmpty() empty-check logic [hudi]

2024-02-23 Thread via GitHub
danny0405 merged PR #10722: URL: https://github.com/apache/hudi/pull/10722 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

Re: [I] Bugs about the hudi table created by hive catalog and wrong results when querying RO table [hudi]

2024-02-23 Thread via GitHub
danny0405 commented on issue #10735: URL: https://github.com/apache/hudi/issues/10735#issuecomment-1962251473 We should not use Hive catalog, that's why we introduce a `HoodieHiveCatalog` where we do many tasks for `createTable`. -- This is an automated message from the Apache Git Service

[jira] [Closed] (HUDI-7440) Verify field exist in schema before fetching the value

2024-02-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-7440. Resolution: Fixed Fixed via master branch: b8b6917f8b0ba0d8b3b3034a275aa1f0947be954 > Verify field exist in

[jira] [Updated] (HUDI-7440) Verify field exist in schema before fetching the value

2024-02-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7440: - Labels: pull-request-available (was: ) > Verify field exist in schema before fetching the value >

(hudi) branch master updated (cddd7d416a5 -> b8b6917f8b0)

2024-02-23 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from cddd7d416a5 [HUDI-7275] Separate use of HoodieTimelineTimeZone.UTC and LOCAL in tests to prevent infinite loops (#10

Re: [PR] [HUDI-7440] Verify field exist in schema before fetching the value [hudi]

2024-02-23 Thread via GitHub
danny0405 merged PR #10733: URL: https://github.com/apache/hudi/pull/10733 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

[jira] [Created] (HUDI-7440) Verify field exist in schema before fetching the value

2024-02-23 Thread Danny Chen (Jira)
Danny Chen created HUDI-7440: Summary: Verify field exist in schema before fetching the value Key: HUDI-7440 URL: https://issues.apache.org/jira/browse/HUDI-7440 Project: Apache Hudi Issue Type:

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
danny0405 commented on code in PR #10728: URL: https://github.com/apache/hudi/pull/10728#discussion_r1501342711 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java: ## @@ -562,7 +562,7 @@ public class HoodieWriteConfig extends HoodieConf

[jira] [Closed] (HUDI-7275) org.apache.hudi.TestHoodieSparkSqlWriter#testInsertDatasetWithTimelineTimezoneUTC causes issues with following tests

2024-02-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-7275. Resolution: Fixed Fixed via master branch: cddd7d416a5db31de879790a80a33bb86cf02cbc > org.apache.hudi.TestH

[jira] [Updated] (HUDI-7275) org.apache.hudi.TestHoodieSparkSqlWriter#testInsertDatasetWithTimelineTimezoneUTC causes issues with following tests

2024-02-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-7275: - Fix Version/s: 0.14.2 > org.apache.hudi.TestHoodieSparkSqlWriter#testInsertDatasetWithTimelineTimezoneUTC

(hudi) branch master updated (6f74c7f6ec6 -> cddd7d416a5)

2024-02-23 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 6f74c7f6ec6 [HUDI-7438] Add GitHub action to check Azure CI report (#10731) add cddd7d416a5 [HUDI-7275] Separat

Re: [PR] [HUDI-7275] Separate use of HoodieTimelineTimeZone.UTC and LOCAL in tests to prevent infinite loops [hudi]

2024-02-23 Thread via GitHub
danny0405 merged PR #10738: URL: https://github.com/apache/hudi/pull/10738 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

Re: [PR] [HUDI-4444] Refactor DataSourceInternalWriterHelper [hudi]

2024-02-23 Thread via GitHub
danny0405 commented on code in PR #10715: URL: https://github.com/apache/hudi/pull/10715#discussion_r1501342066 ## hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/internal/DataSourceInternalWriterHelper.java: ## @@ -66,13 +66,11 @@ public DataSourceInternal

(hudi) branch asf-site updated: [HUDI-6089][DOCS] update default value of hoodie.merge.allow.duplicate.on.inserts to true (#10739)

2024-02-23 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 3235c366f84 [HUDI-6089][DOCS] update defaul

Re: [PR] [HUDI-6089][DOCS] update default value of hoodie.merge.allow.duplicate.on.inserts to true [hudi]

2024-02-23 Thread via GitHub
danny0405 merged PR #10739: URL: https://github.com/apache/hudi/pull/10739 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

Re: [I] [SUPPORT] Hive SYNC TOOL on EMR failed, Exception in thread main java.ang.NoClassDefFoundError: com/fasterxml/... [hudi]

2024-02-23 Thread via GitHub
danny0405 commented on issue #10741: URL: https://github.com/apache/hudi/issues/10741#issuecomment-1962247269 Looks like a jackson jar conflict. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to th

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1962220086 ## CI report: * 82ab33600666ccd65fd4f963277e71ff2b8c7726 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962203426 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * 392c0624e5b0e9ab8883781d0e7ef4c11dc87319 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962200894 ## CI report: * 040e2c89e131b994c2a0b7875e512ab992b3c547 UNKNOWN * b6a1c7b7b8ba77121c972e80bf602de239fa9138 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1962200833 ## CI report: * 892125e6b08cf7629cc4f9a586809f093673d0b4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1962178951 ## CI report: * 892125e6b08cf7629cc4f9a586809f093673d0b4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1962172458 ## CI report: * 892125e6b08cf7629cc4f9a586809f093673d0b4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

(hudi) branch asf-site updated: updated delete to mention duplicates- and did some writing cleanup (#10659)

2024-02-23 Thread bhavanisudha
This is an automated email from the ASF dual-hosted git repository. bhavanisudha pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new eb6a998fd85 updated delete to mention du

Re: [PR] updated delete to mention duplicates- and did some writing cleanup [hudi]

2024-02-23 Thread via GitHub
bhasudha merged PR #10659: URL: https://github.com/apache/hudi/pull/10659 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apach

Re: [I] [SUPPORT] Can process parquet file if using upsert or bulk_insert but cannot process parquet file if using insert [hudi]

2024-02-23 Thread via GitHub
soumilshah1995 commented on issue #10725: URL: https://github.com/apache/hudi/issues/10725#issuecomment-1962150892 Roger that -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962138653 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962134737 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962130304 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962093422 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1962093263 ## CI report: * bdf483d0c96502fc888e7dc7f2fe087f7643ecb6 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962087323 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1962087189 ## CI report: * bdf483d0c96502fc888e7dc7f2fe087f7643ecb6 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962081364 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [I] [SUPPORT] Spark Write into MoR type hudi table small parquets issue + Athena Internal Error [hudi]

2024-02-23 Thread via GitHub
huliwuli commented on issue #10716: URL: https://github.com/apache/hudi/issues/10716#issuecomment-1962059629 **Regarding Athena Issue:** Due to the small size of parquets, I implemented clustering (inline) with max commits =1 for test. Athena Raises Error: Generic_INTERNAL_ERROR

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
yihua commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1962035139 test comment -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubsc

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1962009037 ## CI report: * dbe9cea4f203fe6f056b1f1e1f639e7ad775736c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

[I] [SUPPORT] Hive SYNC TOOL on EMR failed, Exception in thread main java.ang.NoClassDefFoundError: com/fasterxml/... [hudi]

2024-02-23 Thread via GitHub
huliwuli opened a new issue, #10741: URL: https://github.com/apache/hudi/issues/10741 Tips before filing an issue Describe the problem you faced Did Async Clustering on EMR 6.14 and Hive on Athena did not sync the latest commit after clustering? I want to use the hive sync tool

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1961961656 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1961953455 ## CI report: * 988039cbb47927ba6f0ef3e0c2f77e0736d3cc36 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1961941627 ## CI report: * 2b66c852e373113c8bd1bd66bd0376a8f537044e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1961941516 ## CI report: * bdf483d0c96502fc888e7dc7f2fe087f7643ecb6 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
yihua commented on PR #10740: URL: https://github.com/apache/hudi/pull/10740#issuecomment-1961919983 New comment -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscr

[PR] [HUDI-7438] Fix Azure CI report check with new issue comments [hudi]

2024-02-23 Thread via GitHub
yihua opened a new pull request, #10740: URL: https://github.com/apache/hudi/pull/10740 ### Change Logs This PR fixes Azure CI report check with new issue comments. ### Impact Fixes bug and improves Azure CI check. ### Risk level none ### Documentatio

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-196125 ## CI report: * 2b66c852e373113c8bd1bd66bd0376a8f537044e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1961878815 ## CI report: * a592ad1361583635a55ead7b634d2b20a92c239f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1961869252 ## CI report: * d0faf8850bf513fb1d610831b3459680c244 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1961816513 ## CI report: * 0fcfe358f651975c5276f7030ebb81b0011e5d5f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1961815729 ## CI report: * d0faf8850bf513fb1d610831b3459680c244 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7430] Fix empty schema issue for compactor [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10718: URL: https://github.com/apache/hudi/pull/10718#issuecomment-1961806589 ## CI report: * 0fcfe358f651975c5276f7030ebb81b0011e5d5f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1961715953 ## CI report: * 2b66c852e373113c8bd1bd66bd0376a8f537044e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1961554948 ## CI report: * 6c41fe5e29de60a4d33701ebfd4aefc898cc605f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1961542149 ## CI report: * 6c41fe5e29de60a4d33701ebfd4aefc898cc605f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [I] [SUPPORT] Spark Write into MoR type hudi table small parquets issue + Athena Internal Error [hudi]

2024-02-23 Thread via GitHub
huliwuli commented on issue #10716: URL: https://github.com/apache/hudi/issues/10716#issuecomment-1961532330 @ad1happy2go Thanks for the reply. I used insert operation. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[jira] [Updated] (HUDI-7439) Remove redundant logs from hive server from Azure CI 4th module

2024-02-23 Thread Lin Liu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lin Liu updated HUDI-7439: -- Description: When there is an error or failure, we can see at the bottom there are some redundant logs from hiv

[jira] [Created] (HUDI-7439) Remove logs from hive server from Azure CI 4th module

2024-02-23 Thread Lin Liu (Jira)
Lin Liu created HUDI-7439: - Summary: Remove logs from hive server from Azure CI 4th module Key: HUDI-7439 URL: https://issues.apache.org/jira/browse/HUDI-7439 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-7439) Remove redundant logs from hive server from Azure CI 4th module

2024-02-23 Thread Lin Liu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lin Liu updated HUDI-7439: -- Summary: Remove redundant logs from hive server from Azure CI 4th module (was: Remove logs from hive server fro

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10728: URL: https://github.com/apache/hudi/pull/10728#issuecomment-1961420614 ## CI report: * 22f875240369cab37842e58c9d504873643f10e1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

[PR] [HUDI-6089][DOCS] update default value of hoodie.merge.allow.duplicate.on.inserts to true [hudi]

2024-02-23 Thread via GitHub
wombatu-kun opened a new pull request, #10739: URL: https://github.com/apache/hudi/pull/10739 ### Change Logs Update documentation: update default value of hoodie.merge.allow.duplicate.on.inserts to true ### Impact none ### Risk level (write none, low medium or hi

Re: [PR] [HUDI-4444] Refactor DataSourceInternalWriterHelper [hudi]

2024-02-23 Thread via GitHub
wombatu-kun commented on code in PR #10715: URL: https://github.com/apache/hudi/pull/10715#discussion_r1500706025 ## hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/internal/DataSourceInternalWriterHelper.java: ## @@ -66,13 +66,11 @@ public DataSourceIntern

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10728: URL: https://github.com/apache/hudi/pull/10728#issuecomment-1961258025 ## CI report: * 6348547bbb296493bb2d137a9764199117784e10 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-6089] Handle default insert behaviour to ingest duplicates [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10728: URL: https://github.com/apache/hudi/pull/10728#issuecomment-1961247518 ## CI report: * 6348547bbb296493bb2d137a9764199117784e10 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7275] Separate use of HoodieTimelineTimeZone.UTC and LOCAL in tests to prevent infinite loops [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10738: URL: https://github.com/apache/hudi/pull/10738#issuecomment-1961075063 ## CI report: * 18435cdea361b920b8ff01e4ded0143d94c6d6f5 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [MINOR][TESTING] Test PR [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10737: URL: https://github.com/apache/hudi/pull/10737#issuecomment-1960909243 ## CI report: * 0f4271e8c543fd2cda736a2af5c9356533c48cba Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22

Re: [PR] [HUDI-7416] Add interface for StreamProfile to be used in StreamSync for reading and writing data [hudi]

2024-02-23 Thread via GitHub
hudi-bot commented on PR #10736: URL: https://github.com/apache/hudi/pull/10736#issuecomment-1960909193 ## CI report: * 6c41fe5e29de60a4d33701ebfd4aefc898cc605f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22