Re: [PR] [HUDI-7146] Implement secondary index write path [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11146: URL: https://github.com/apache/hudi/pull/11146#issuecomment-2097485263 ## CI report: * 1468a9a72c2cfdda7dae6bb62ac14551f600d8df Azure:

Re: [PR] [HUDI-7652] Add new `HoodieMergeKey` API to support simple and composite keys [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11077: URL: https://github.com/apache/hudi/pull/11077#issuecomment-2097485135 ## CI report: * 0eff97cd517ed728a93eea9e8aaca05e6eb72650 Azure:

Re: [PR] [HUDI-7652] Add new `HoodieMergeKey` API to support simple and composite keys [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11077: URL: https://github.com/apache/hudi/pull/11077#issuecomment-2097476481 ## CI report: * 0eff97cd517ed728a93eea9e8aaca05e6eb72650 Azure:

Re: [PR] [HUDI-7146] Implement secondary index write path [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11146: URL: https://github.com/apache/hudi/pull/11146#issuecomment-2097476674 ## CI report: * 1468a9a72c2cfdda7dae6bb62ac14551f600d8df Azure:

Re: [PR] [HUDI-7587] Make bundle dependencies for storage abstraction in correct order [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11131: URL: https://github.com/apache/hudi/pull/11131#issuecomment-2097467747 ## CI report: * 7c72471a1b9b5ad43ca63ab60da0f3d260f67cea Azure:

Re: [PR] [HUDI-7720] Fix HoodieTableFileSystemView NPE in fetchAllStoredFileGroups [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11161: URL: https://github.com/apache/hudi/pull/11161#issuecomment-2097467988 ## CI report: * 920a8f421b2d6650c8b2451af2038346a2906343 Azure:

Re: [PR] [HUDI-7704] Unify test client storage classes with duplicate code [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11152: URL: https://github.com/apache/hudi/pull/11152#issuecomment-2097467849 ## CI report: * e96cd9ce1f546e881806dbce71ff178ee89bc0f3 Azure:

[jira] [Closed] (HUDI-7721) Fix broken build on master

2024-05-06 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-7721. - Fix Version/s: 1.0.0 Resolution: Fixed > Fix broken build on master > -- >

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on PR #11164: URL: https://github.com/apache/hudi/pull/11164#issuecomment-2097430511 cc @codope to land it first because several travis builds succeed to unblock the master AFAP. -- This is an automated message from the Apache Git Service. To respond to the message,

(hudi) branch master updated (fdb94192508 -> c359ecc971a)

2024-05-06 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from fdb94192508 [HUDI-7715] Partition TTL for Flink (#11156) add c359ecc971a [HUDI-7721] Fix broken build on master

Re: [PR] [HUDI-7720] Fix HoodieTableFileSystemView NPE in fetchAllStoredFileGroups [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11161: URL: https://github.com/apache/hudi/pull/11161#issuecomment-2097430015 ## CI report: * 50008938bd209ca6dedc42f7fa616d7df952df4b Azure:

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
danny0405 merged PR #11164: URL: https://github.com/apache/hudi/pull/11164 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] [HUDI-7720] Fix HoodieTableFileSystemView NPE in fetchAllStoredFileGroups [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on code in PR #11161: URL: https://github.com/apache/hudi/pull/11161#discussion_r1591700013 ## hudi-common/src/main/java/org/apache/hudi/common/table/view/HoodieTableFileSystemView.java: ## @@ -307,6 +307,10 @@ void

Re: [PR] [HUDI-7404] Bloom execution improvements [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #10578: URL: https://github.com/apache/hudi/pull/10578#issuecomment-2097429242 ## CI report: * 86a6e24f202a76c316086b59fc69308c57631b4e UNKNOWN * 68bf61a85db16d50aa0663be7652874baf30489c Azure:

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097428203 Are you enabling the clustering then? The clustering would rewrite all the partitions. > I think increasing the parameters of retention cleanup will probably generate more

[jira] [Updated] (HUDI-7722) Add a GH CI check on the PR branch age

2024-05-06 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7722: Description: We can add a CI check for the branch and make the merging protected based on the CI check, if

[jira] [Updated] (HUDI-7722) Add a GH CI check on the PR branch age

2024-05-06 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7722: Fix Version/s: 1.0.0 > Add a GH CI check on the PR branch age > -- > >

[jira] [Assigned] (HUDI-7722) Add a GH CI check on the PR branch age

2024-05-06 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7722: --- Assignee: Ethan Guo > Add a GH CI check on the PR branch age >

[jira] [Created] (HUDI-7722) Add a GH CI check on the PR branch age

2024-05-06 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7722: --- Summary: Add a GH CI check on the PR branch age Key: HUDI-7722 URL: https://issues.apache.org/jira/browse/HUDI-7722 Project: Apache Hudi Issue Type: Improvement

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11164: URL: https://github.com/apache/hudi/pull/11164#issuecomment-2097423449 ## CI report: * 10292370450b75b41cd16f93072b3357e468ae85 Azure:

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097423416 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN * b7f31f230c12e285cd073bd619bd13f40df75c73 Azure:

Re: [PR] [HUDI-7652] Add new `HoodieMergeKey` API to support simple and composite keys [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11077: URL: https://github.com/apache/hudi/pull/11077#issuecomment-2097423226 ## CI report: * 0eff97cd517ed728a93eea9e8aaca05e6eb72650 Azure:

Re: [PR] [HUDI-7720] Fix HoodieTableFileSystemView NPE in fetchAllStoredFileGroups [hudi]

2024-05-06 Thread via GitHub
xuzifu666 commented on code in PR #11161: URL: https://github.com/apache/hudi/pull/11161#discussion_r1591794126 ## hudi-common/src/main/java/org/apache/hudi/common/table/view/HoodieTableFileSystemView.java: ## @@ -307,6 +307,10 @@ void

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11164: URL: https://github.com/apache/hudi/pull/11164#issuecomment-2097417960 ## CI report: * 10292370450b75b41cd16f93072b3357e468ae85 Azure:

Re: [PR] [HUDI-7652] Add new `HoodieMergeKey` API to support simple and composite keys [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11077: URL: https://github.com/apache/hudi/pull/11077#issuecomment-2097417772 ## CI report: * 0eff97cd517ed728a93eea9e8aaca05e6eb72650 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run

Re: [PR] [MINOR] Fix CI ERROR in TestHoodieDeltaStreamer [hudi]

2024-05-06 Thread via GitHub
xuzifu666 closed pull request #11165: [MINOR] Fix CI ERROR in TestHoodieDeltaStreamer URL: https://github.com/apache/hudi/pull/11165 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[PR] [MINOR] Fix CI ERROR in TestHoodieDeltaStreamer [hudi]

2024-05-06 Thread via GitHub
xuzifu666 opened a new pull request, #11165: URL: https://github.com/apache/hudi/pull/11165 ### Change Logs CI fix ### Impact none ### Risk level (write none, low medium or high below) none ### Documentation Update _Describe any necessary documentation

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
jonvex commented on code in PR #11164: URL: https://github.com/apache/hudi/pull/11164#discussion_r1591775499 ## hudi-utilities/src/test/java/org/apache/hudi/utilities/deltastreamer/TestHoodieDeltaStreamer.java: ## @@ -2827,7 +2828,7 @@ public void testAutoGenerateRecordKeys()

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
codope commented on code in PR #11164: URL: https://github.com/apache/hudi/pull/11164#discussion_r1591770204 ## hudi-utilities/src/test/java/org/apache/hudi/utilities/deltastreamer/TestHoodieDeltaStreamer.java: ## @@ -2827,7 +2828,7 @@ public void testAutoGenerateRecordKeys()

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11164: URL: https://github.com/apache/hudi/pull/11164#issuecomment-2097379386 ## CI report: * 10292370450b75b41cd16f93072b3357e468ae85 Azure:

Re: [PR] [HUDI-7704] Unify test client storage classes with duplicate code [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11152: URL: https://github.com/apache/hudi/pull/11152#issuecomment-2097379271 ## CI report: * dca94d809e6f517e82e7b4b41582995465c80676 Azure:

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097379359 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN * ea258fe4883b5612f52ce68a0c2c33ec2c0ef089 Azure:

Re: [PR] [HUDI-7587] Make bundle dependencies for storage abstraction in correct order [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11131: URL: https://github.com/apache/hudi/pull/11131#issuecomment-2097379202 ## CI report: * 1a5e0d9b0b3bd73b7034ef290fbeaf6aa7a66441 Azure:

Re: [PR] [HUDI-7404] Bloom execution improvements [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #10578: URL: https://github.com/apache/hudi/pull/10578#issuecomment-2097378545 ## CI report: * 86a6e24f202a76c316086b59fc69308c57631b4e UNKNOWN * 7f76ebca55ef148c1497786bb45bbf9c50ecdee6 Azure:

Re: [PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11164: URL: https://github.com/apache/hudi/pull/11164#issuecomment-2097373679 ## CI report: * 10292370450b75b41cd16f93072b3357e468ae85 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097373636 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN * ea258fe4883b5612f52ce68a0c2c33ec2c0ef089 Azure:

Re: [PR] [HUDI-7704] Unify test client storage classes with duplicate code [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11152: URL: https://github.com/apache/hudi/pull/11152#issuecomment-2097373536 ## CI report: * dca94d809e6f517e82e7b4b41582995465c80676 Azure:

Re: [PR] [HUDI-7587] Make bundle dependencies for storage abstraction in correct order [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11131: URL: https://github.com/apache/hudi/pull/11131#issuecomment-2097373459 ## CI report: * 1a5e0d9b0b3bd73b7034ef290fbeaf6aa7a66441 Azure:

Re: [PR] [HUDI-7404] Bloom execution improvements [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #10578: URL: https://github.com/apache/hudi/pull/10578#issuecomment-2097372846 ## CI report: * 86a6e24f202a76c316086b59fc69308c57631b4e UNKNOWN * 7f76ebca55ef148c1497786bb45bbf9c50ecdee6 Azure:

Re: [PR] [HUDI-7713] Enforce ordering of fields during schema reconciliation [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11154: URL: https://github.com/apache/hudi/pull/11154#issuecomment-2097367499 ## CI report: * 12038dbde068e26f733a7b1c9cc7217019c31f25 UNKNOWN * fbb9dd5d64652ddec923dc7948f77adc61e823b3 Azure:

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097364669 Then why are there always small files in the partition a few days ago that are constantly rebuilt and deleted? There is no more data written to these partitions. I think

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097357620 > clean.retain_commits was 1 That means each time a new version of a file is generated, the old one is deleted; for a "COW" table, there is a very high possibility you would

[jira] [Updated] (HUDI-7721) Fix broken build on master

2024-05-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7721: - Labels: pull-request-available (was: ) > Fix broken build on master > --

[PR] [HUDI-7721] Fix broken build on master [hudi]

2024-05-06 Thread via GitHub
jonvex opened a new pull request, #11164: URL: https://github.com/apache/hudi/pull/11164 ### Change Logs Due to big changes from de-hadooping, a PR was merged that breaks the build on master. ### Impact Fixes master so it can build. ### Risk level (write none,

[jira] [Created] (HUDI-7721) Fix broken build on master

2024-05-06 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7721: - Summary: Fix broken build on master Key: HUDI-7721 URL: https://issues.apache.org/jira/browse/HUDI-7721 Project: Apache Hudi Issue Type: Bug

Re: [PR] [HUDI-7522] Support find out the conflict instants in bucket partition when bucket id multiple [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #10898: URL: https://github.com/apache/hudi/pull/10898#issuecomment-2097317204 ## CI report: * e9fc630d3a8999c7ef0db7bd94da910b1f77df7d UNKNOWN * b7011691a07deb288ce0341dcd55bb6feeb4101d UNKNOWN * a387882bde246df2a81e8aca30c63835180791c9 Azure:

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097307624 Small files are continuously merged in the background until 600M. Are you sure that if we do this, will the downstream program read less data or read repeatedly? For

Re: [PR] [HUDI-7404] Bloom execution improvements [hudi]

2024-05-06 Thread via GitHub
the-other-tim-brown commented on code in PR #10578: URL: https://github.com/apache/hudi/pull/10578#discussion_r1591737966 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/simple/HoodieSimpleIndex.java: ## @@ -143,19 +141,17 @@ protected HoodieData>

Re: [PR] [HUDI-7522] Support find out the conflict instants in bucket partition when bucket id multiple [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #10898: URL: https://github.com/apache/hudi/pull/10898#issuecomment-2097304476 ## CI report: * e9fc630d3a8999c7ef0db7bd94da910b1f77df7d UNKNOWN * b7011691a07deb288ce0341dcd55bb6feeb4101d UNKNOWN * 3f6b855b6cdace0a26751cc48eeadfe4bd183564 Azure:

Re: [PR] [HUDI-7713] Enforce ordering of fields during schema reconciliation [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11154: URL: https://github.com/apache/hudi/pull/11154#issuecomment-2097305369 ## CI report: * 12038dbde068e26f733a7b1c9cc7217019c31f25 UNKNOWN * 5f0a670935552be77adb223d64469b2419b97dc8 Azure:

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
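weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097301076 Small files are continuously merged in the background until they reach 600M. Are you sure that doing this will not cause the downstream program to miss data or read it repeatedly? > On May 7, 2024, at 10:14, Danny Chan ***@***.***> wrote: > > > We did have the tests already in the repo for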

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097295654 I did not increase clean.retain_commits; clean.retain_commits was 1. By the way, how do I configure clean.async.enabled

Re: [PR] [HUDI-7713] Enforce ordering of fields during schema reconciliation [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11154: URL: https://github.com/apache/hudi/pull/11154#issuecomment-2097291256 ## CI report: * 12038dbde068e26f733a7b1c9cc7217019c31f25 UNKNOWN * 5f0a670935552be77adb223d64469b2419b97dc8 Azure:

Re: [PR] [HUDI-7522] Support find out the conflict instants in bucket partition when bucket id multiple [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #10898: URL: https://github.com/apache/hudi/pull/10898#issuecomment-2097290412 ## CI report: * e9fc630d3a8999c7ef0db7bd94da910b1f77df7d UNKNOWN * b7011691a07deb288ce0341dcd55bb6feeb4101d UNKNOWN * 3f6b855b6cdace0a26751cc48eeadfe4bd183564 Azure:

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097276769 We did have the tests already in the repo for clustering and compaction skipping read, can you ensure the option takes effect and increase the numbers of retained commits before

[jira] [Closed] (HUDI-7663) Cannot discover new partitons when i using stream reading by flink1.1.6.1-hudi13.1

2024-05-06 Thread weitianpei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] weitianpei closed HUDI-7663. Resolution: Fixed > Cannot discover new partitons when i using stream reading by > flink1.1.6.1-hudi13.1 >

[jira] [Commented] (HUDI-7234) Handle both inserts and updates in log blocks for partial updates

2024-05-06 Thread weitianpei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844086#comment-17844086 ] weitianpei commented on HUDI-7234: -- hi > Handle both inserts and updates in log blocks for partial

[jira] (HUDI-7234) Handle both inserts and updates in log blocks for partial updates

2024-05-06 Thread weitianpei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7234 ] weitianpei deleted comment on HUDI-7234: -- was (Author: weitianpei): hi > Handle both inserts and updates in log blocks for partial updates >

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097265211 Would you mind adding a test to solve this problem? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097263209 I added the skip parameter in my downstream Flink program, but the same problem happened again -- This is an automated message from the Apache Git Service. To respond to the

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097257583 > clustered still. > And the downstream flink program read these files would met FileNOTEXTIES exception. Either clustering and compaction can be skipped in flink

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097253674 @codope when will we solve this problem? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097252047 ![7057026C-8A58-428C-BFD3-E2F75085E25D](https://github.com/apache/hudi/assets/30386282/7402d156-0506-47d8-8f38-700c74afcaac) Please look at this picture. On April 17th, we found the little

[jira] [Updated] (HUDI-6778) Track schema in metadata table

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-6778: - Status: In Progress (was: Open) > Track schema in metadata table >

[jira] [Updated] (HUDI-7146) Implement secondary index

2024-05-06 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-7146: -- Reviewers: Danny Chen > Implement secondary index > - > > Key:

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097236100 If the job is not executing rollback repetitively, these files should be just a replacing of "COW" of files, for "COW", we create a new base file to replace the old one while the

Re: [I] [SUPPORT] Error using the property hoodie.datasource.write.drop.partition.columns [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on issue #11144: URL: https://github.com/apache/hudi/issues/11144#issuecomment-2097231302 The contract here is: the partition **field** should be in the table schema anyway. -- This is an automated message from the Apache Git Service. To respond to the message, please

[jira] [Updated] (HUDI-7639) Refactor HoodieFileIndex so that different indexes can be used via optimizer rules

2024-05-06 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-7639: -- Story Points: 5 > Refactor HoodieFileIndex so that different indexes can be used via optimizer > rules

Re: [I] [SUPPORT]Flink Streaming Read hudi table which is in clustering,encounterd file not exists. [hudi]

2024-05-06 Thread via GitHub
weitianpei commented on issue #11090: URL: https://github.com/apache/hudi/issues/11090#issuecomment-2097217302 It has nothing to do with this parameter; files are continuously created in place, and the old files are deleted. > On April 26, 2024, at 11:11, Danny Chan ***@***.***> wrote: > > > There are some logs that reports the reader progress in the monitor

Re: [PR] [HUDI-7710] Use compaction.requested during conflict resolution [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on code in PR #11151: URL: https://github.com/apache/hudi/pull/11151#discussion_r1591712468 ## hudi-client/hudi-client-common/src/test/java/org/apache/hudi/client/transaction/TestSimpleConcurrentFileWritesConflictResolutionStrategyWithMORTable.java: ## @@

Re: [PR] [HUDI-7522] Support find out the conflict instants in bucket partition when bucket id multiple [hudi]

2024-05-06 Thread via GitHub
danny0405 commented on PR #10898: URL: https://github.com/apache/hudi/pull/10898#issuecomment-2097199850 Thanks for the work, I have reviewed and applied a patch here: [7522.patch.zip](https://github.com/apache/hudi/files/15228302/7522.patch.zip) Please supplement the tests with

[jira] [Closed] (HUDI-7715) Partition TTL for Flink

2024-05-06 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-7715. Resolution: Fixed Fixed via master branch: fdb94192508a3d76fdba63429d9b0df718316a7e > Partition TTL for

[jira] [Assigned] (HUDI-7715) Partition TTL for Flink

2024-05-06 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen reassigned HUDI-7715: Assignee: Danny Chen > Partition TTL for Flink > --- > > Key:

(hudi) branch master updated: [HUDI-7715] Partition TTL for Flink (#11156)

2024-05-06 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new fdb94192508 [HUDI-7715] Partition TTL for

Re: [PR] [HUDI-7715] Partition TTL for Flink [hudi]

2024-05-06 Thread via GitHub
danny0405 merged PR #11156: URL: https://github.com/apache/hudi/pull/11156 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Commented] (HUDI-7234) Handle both inserts and updates in log blocks for partial updates

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844069#comment-17844069 ] Vinoth Chandar commented on HUDI-7234: -- this to be handled in 1.1.0 along with partial update

[jira] [Updated] (HUDI-7234) Handle both inserts and updates in log blocks for partial updates

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7234: - Status: Open (was: In Progress) > Handle both inserts and updates in log blocks for partial

[jira] [Updated] (HUDI-7234) Handle both inserts and updates in log blocks for partial updates

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7234: - Fix Version/s: 1.1.0 (was: 1.0.0) > Handle both inserts and updates in log

[jira] [Updated] (HUDI-7541) Ensure extensibility to new indexes - vectors, search and other formats (CLP, unstructured data)

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7541: - Fix Version/s: 1.1.0 (was: 1.0.0) > Ensure extensibility to new indexes -

[jira] [Commented] (HUDI-7541) Ensure extensibility to new indexes - vectors, search and other formats (CLP, unstructured data)

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844068#comment-17844068 ] Vinoth Chandar commented on HUDI-7541: -- Punting this to 1.1 > Ensure extensibility to new indexes -

[jira] [Closed] (HUDI-7679) Ensure extensibility to unstructured data, logs (CLP), vectors, other index types

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-7679. Resolution: Duplicate > Ensure extensibility to unstructured data, logs (CLP), vectors, other index

[jira] [Updated] (HUDI-7538) Consolidate the CDC Formats (changelog format, RFC-51)

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7538: - Fix Version/s: (was: 1.1.0) > Consolidate the CDC Formats (changelog format, RFC-51) >

[jira] [Resolved] (HUDI-7679) Ensure extensibility to unstructured data, logs (CLP), vectors, other index types

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar resolved HUDI-7679. -- > Ensure extensibility to unstructured data, logs (CLP), vectors, other index > types >

[jira] [Updated] (HUDI-7234) Handle both inserts and updates in log blocks for partial updates

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7234: - Status: In Progress (was: Open) > Handle both inserts and updates in log blocks for partial

[jira] [Updated] (HUDI-7541) Ensure extensibility to new indexes - vectors, search and other formats (CLP, unstructured data)

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7541: - Status: In Progress (was: Open) > Ensure extensibility to new indexes - vectors, search and

[jira] [Updated] (HUDI-7679) Ensure extensibility to unstructured data, logs (CLP), vectors, other index types

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7679: - Status: In Progress (was: Open) > Ensure extensibility to unstructured data, logs (CLP),

[jira] [Updated] (HUDI-7538) Consolidate the CDC Formats (changelog format, RFC-51)

2024-05-06 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-7538: - Fix Version/s: 1.1.0 > Consolidate the CDC Formats (changelog format, RFC-51) >

[jira] [Updated] (HUDI-7507) ongoing concurrent writers with smaller timestamp can cause issues with table services

2024-05-06 Thread Krishen Bhan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishen Bhan updated HUDI-7507: --- Description: *Scenarios:* Although HUDI operations hold a table lock when creating a .requested

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097080044 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN * ea258fe4883b5612f52ce68a0c2c33ec2c0ef089 Azure:

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-05-06 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7350: -- Status: Patch Available (was: In Progress) > Introduce HoodieIOFactory to abstract the reader

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097037621 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN * ea258fe4883b5612f52ce68a0c2c33ec2c0ef089 Azure:

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097026611 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN * ea258fe4883b5612f52ce68a0c2c33ec2c0ef089 UNKNOWN

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2097017280 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN * d491f7ed864af5c291d365dcfe9392a5bbc8dd2d UNKNOWN Bot commands @hudi-bot supports the

Re: [PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
hudi-bot commented on PR #11163: URL: https://github.com/apache/hudi/pull/11163#issuecomment-2096958204 ## CI report: * 7376b451044473ce16aad09a1d356a9140442f9c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-05-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7350: - Labels: hoodie-storage pull-request-available (was: hoodie-storage) > Introduce HoodieIOFactory

[PR] [HUDI-7350] Create hudi io factory [hudi]

2024-05-06 Thread via GitHub
jonvex opened a new pull request, #11163: URL: https://github.com/apache/hudi/pull/11163 ### Change Logs Abstract io reader and writer to de-hadoop ### Impact de-hadooping for trino ### Risk level (write none, low medium or high below) low ###

Re: [PR] [MINOR] Use parent as the glob path when full file path specified [hudi]

2024-05-06 Thread via GitHub
the-other-tim-brown commented on code in PR #11150: URL: https://github.com/apache/hudi/pull/11150#discussion_r1591567763 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/clustering/run/strategy/MultipleSparkJobExecutionStrategy.java: ## @@ -457,9 +457,10

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-05-06 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7350: -- Status: In Progress (was: Open) > Introduce HoodieIOFactory to abstract the reader and writer

(hudi) branch master updated: [HUDI-7710] Use compaction.requested during conflict resolution (#11151)

2024-05-06 Thread jonvex
This is an automated email from the ASF dual-hosted git repository. jonvex pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new bc07e3ed252 [HUDI-7710] Use compaction.requested
