[jira] [Closed] (HUDI-7583) Read log block header only for the schema and instant time

2024-04-09 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-7583. - Resolution: Fixed > Read log block header only for the schema and instant time >

(hudi) branch master updated: [HUDI-7583] Read log block header only for the schema and instant time (#10984)

2024-04-09 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new c2f2e615106 [HUDI-7583] Read log block header

Re: [PR] [HUDI-7583] Read log block header only for the schema and instant time [hudi]

2024-04-09 Thread via GitHub
codope merged PR #10984: URL: https://github.com/apache/hudi/pull/10984 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10987: URL: https://github.com/apache/hudi/pull/10987#issuecomment-2046581284 ## CI report: * e608ca27d1d69ea9b6d6fe299ea0139f4fed04d5 Azure:

Re: [PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10987: URL: https://github.com/apache/hudi/pull/10987#issuecomment-2046574649 ## CI report: * e608ca27d1d69ea9b6d6fe299ea0139f4fed04d5 Azure:

[I] [SUPPORT]File Not Found Exception occurrs when Flink task read Hudi MOR table failure recover [hudi]

2024-04-09 Thread via GitHub
Sparsamkeit opened a new issue, #10988: URL: https://github.com/apache/hudi/issues/10988 **Describe the problem you faced** After Flink fails to read the Hudi MOR table task and restarts after a period of time, an exception will occur that the log file does not exist. This may

Re: [PR] [HUDI-7583] Read log block header only for the schema and instant time [hudi]

2024-04-09 Thread via GitHub
yihua commented on code in PR #10984: URL: https://github.com/apache/hudi/pull/10984#discussion_r1558866328 ## hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java: ## @@ -389,7 +389,10 @@ private MessageType readSchemaFromLogFile(Path path) throws

Re: [I] RLI Spark Hudi Error occurs when executing map [hudi]

2024-04-09 Thread via GitHub
ad1happy2go commented on issue #10609: URL: https://github.com/apache/hudi/issues/10609#issuecomment-2046560389 @nsivabalan We were not able to reproduce this error in our setup. I went into multiple calls with @maheshguptags and setup the exact same setup in my local. But He is

[jira] [Updated] (HUDI-7595) Investigate and fix flaky tests in ITTestHoodieDataSource

2024-04-09 Thread Vova Kolmakov (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vova Kolmakov updated HUDI-7595: Description: {code:java} Failures: 2024-04-10T00:47:48.7776428Z [ERROR]

Re: [I] RLI Spark Hudi Error occurs when executing map [hudi]

2024-04-09 Thread via GitHub
maheshguptags commented on issue #10609: URL: https://github.com/apache/hudi/issues/10609#issuecomment-2046555120 @nsivabalan We haven't resolved the original issue and it is still open. -- This is an automated message from the Apache Git Service. To respond to the message, please

Re: [PR] [HUDI-7583] Read log block header only for the schema and instant time [hudi]

2024-04-09 Thread via GitHub
codope commented on code in PR #10984: URL: https://github.com/apache/hudi/pull/10984#discussion_r1558851028 ## hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java: ## @@ -389,7 +389,10 @@ private MessageType readSchemaFromLogFile(Path path) throws

[jira] [Commented] (HUDI-7596) Enable Jacoco code coverage report across multiple modules

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17835592#comment-17835592 ] Ethan Guo commented on HUDI-7596: - https://github.com/apache/hudi/pull/10646 > Enable Jacoco code

Re: [PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
yihua commented on code in PR #10987: URL: https://github.com/apache/hudi/pull/10987#discussion_r1558846954 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/KafkaOffsetGen.java: ## @@ -331,24 +331,35 @@ private List fetchPartitionInfos(KafkaConsumer

(hudi) branch master updated: [HUDI-7556] Fixing false positive validation with MDT validator (#10986)

2024-04-09 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new a23c5b783e7 [HUDI-7556] Fixing false positive

Re: [PR] [HUDI-7556] Fixing false positive validation with MDT validator [hudi]

2024-04-09 Thread via GitHub
yihua merged PR #10986: URL: https://github.com/apache/hudi/pull/10986 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
codope commented on code in PR #10987: URL: https://github.com/apache/hudi/pull/10987#discussion_r1558838076 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/KafkaOffsetGen.java: ## @@ -331,24 +331,35 @@ private List fetchPartitionInfos(KafkaConsumer

Re: [PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10987: URL: https://github.com/apache/hudi/pull/10987#issuecomment-2046531837 ## CI report: * e608ca27d1d69ea9b6d6fe299ea0139f4fed04d5 Azure:

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046531797 ## CI report: * dbceeac96d98d8b87a3a771650d59554480cca16 Azure:

Re: [PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10987: URL: https://github.com/apache/hudi/pull/10987#issuecomment-2046526881 ## CI report: * e608ca27d1d69ea9b6d6fe299ea0139f4fed04d5 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046526783 ## CI report: * dbceeac96d98d8b87a3a771650d59554480cca16 Azure:

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
wombatu-kun commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046522508 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046521267 ## CI report: * 31eb84b8fc7e0d8066633ff8f6bc92b14b8660e3 Azure:

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046521306 ## CI report: * dbceeac96d98d8b87a3a771650d59554480cca16 Azure:

[jira] [Updated] (HUDI-7597) Add logs of Kafka offsets when the checkpoint is out of bound

2024-04-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7597: - Labels: pull-request-available (was: ) > Add logs of Kafka offsets when the checkpoint is out of

[PR] [HUDI-7597] Add logs of Kafka offsets when the checkpoint is out of bound [hudi]

2024-04-09 Thread via GitHub
yihua opened a new pull request, #10987: URL: https://github.com/apache/hudi/pull/10987 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performance

Re: [I] [SUPPORT] project hudi-common: Compilation failure: Compilation failure [hudi]

2024-04-09 Thread via GitHub
ZhangWeike2000 commented on issue #9744: URL: https://github.com/apache/hudi/issues/9744#issuecomment-2046495674 hudi compilation takes place why ? [ERROR] Failed to execute goal on project hudi-utilities_2.12: Could not resolve dependencies for project

[jira] [Created] (HUDI-7597) Add logs of Kafka offsets when the checkpoint is out of bound

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7597: --- Summary: Add logs of Kafka offsets when the checkpoint is out of bound Key: HUDI-7597 URL: https://issues.apache.org/jira/browse/HUDI-7597 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-7596) Enable Jacoco code coverage report across multiple modules

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7596: Epic Link: HUDI-7537 > Enable Jacoco code coverage report across multiple modules >

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046491796 ## CI report: * dbceeac96d98d8b87a3a771650d59554480cca16 Azure:

[jira] [Updated] (HUDI-7596) Enable Jacoco code coverage report across multiple modules

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7596: Summary: Enable Jacoco code coverage report across multiple modules (was: Enable code coverage report

[jira] [Created] (HUDI-7596) Enable code coverage report across multiple modules

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7596: --- Summary: Enable code coverage report across multiple modules Key: HUDI-7596 URL: https://issues.apache.org/jira/browse/HUDI-7596 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-7596) Enable Jacoco code coverage report across multiple modules

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7596: Fix Version/s: 0.15.0 1.0.0 > Enable Jacoco code coverage report across multiple modules

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
wombatu-kun commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046489546 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046487469 ## CI report: * dbceeac96d98d8b87a3a771650d59554480cca16 Azure:

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046487411 ## CI report: * 70cef40f06ad614ecb7e00530201e06190d8f268 Azure:

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046482910 ## CI report: * 70cef40f06ad614ecb7e00530201e06190d8f268 Azure:

[jira] [Created] (HUDI-7595) Investigate and fix flaky tests in ITTestHoodieDataSource

2024-04-09 Thread Vova Kolmakov (Jira)
Vova Kolmakov created HUDI-7595: --- Summary: Investigate and fix flaky tests in ITTestHoodieDataSource Key: HUDI-7595 URL: https://issues.apache.org/jira/browse/HUDI-7595 Project: Apache Hudi

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046399897 ## CI report: * 3b8b4f1416e0d24dfd11e0eafe19d33eb7520077 Azure:

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046375342 ## CI report: * 3b8b4f1416e0d24dfd11e0eafe19d33eb7520077 Azure:

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046353603 ## CI report: * a1a3769da4ed0f516a25cf5f030bf4eafb1a74f1 Azure:

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046353365 ## CI report: * 3b8b4f1416e0d24dfd11e0eafe19d33eb7520077 Azure:

Re: [PR] [HUDI-6971] OOM caused by configuring read.start_commit as earliest in stream reading [hudi]

2024-04-09 Thread via GitHub
zhuanshenbsj1 closed pull request #9906: [HUDI-6971] OOM caused by configuring read.start_commit as earliest in stream reading URL: https://github.com/apache/hudi/pull/9906 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046308222 ## CI report: * 1b65081255315b4c5129b2d5ccea4c097ca15649 Azure:

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046308178 ## CI report: * c5d77d501854a7e4af2a5d1460a4173dc13100ec Azure:

[jira] [Assigned] (HUDI-7591) Implement InlineFS in HoodieStorage

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7591: --- Assignee: Ethan Guo > Implement InlineFS in HoodieStorage > --- > >

[jira] [Assigned] (HUDI-7587) Move hadoop-dependent reader and writer implementation to hudi-hadoop-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7587: --- Assignee: Ethan Guo > Move hadoop-dependent reader and writer implementation to hudi-hadoop-common

[jira] [Assigned] (HUDI-6912) Avoid using Hadoop classes and APIs in HoodieFileGroupReader

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-6912: --- Assignee: Ethan Guo > Avoid using Hadoop classes and APIs in HoodieFileGroupReader >

[jira] [Assigned] (HUDI-7593) Create COW record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7593: --- Assignee: Ethan Guo > Create COW record reader based on HoodieStorage abstraction for Trino >

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046302372 ## CI report: * 1b65081255315b4c5129b2d5ccea4c097ca15649 Azure:

[jira] [Assigned] (HUDI-7594) Create MOR record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7594: --- Assignee: Ethan Guo > Create MOR record reader based on HoodieStorage abstraction for Trino >

[jira] [Assigned] (HUDI-7592) Remove remaining hadoop usage in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7592: --- Assignee: Ethan Guo > Remove remaining hadoop usage in hudi-common module >

[jira] [Assigned] (HUDI-7588) Replace hadoop Configuration with StorageConfiguration in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7588: --- Assignee: Ethan Guo > Replace hadoop Configuration with StorageConfiguration in hudi-common module >

[jira] [Assigned] (HUDI-7589) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-7589: --- Assignee: Ethan Guo > Add configs to choose HoodieStorage and reader/writer implementation >

Re: [PR] [DO NOT MERGE][HUDI-7567] Add schema evolution to the filegroup reader [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10957: URL: https://github.com/apache/hudi/pull/10957#issuecomment-2046302291 ## CI report: * c5d77d501854a7e4af2a5d1460a4173dc13100ec Azure:

[jira] [Closed] (HUDI-7559) Fix functional index (on column stats): Handle NPE in filterQueriesWithRecordKey(...)

2024-04-09 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-7559. - Resolution: Fixed > Fix functional index (on column stats): Handle NPE in >

[jira] [Closed] (HUDI-7480) initializeFunctionalIndexPartition is called multiple times

2024-04-09 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-7480. - Resolution: Fixed > initializeFunctionalIndexPartition is called multiple times >

[jira] [Closed] (HUDI-7572) Avoid to schedule empty compaction plan without log files

2024-04-09 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-7572. - Resolution: Fixed > Avoid to schedule empty compaction plan without log files >

Re: [I] [SUPPORT]insert_overwrite_table table slow [hudi]

2024-04-09 Thread via GitHub
wkhappy1 commented on issue #10979: URL: https://github.com/apache/hudi/issues/10979#issuecomment-2046293495 @ad1happy2go yes ,we read all columns, because we need overwrite table.we test bulk_insert with overwrite ,it seems not need to cache too big rdd,but bulk_insert with overwrite,it

Re: [I] [SUPPORT]Exception when executing log compaction : Unsupported Operation Exception [hudi]

2024-04-09 Thread via GitHub
danny0405 commented on issue #10982: URL: https://github.com/apache/hudi/issues/10982#issuecomment-2046277007 This is a known bug which was fixed recently: https://github.com/apache/hudi/pull/10194 -- This is an automated message from the Apache Git Service. To respond to the message,

Re: [I] [SUPPORT]Exception when executing log compaction : Unsupported Operation Exception [hudi]

2024-04-09 Thread via GitHub
danny0405 closed issue #10982: [SUPPORT]Exception when executing log compaction : Unsupported Operation Exception URL: https://github.com/apache/hudi/issues/10982 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

(hudi) branch master updated: [MINOR] Optimize print write error msg in StreamWriteOperatorCoordinator#doCommit (#10809)

2024-04-09 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 016bcf769b6 [MINOR] Optimize print write error

Re: [PR] [MINOR] Optimize print write error msg in StreamWriteOperatorCoordinator#doCommit [hudi]

2024-04-09 Thread via GitHub
danny0405 merged PR #10809: URL: https://github.com/apache/hudi/pull/10809 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Updated] (HUDI-7269) Fallback to key-based merging if there is no positions in log header

2024-04-09 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7269: -- Status: In Progress (was: Open) > Fallback to key-based merging if there is no positions in

[jira] [Updated] (HUDI-7587) Move hadoop-dependent reader and writer implementation to hudi-hadoop-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7587: Sprint: Sprint 2024-03-25 > Move hadoop-dependent reader and writer implementation to hudi-hadoop-common >

[jira] [Updated] (HUDI-7594) Create MOR record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7594: Sprint: Sprint 2024-03-25 > Create MOR record reader based on HoodieStorage abstraction for Trino >

[jira] [Updated] (HUDI-7592) Remove remaining hadoop usage in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7592: Sprint: Sprint 2024-03-25 > Remove remaining hadoop usage in hudi-common module >

[jira] [Updated] (HUDI-7593) Create COW record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7593: Sprint: Sprint 2024-03-25 > Create COW record reader based on HoodieStorage abstraction for Trino >

[jira] [Updated] (HUDI-7591) Implement InlineFS in HoodieStorage

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7591: Sprint: Sprint 2024-03-25 > Implement InlineFS in HoodieStorage > --- > >

[jira] [Updated] (HUDI-7589) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7589: Sprint: Sprint 2024-03-25 > Add configs to choose HoodieStorage and reader/writer implementation >

[jira] [Updated] (HUDI-7588) Replace hadoop Configuration with StorageConfiguration in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7588: Sprint: Sprint 2024-03-25 > Replace hadoop Configuration with StorageConfiguration in hudi-common module >

[jira] [Updated] (HUDI-7593) Create COW record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7593: Fix Version/s: 0.15.0 1.0.0 > Create COW record reader based on HoodieStorage

[jira] [Updated] (HUDI-7593) Create COW record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7593: Labels: hoodie-storage (was: ) > Create COW record reader based on HoodieStorage abstraction for Trino >

[jira] [Updated] (HUDI-6912) Avoid using Hadoop classes and APIs in HoodieFileGroupReader

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-6912: Sprint: Sprint 2024-03-25 > Avoid using Hadoop classes and APIs in HoodieFileGroupReader >

[jira] [Updated] (HUDI-6912) Avoid using Hadoop classes and APIs in HoodieFileGroupReader

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-6912: Fix Version/s: 0.15.0 1.0.0 > Avoid using Hadoop classes and APIs in

[jira] [Updated] (HUDI-7591) Implement InlineFS in HoodieStorage

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7591: Fix Version/s: 0.15.0 1.0.0 > Implement InlineFS in HoodieStorage >

[jira] [Updated] (HUDI-7589) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7589: Fix Version/s: 0.15.0 1.0.0 > Add configs to choose HoodieStorage and reader/writer

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7350: Fix Version/s: 0.15.0 > Introduce HoodieIOFactory to abstract the reader and writer implementation >

[jira] [Updated] (HUDI-7588) Replace hadoop Configuration with StorageConfiguration in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7588: Fix Version/s: 0.15.0 1.0.0 > Replace hadoop Configuration with StorageConfiguration in

[jira] [Updated] (HUDI-7592) Remove remaining hadoop usage in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7592: Fix Version/s: 0.15.0 1.0.0 > Remove remaining hadoop usage in hudi-common module >

[jira] [Updated] (HUDI-7594) Create MOR record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7594: Fix Version/s: 0.15.0 1.0.0 > Create MOR record reader based on HoodieStorage

[jira] [Created] (HUDI-7594) Create MOR record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7594: --- Summary: Create MOR record reader based on HoodieStorage abstraction for Trino Key: HUDI-7594 URL: https://issues.apache.org/jira/browse/HUDI-7594 Project: Apache Hudi

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046256409 ## CI report: * 1b65081255315b4c5129b2d5ccea4c097ca15649 Azure:

Re: [PR] [HUDI-7556] Fixing false positive validation with MDT validator [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10986: URL: https://github.com/apache/hudi/pull/10986#issuecomment-2046256472 ## CI report: * 5b7c31a247dd3ca2a8a264fb4a25ad6a83dddc67 Azure:

[jira] [Created] (HUDI-7592) Remove remaining hadoop usage in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7592: --- Summary: Remove remaining hadoop usage in hudi-common module Key: HUDI-7592 URL: https://issues.apache.org/jira/browse/HUDI-7592 Project: Apache Hudi Issue Type: Task

[jira] [Updated] (HUDI-7592) Remove remaining hadoop usage in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7592: Labels: hoodie-storage (was: ) > Remove remaining hadoop usage in hudi-common module >

[jira] [Updated] (HUDI-7593) Create COW record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7593: Story Points: 4 (was: 8) > Create COW record reader based on HoodieStorage abstraction for Trino >

[jira] [Updated] (HUDI-7593) Create COW record reader based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7593: Summary: Create COW record reader based on HoodieStorage abstraction for Trino (was: Create new record

[jira] [Created] (HUDI-7593) Create new record reader classes based on HoodieStorage abstraction for Trino

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7593: --- Summary: Create new record reader classes based on HoodieStorage abstraction for Trino Key: HUDI-7593 URL: https://issues.apache.org/jira/browse/HUDI-7593 Project: Apache Hudi

[jira] [Updated] (HUDI-7589) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7589: Labels: hoodie-storage (was: ) > Add configs to choose HoodieStorage and reader/writer implementation >

[jira] [Created] (HUDI-7590) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7590: --- Summary: Add configs to choose HoodieStorage and reader/writer implementation Key: HUDI-7590 URL: https://issues.apache.org/jira/browse/HUDI-7590 Project: Apache Hudi

[jira] [Created] (HUDI-7589) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7589: --- Summary: Add configs to choose HoodieStorage and reader/writer implementation Key: HUDI-7589 URL: https://issues.apache.org/jira/browse/HUDI-7589 Project: Apache Hudi

[jira] [Closed] (HUDI-7590) Add configs to choose HoodieStorage and reader/writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo closed HUDI-7590. --- Resolution: Duplicate > Add configs to choose HoodieStorage and reader/writer implementation >

[jira] [Created] (HUDI-7591) Implement InlineFS in HoodieStorage

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7591: --- Summary: Implement InlineFS in HoodieStorage Key: HUDI-7591 URL: https://issues.apache.org/jira/browse/HUDI-7591 Project: Apache Hudi Issue Type: Task

[jira] [Updated] (HUDI-7588) Replace hadoop Configuration with StorageConfiguration in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7588: Labels: hoodie-storage (was: ) > Replace hadoop Configuration with StorageConfiguration in hudi-common

[jira] [Created] (HUDI-7588) Replace hadoop Configuration with StorageConfiguration in hudi-common module

2024-04-09 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-7588: --- Summary: Replace hadoop Configuration with StorageConfiguration in hudi-common module Key: HUDI-7588 URL: https://issues.apache.org/jira/browse/HUDI-7588 Project: Apache Hudi

Re: [PR] [HUDI-6441] Passing custom Headers with Hudi Callback URL [hudi]

2024-04-09 Thread via GitHub
hudi-bot commented on PR #10970: URL: https://github.com/apache/hudi/pull/10970#issuecomment-2046251558 ## CI report: * 1b65081255315b4c5129b2d5ccea4c097ca15649 Azure:

[jira] [Updated] (HUDI-6912) Avoid using Hadoop classes and APIs in HoodieFileGroupReader

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-6912: Labels: hoodie-storage (was: ) > Avoid using Hadoop classes and APIs in HoodieFileGroupReader >

[jira] [Updated] (HUDI-6912) Avoid using Hadoop classes and APIs in HoodieFileGroupReader

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-6912: Story Points: 1 > Avoid using Hadoop classes and APIs in HoodieFileGroupReader >

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7350: Story Points: 4 > Introduce HoodieIOFactory to abstract the reader and writer implementation >

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-04-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-7350: Labels: hoodie-storage (was: ) > Introduce HoodieIOFactory to abstract the reader and writer
