thomasg19930417 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1576140270
Thank you for your reply. Once the test is successful, I will submit a PR
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on
danny0405 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1576138294
Yeah, I think so, just need to check that we fix all the classes that are
not version compatible.
--
This is an automated message from the Apache Git Service.
To respond to the mess
yihua opened a new pull request, #8885:
URL: https://github.com/apache/hudi/pull/8885
### Change Logs
_Describe context and summary for this change. Highlight if any code was
copied._
### Impact
_Describe any public API or user-facing feature change or any performance
i
boneanxs commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1217591203
##
hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/MergeOnReadIncrementalRelation.scala:
##
@@ -60,11 +60,14 @@ case class MergeOnReadIncrementalRe
hudi-bot commented on PR #8883:
URL: https://github.com/apache/hudi/pull/8883#issuecomment-1576115024
## CI report:
* 875c216b5e2c8fc496a48abb69506465c295ac6e Azure:
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1759
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Status: In Progress (was: Open)
> Streaming read should skip clustering instants to avoid duplica
hudi-bot commented on PR #8452:
URL: https://github.com/apache/hudi/pull/8452#issuecomment-1576113925
## CI report:
* 8082df232089396b2a9f9be2b915e51b3645f172 UNKNOWN
* 36d8a94e5d19e0b06fb48d7161d7ec4a43ac0338 Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-6317:
-
Labels: pull-request-available (was: )
> Streaming read should skip clustering instants to avoid
SteNicholas opened a new pull request, #8884:
URL: https://github.com/apache/hudi/pull/8884
### Change Logs
At present, the default value of `read.streaming.skip_clustering` is false,
which could cause the situation that streaming reading reads the replaced file
slices of clustering,
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Summary: Streaming read should skip clustering instants to avoid duplicated
reading (was: Streami
thomasg19930417 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1576108820
> Yeah, do you have intreast to contribute a fix for this? We can write our
own impl for `StringInternUtils` because it does not have good version
compatibility.
![image](ht
boneanxs commented on PR #8452:
URL: https://github.com/apache/hudi/pull/8452#issuecomment-1576107771
@hudi-bot run azure
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To
hudi-bot commented on PR #8883:
URL: https://github.com/apache/hudi/pull/8883#issuecomment-1576107687
## CI report:
* 875c216b5e2c8fc496a48abb69506465c295ac6e UNKNOWN
Bot commands
@hudi-bot supports the following commands:
- `@hudi-bot run azure` re-run the
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Summary: Streaming read should skip clustering instants to avoid deplicated
reading (was: Streami
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Description: At present, the default value of
read.streaming.skip_clustering is false, which could
hudi-bot commented on PR #8452:
URL: https://github.com/apache/hudi/pull/8452#issuecomment-1576096351
## CI report:
* 8082df232089396b2a9f9be2b915e51b3645f172 UNKNOWN
* 36d8a94e5d19e0b06fb48d7161d7ec4a43ac0338 Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2
hudi-bot commented on PR #6643:
URL: https://github.com/apache/hudi/pull/6643#issuecomment-1576093875
## CI report:
* bf69cb9457ef0d0690a0d30a5e7b5d3db399b75e Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1759
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Description: At present, the default value of
read.streaming.skip_clustering is false, which could
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Description: At present, the default value of
read.streaming.skip_clustering is false, which could
voonhous opened a new pull request, #8883:
URL: https://github.com/apache/hudi/pull/8883
### Change Logs
Fixing checkstyle and removed unnecessary index in `ParquetSplitReaderUtil`.
### Impact
_Describe any public API or user-facing feature change or any performance
impa
flashJd commented on PR #6643:
URL: https://github.com/apache/hudi/pull/6643#issuecomment-1576085073
@xushiyan @nsivabalan I've rebased master, the CI error seems has no
bussiness with this PR
--
This is an automated message from the Apache Git Service.
To respond to the message, please l
[
https://issues.apache.org/jira/browse/HUDI-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas Jiang updated HUDI-6317:
-
Summary: Streaming read should skip clustering instants (was: The default
value of read.streaming
Nicholas Jiang created HUDI-6317:
Summary: The default value of read.streaming.skip_clustering
should be true
Key: HUDI-6317
URL: https://issues.apache.org/jira/browse/HUDI-6317
Project: Apache Hudi
xushiyan commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1217449442
##
hudi-common/src/main/java/org/apache/hudi/common/table/timeline/TimelineUtils.java:
##
@@ -295,4 +296,27 @@ public static Option
getEarliestInstantForMetadataArchival
xushiyan commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1217469861
##
hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/MergeOnReadIncrementalRelation.scala:
##
@@ -60,11 +60,14 @@ case class MergeOnReadIncrementalRe
xushiyan commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1217451252
##
hudi-common/src/main/java/org/apache/hudi/common/table/timeline/TimelineUtils.java:
##
@@ -295,4 +296,27 @@ public static Option
getEarliestInstantForMetadataArchival
xushiyan commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1217449442
##
hudi-common/src/main/java/org/apache/hudi/common/table/timeline/TimelineUtils.java:
##
@@ -295,4 +296,27 @@ public static Option
getEarliestInstantForMetadataArchival
hudi-bot commented on PR #6643:
URL: https://github.com/apache/hudi/pull/6643#issuecomment-1576012261
## CI report:
* bf69cb9457ef0d0690a0d30a5e7b5d3db399b75e Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1759
danny0405 commented on PR #8783:
URL: https://github.com/apache/hudi/pull/8783#issuecomment-1576002997
@suryaprasanna Hi, can you update and fix the test failures?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
UR
boneanxs commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1217398502
##
hudi-common/src/main/java/org/apache/hudi/common/table/timeline/TimelineUtils.java:
##
@@ -295,4 +296,27 @@ public static Option
getEarliestInstantForMetadataArchival
hudi-bot commented on PR #8452:
URL: https://github.com/apache/hudi/pull/8452#issuecomment-1575980425
## CI report:
* 8082df232089396b2a9f9be2b915e51b3645f172 UNKNOWN
* 9e5504e078b93d1997cf901868234e36c69dd97e Azure:
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2
[
https://issues.apache.org/jira/browse/HUDI-6021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
weiming reassigned HUDI-6021:
-
Assignee: weiming
> insert overwrite table will delete entire data
>
hudi-bot commented on PR #8452:
URL: https://github.com/apache/hudi/pull/8452#issuecomment-1575975338
## CI report:
* 8082df232089396b2a9f9be2b915e51b3645f172 UNKNOWN
* 9e5504e078b93d1997cf901868234e36c69dd97e Azure:
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2
danny0405 commented on issue #8276:
URL: https://github.com/apache/hudi/issues/8276#issuecomment-1575962096
> the first checkpoint after savepoint
Got it, I need to figure out what's the savepoint's effect to the ensueing
checkpoint.
--
This is an automated message from the Apache
danny0405 commented on code in PR #8729:
URL: https://github.com/apache/hudi/pull/8729#discussion_r1217387626
##
hudi-spark-datasource/hudi-spark3.2plus-common/src/main/scala/org/apache/spark/sql/catalyst/plans/logcal/HoodieTableChanges.scala:
##
@@ -0,0 +1,91 @@
+/*
+ * License
gfunc commented on issue #8276:
URL: https://github.com/apache/hudi/issues/8276#issuecomment-1575957699
@danny0405 Sorry for any confusion, I meant the above-mentioned specific
scenario: the first checkpoint after savepoint. Normal checkpoint is relatively
slow but ok since we had a bad par
danny0405 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1575951654
Yeah, do you have intreast to contribute a fix for this? We can write our
own impl for `StringInternUtils` because it does not have good version
compatibility.
--
This is an automa
thomasg19930417 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1575949058
Can these methods be extended once to be better compatible with hive?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to G
thomasg19930417 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1575945668
I confirmed that this class does not exist in the currently used hive
version, so it only supports part of the Hive2.x version
--
This is an automated message from the Apache
thomasg19930417 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1575944762
I found a similar issue https://github.com/apache/hudi/issues/3795
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to Gi
thomasg19930417 opened a new issue, #8882:
URL: https://github.com/apache/hudi/issues/8882
env:
hudi 0.12.1
hive 2.2.0
config:
set hive.input.format=org.apache.hadoop.hive.ql.io.CombineHiveInputFormat;
dd jar hdfs://mycluster/hudi/jars/hudi-hadoop-mr-bundle-0.10.0.jar;
ChestnutQiang commented on PR #7652:
URL: https://github.com/apache/hudi/pull/7652#issuecomment-1575942105
Hello, has this pr been fixed? @complone
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go t
hudi-bot commented on PR #6643:
URL: https://github.com/apache/hudi/pull/6643#issuecomment-1575896384
## CI report:
* 9bb916dce004481edaa9891aad1e8768e27bae92 Azure:
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1757
hudi-bot commented on PR #6643:
URL: https://github.com/apache/hudi/pull/6643#issuecomment-1575892345
## CI report:
* 9bb916dce004481edaa9891aad1e8768e27bae92 Azure:
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1757
bvaradar commented on code in PR #8847:
URL: https://github.com/apache/hudi/pull/8847#discussion_r1217216256
##
hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieBootstrapMORRDD.scala:
##
@@ -0,0 +1,78 @@
+/*
+ * Licensed to the Apache Software Foundat
hudi-bot commented on PR #8881:
URL: https://github.com/apache/hudi/pull/8881#issuecomment-1575692706
## CI report:
* 7b9d3ca6b52733dc531f54d6bf21ca449cc3254f Azure:
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1759
hudi-bot commented on PR #8881:
URL: https://github.com/apache/hudi/pull/8881#issuecomment-1575643792
## CI report:
* 7b9d3ca6b52733dc531f54d6bf21ca449cc3254f Azure:
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1759
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575643776
## CI report:
* c62a77245b24274bf5b1e594b36b4a7a56ea2c73 UNKNOWN
* 2793a5f1eb861ed6d938c33d471bcf2d56e3051e Azure:
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2
hudi-bot commented on PR #8881:
URL: https://github.com/apache/hudi/pull/8881#issuecomment-1575641146
## CI report:
* 7b9d3ca6b52733dc531f54d6bf21ca449cc3254f UNKNOWN
Bot commands
@hudi-bot supports the following commands:
- `@hudi-bot run azure` re-run the
nsivabalan opened a new pull request, #8881:
URL: https://github.com/apache/hudi/pull/8881
### Change Logs
Adding log block metrics to track corrupted lock blocks and rollback blocks.
Users need to enable 'hoodie.metrics.compaction.log.blocks.on' to enable the
metrics.
### Im
[
https://issues.apache.org/jira/browse/HUDI-6316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-6316:
-
Labels: pull-request-available (was: )
> Add metrics to track corrupted blocks and rollback block
sivabalan narayanan created HUDI-6316:
-
Summary: Add metrics to track corrupted blocks and rollback blocks
Key: HUDI-6316
URL: https://issues.apache.org/jira/browse/HUDI-6316
Project: Apache Hudi
xushiyan commented on code in PR #8876:
URL: https://github.com/apache/hudi/pull/8876#discussion_r1216842875
##
hudi-common/src/main/java/org/apache/hudi/common/table/timeline/TimelineUtils.java:
##
@@ -295,4 +296,24 @@ public static Option
getEarliestInstantForMetadataArchival
[
https://issues.apache.org/jira/browse/HUDI-6225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-6225:
-
Labels: pull-request-available (was: )
> Documentation for hudi_table_changes() and hudi_table_ch
kazdy opened a new pull request, #8880:
URL: https://github.com/apache/hudi/pull/8880
### Change Logs
add hudi_table_changes(by_path) docs to spark quickstart guide
### Impact
none
### Risk level (write none, low medium or high below)
none
### Doc
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575580495
## CI report:
* 222567017d635b9ee36041599bce3d164a49b437 Azure:
[CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=175
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575563607
## CI report:
* 222567017d635b9ee36041599bce3d164a49b437 Azure:
[CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=175
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575560526
## CI report:
* 2c3232cbd9e7148da101a13f9cb7edb1276ef31d Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1758
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575546727
## CI report:
* 2c3232cbd9e7148da101a13f9cb7edb1276ef31d Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1758
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575544106
## CI report:
* 2c3232cbd9e7148da101a13f9cb7edb1276ef31d Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1758
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575540960
## CI report:
* 2c3232cbd9e7148da101a13f9cb7edb1276ef31d Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1758
leosanqing commented on issue #8872:
URL: https://github.com/apache/hudi/issues/8872#issuecomment-1575502157
> Just disable the vec execution of Hive.
Thx, it work.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub a
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575482777
## CI report:
* af6fd1b63a7ac8746d14224d8bd12cc955534c01 Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1757
hudi-bot commented on PR #8876:
URL: https://github.com/apache/hudi/pull/8876#issuecomment-1575480347
## CI report:
* af6fd1b63a7ac8746d14224d8bd12cc955534c01 Azure:
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1757
64 matches
Mail list logo