[GitHub] [hudi] xushiyan closed issue #8316: [SUPPORT] INSERT operation performance vs UPSERT operation

2023-04-26 Thread via GitHub
xushiyan closed issue #8316: [SUPPORT] INSERT operation performance vs UPSERT operation URL: https://github.com/apache/hudi/issues/8316 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] zhuanshenbsj1 commented on pull request #8505: [HUDI-6106] Spark offline compaction/Clustering Job will do clean like Flink job

2023-04-26 Thread via GitHub
zhuanshenbsj1 commented on PR #8505: URL: https://github.com/apache/hudi/pull/8505#issuecomment-1522901577 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] voonhous opened a new pull request, #8579: [MINOR] Added docs on gotchas when using PartialUpdateAvroPayload

2023-04-26 Thread via GitHub
voonhous opened a new pull request, #8579: URL: https://github.com/apache/hudi/pull/8579 ### Change Logs Updated java docs and added tests to demonstrate gotchas when using PartialUpdateAvroPayload. ### Impact None, changes made are docs and tests to clarify gotchas on

[GitHub] [hudi] ad1happy2go commented on issue #8362: [SUPPORT] Table services with optimistic concurrency control and multiple writers

2023-04-26 Thread via GitHub
ad1happy2go commented on issue #8362: URL: https://github.com/apache/hudi/issues/8362#issuecomment-1522883618 @jtmzheng yes we can do that. Were you able to use it and test it out? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] ad1happy2go commented on issue #8428: [SUPPORT]: When trying to UPSERT, Getting issues like : An error occurred while calling o168.save. org/apache/spark/sql/avro/SchemaConverters$

2023-04-26 Thread via GitHub
ad1happy2go commented on issue #8428: URL: https://github.com/apache/hudi/issues/8428#issuecomment-1522881616 @Madan16 Were you able to resolve it. Can you try to use by our bundle jar once instead on using --datalake-format. -- This is an automated message from the Apache Git Service.

[GitHub] [hudi] ad1happy2go commented on issue #8447: [SUPPORT] Docker Demo Issue With Current master(0.14.0-SNAPSHOT)

2023-04-26 Thread via GitHub
ad1happy2go commented on issue #8447: URL: https://github.com/apache/hudi/issues/8447#issuecomment-1522877868 I will raise a JIRA to look into it more. Ideally should succeed with master too. -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [hudi] ad1happy2go commented on issue #8400: [SUPPORT] Hudi Offline Compaction in EMR Serverless 6.10 for YouTube Video

2023-04-26 Thread via GitHub
ad1happy2go commented on issue #8400: URL: https://github.com/apache/hudi/issues/8400#issuecomment-1522876762 I will try to reproduce this on my end then. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [hudi] ad1happy2go commented on issue #8436: [SUPPORT] run hoodie cleaner process as a spark submit request on EMR 6.9

2023-04-26 Thread via GitHub
ad1happy2go commented on issue #8436: URL: https://github.com/apache/hudi/issues/8436#issuecomment-1522874205 @alexone95 Can you provide the timeline before and after the issue to look into it more. -- This is an automated message from the Apache Git Service. To respond to the message,

[jira] [Updated] (HUDI-6120) fetchAllLogsMergedFileSlice will read basefile which it does not expect

2023-04-26 Thread Jianhui Dong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jianhui Dong updated HUDI-6120: --- Summary: fetchAllLogsMergedFileSlice will read basefile which it does not expect (was: Streaming

[GitHub] [hudi] Mulavar commented on pull request #8529: [HUDI-6120]filter base file when there is only one file slice fetched

2023-04-26 Thread via GitHub
Mulavar commented on PR #8529: URL: https://github.com/apache/hudi/pull/8529#issuecomment-1522869185 I agreed with you that IncrementalInputSplits has no error with this patch, and I made a mistake about it. However I'm thinking about whether the logic of this method itself is correct,

[hudi] branch feature-hudi-console updated: [Improve] how to build updated (#8578)

2023-04-26 Thread forwardxu
This is an automated email from the ASF dual-hosted git repository. forwardxu pushed a commit to branch feature-hudi-console in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/feature-hudi-console by this push: new 29f88e9ba49

[GitHub] [hudi] XuQianJin-Stars merged pull request #8578: [Improve] hudi-console module how to build updated

2023-04-26 Thread via GitHub
XuQianJin-Stars merged PR #8578: URL: https://github.com/apache/hudi/pull/8578 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] wolfboys opened a new pull request, #8578: [Improve] how to build updated

2023-04-26 Thread via GitHub
wolfboys opened a new pull request, #8578: URL: https://github.com/apache/hudi/pull/8578 [Improve] how to build updated -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] hudi-bot commented on pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
hudi-bot commented on PR #8503: URL: https://github.com/apache/hudi/pull/8503#issuecomment-1522863540 ## CI report: * 0738d975df341763e384b9ac9bcad14b006c9c47 UNKNOWN * 5f32d219b15c32363f5886dcc3787a19aabd4b7b Azure:

[GitHub] [hudi] ad1happy2go commented on issue #8509: [SUPPORT] Support MetaSync Even there is no new data, Just create the table if not exist in DeltaStream

2023-04-26 Thread via GitHub
ad1happy2go commented on issue #8509: URL: https://github.com/apache/hudi/issues/8509#issuecomment-1522858391 @Amar1404 What input source you are using. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [hudi] hudi-bot commented on pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
hudi-bot commented on PR #8503: URL: https://github.com/apache/hudi/pull/8503#issuecomment-1522855906 ## CI report: * 0738d975df341763e384b9ac9bcad14b006c9c47 UNKNOWN * 6302a97fd391f4823c08d0c5f945719aa5457757 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #8452: [HUDI-6077] Add more partition push down filters

2023-04-26 Thread via GitHub
hudi-bot commented on PR #8452: URL: https://github.com/apache/hudi/pull/8452#issuecomment-1522855713 ## CI report: * f751e8281b606c4ae2c43c3aefc5be2cd2cbea66 Azure:

[GitHub] [hudi] rohan-uptycs commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
rohan-uptycs commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177394054 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] rohan-uptycs commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
rohan-uptycs commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177394054 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] rohan-uptycs commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
rohan-uptycs commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177389386 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] rmahindra123 commented on a diff in pull request #8514: [HUDI-6113] Support multiple transformers using the same config keys in DeltaStreamer

2023-04-26 Thread via GitHub
rmahindra123 commented on code in PR #8514: URL: https://github.com/apache/hudi/pull/8514#discussion_r1177389358 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java: ## @@ -19,24 +19,61 @@ package org.apache.hudi.utilities.transform;

[GitHub] [hudi] rmahindra123 commented on a diff in pull request #8514: [HUDI-6113] Support multiple transformers using the same config keys in DeltaStreamer

2023-04-26 Thread via GitHub
rmahindra123 commented on code in PR #8514: URL: https://github.com/apache/hudi/pull/8514#discussion_r1177388252 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java: ## @@ -19,24 +19,73 @@ package org.apache.hudi.utilities.transform;

[GitHub] [hudi] rmahindra123 commented on a diff in pull request #8514: [HUDI-6113] Support multiple transformers using the same config keys in DeltaStreamer

2023-04-26 Thread via GitHub
rmahindra123 commented on code in PR #8514: URL: https://github.com/apache/hudi/pull/8514#discussion_r1177386451 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java: ## @@ -19,24 +19,73 @@ package org.apache.hudi.utilities.transform;

[GitHub] [hudi] rohan-uptycs commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
rohan-uptycs commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177386709 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] rmahindra123 commented on a diff in pull request #8514: [HUDI-6113] Support multiple transformers using the same config keys in DeltaStreamer

2023-04-26 Thread via GitHub
rmahindra123 commented on code in PR #8514: URL: https://github.com/apache/hudi/pull/8514#discussion_r1177386451 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java: ## @@ -19,24 +19,73 @@ package org.apache.hudi.utilities.transform;

[GitHub] [hudi] rohan-uptycs commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
rohan-uptycs commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177385951 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] SteNicholas commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
SteNicholas commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177385692 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] SteNicholas commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
SteNicholas commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177385296 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

[GitHub] [hudi] SteNicholas commented on a diff in pull request #8503: [HUDI-6047] Clustering operation on consistent hashing index resulting in duplicate data

2023-04-26 Thread via GitHub
SteNicholas commented on code in PR #8503: URL: https://github.com/apache/hudi/pull/8503#discussion_r1177384033 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -509,7 +509,15 @@ private Stream

<    1   2   3