Re: [I] commits_.archive is not move to archived folder [hudi]

2023-10-12 Thread via GitHub
njalan closed issue #9812: commits_.archive is not move to archived folder URL: https://github.com/apache/hudi/issues/9812 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] [HUDI-6924] Fix hoodie table config not wok in table properties [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9836: URL: https://github.com/apache/hudi/pull/9836#issuecomment-1759762568 ## CI report: * ec64b99d8525d88a1844e434000d83d06e74fe75 Azure:

Re: [PR] [HUDI-6924] Fix hoodie table config not wok in table properties [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9836: URL: https://github.com/apache/hudi/pull/9836#issuecomment-1759744758 ## CI report: * ec64b99d8525d88a1844e434000d83d06e74fe75 Azure:

Re: [PR] [HUDI-6878] Fix Overwrite error when ingest multiple tables [hudi]

2023-10-12 Thread via GitHub
jonvex commented on PR #9749: URL: https://github.com/apache/hudi/pull/9749#issuecomment-1759735573 @stream2000 could you write a test or provide a some sample code to trigger this issue? I'm a little unclear if this is solving a race condition or something else -- This is an automated

Re: [I] [SUPPORT] persist write status RDD in spark compaction job caused the resources could not be released in time [hudi]

2023-10-12 Thread via GitHub
KnightChess commented on issue #9591: URL: https://github.com/apache/hudi/issues/9591#issuecomment-1759690720 Is this problem solved? @beyond1920 @yihua I think delete the cache code can solve this problem -- This is an automated message from the Apache Git Service. To respond to the

Re: [PR] [HUDI-6924] Fix hoodie table config not wok in table properties [hudi]

2023-10-12 Thread via GitHub
wecharyu commented on code in PR #9836: URL: https://github.com/apache/hudi/pull/9836#discussion_r1356846519 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/spark/sql/hudi/HoodieOptionConfig.scala: ## @@ -199,7 +184,7 @@ object HoodieOptionConfig { //

Re: [PR] [HUDI-6480] Flink support non-blocking concurrency control [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9850: URL: https://github.com/apache/hudi/pull/9850#issuecomment-1759544663 ## CI report: * 72aebcc59f5ebebc64402dc8d1d9a491474b1dd0 Azure:

[jira] [Closed] (HUDI-6809) Optimizing the judgment of generating clustering plans

2023-10-12 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-6809. Resolution: Fixed Fixed via master branch: 205d5039cef59e9c10403501e8773b66b62517e0 > Optimizing the

[jira] [Updated] (HUDI-6809) Optimizing the judgment of generating clustering plans

2023-10-12 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-6809: - Fix Version/s: 1.0.0 > Optimizing the judgment of generating clustering plans >

[hudi] branch master updated (3108af46061 -> 205d5039cef)

2023-10-12 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 3108af46061 [HUDI-6917] Fix docker integ tests (#9843) add 205d5039cef [HUDI-6809] Optimizing the judgment of

Re: [PR] [HUDI-6809] Optimizing the judgment of generating clustering plans [hudi]

2023-10-12 Thread via GitHub
danny0405 merged PR #9585: URL: https://github.com/apache/hudi/pull/9585 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] [HUDI-6480] Flink lockless multi-writer [hudi]

2023-10-12 Thread via GitHub
danny0405 closed pull request #9125: [HUDI-6480] Flink lockless multi-writer URL: https://github.com/apache/hudi/pull/9125 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] [HUDI-6480] Flink support non-blocking concurrency control [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9850: URL: https://github.com/apache/hudi/pull/9850#issuecomment-1759527066 ## CI report: * 72aebcc59f5ebebc64402dc8d1d9a491474b1dd0 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread kwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kwang updated HUDI-6937: Fix Version/s: (was: 0.14.1) > CopyOnWriteInsertHandler#consume will cause clustering performance degradation >

[jira] [Updated] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread kwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kwang updated HUDI-6937: Description: We upgraded Hudi from 0.12 to 0.14, and found that the offline clustering performance dropped by half.

[jira] [Updated] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread kwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kwang updated HUDI-6937: Description: We upgraded Hudi from 0.12 to 0.14, and found that the offline clustering performance dropped by half.

[jira] [Updated] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-6937: -- Fix Version/s: 0.14.1 (was: 0.14.0) > CopyOnWriteInsertHandler#consume will

[jira] [Updated] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread kwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kwang updated HUDI-6937: Attachment: hudi-0.12-flamegraph.png hudi-0.12-log.png hudi-0.14-flamegraph.png

[jira] [Updated] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread kwang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kwang updated HUDI-6937: Component/s: spark (was: hudi-utilities) > CopyOnWriteInsertHandler#consume will cause

[PR] [HUDI-6480] Flink support non-blocking concurrency control [hudi]

2023-10-12 Thread via GitHub
beyond1920 opened a new pull request, #9850: URL: https://github.com/apache/hudi/pull/9850 ### Change Logs Since [#9776](https://github.com/apache/hudi/pull/9776) is merged, this pr aims to support multiple streaming writers into the same MOR table with bucket index. Set the

[jira] [Created] (HUDI-6937) CopyOnWriteInsertHandler#consume will cause clustering performance degradation

2023-10-12 Thread kwang (Jira)
kwang created HUDI-6937: --- Summary: CopyOnWriteInsertHandler#consume will cause clustering performance degradation Key: HUDI-6937 URL: https://issues.apache.org/jira/browse/HUDI-6937 Project: Apache Hudi

[hudi] branch master updated: [HUDI-6917] Fix docker integ tests (#9843)

2023-10-12 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 3108af46061 [HUDI-6917] Fix docker integ tests

Re: [PR] [HUDI-6917] Fix docker integ tests [hudi]

2023-10-12 Thread via GitHub
codope merged PR #9843: URL: https://github.com/apache/hudi/pull/9843 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] [HUDI-6809] Optimizing the judgment of generating clustering plans [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9585: URL: https://github.com/apache/hudi/pull/9585#issuecomment-1759443245 ## CI report: * 8b20991bc60e247d8d581c671cdae45a0a24368d Azure:

Re: [I] [SUPPORT] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
danny0405 closed issue #9557: [SUPPORT] CDC file clean not work URL: https://github.com/apache/hudi/issues/9557 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[jira] [Closed] (HUDI-6927) CDC file clean not work

2023-10-12 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-6927. Resolution: Fixed Fixed via master branch: af4c5abd123dbea476377aaa1996640fc39142ed > CDC file clean not

Re: [I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
cmmp6 commented on issue #9849: URL: https://github.com/apache/hudi/issues/9849#issuecomment-1759438345 Seems option hoodie.compact.inline will open inline compaction in flink pipeline and do compaction in JobManager. -- This is an automated message from the Apache Git Service. To

[hudi] branch master updated: [HUDI-6927] CDC file clean not work (#9841)

2023-10-12 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new af4c5abd123 [HUDI-6927] CDC file clean not

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
danny0405 merged PR #9841: URL: https://github.com/apache/hudi/pull/9841 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] [HUDI-2141] Support flink stream write metrics [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9118: URL: https://github.com/apache/hudi/pull/9118#issuecomment-1759430102 ## CI report: * f6d7dd97c73898206da91b17144326a7dbbffae8 UNKNOWN * c62db1fdf94ee2c1f9b9e539f7a4b1bb866beb7e UNKNOWN * 4b4d15361b3096e27cc3c54c3347c2cf9224c895 Azure:

Re: [I] [SUPPORT] Flink - Async Compaction Not Triggered With time_elapsed as COMPACTION_TRIGGER_STRATEGY [hudi]

2023-10-12 Thread via GitHub
ertanden commented on issue #9201: URL: https://github.com/apache/hudi/issues/9201#issuecomment-1759424950 I think I have the same issue in local Docker setup with `Flink 1.15.2` and `Hudi 0.14.0`. Source is Kafka upsert topic. ``` 'compaction.trigger.strategy' =

Re: [I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
cmmp6 commented on issue #9849: URL: https://github.com/apache/hudi/issues/9849#issuecomment-1759389213 ![image](https://github.com/apache/hudi/assets/11750215/0788bfc5-f060-4b7f-9aac-0b06b97315d4) I make a mistake. I configure the table with option compaction.async.enabled=false and

Re: [I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
danny0405 commented on issue #9849: URL: https://github.com/apache/hudi/issues/9849#issuecomment-1759363717 you can disable the compaction with option `compaction.async.enabled` = `false` -- This is an automated message from the Apache Git Service. To respond to the message, please log

Re: [I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
cmmp6 commented on issue #9849: URL: https://github.com/apache/hudi/issues/9849#issuecomment-1759360333 > It seems you already disable the online compaction in the pipeline, but why there are logs of compaction in the pipeline, I'm confused. y. I'm confused too. And it's wired that

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9841: URL: https://github.com/apache/hudi/pull/9841#issuecomment-1759346057 ## CI report: * 7cc75814a8d31fe885ce7538c963a40691d06a5c UNKNOWN * f5d5f74b8c790e16895fe2537e3a95618b98a5f8 Azure:

Re: [I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
danny0405 commented on issue #9849: URL: https://github.com/apache/hudi/issues/9849#issuecomment-1759341901 It seems you already disable the online compaction in the pipeline, but why there are logs of compaction in the pipeline, I'm confused. -- This is an automated message from the

Re: [I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
cmmp6 commented on issue #9849: URL: https://github.com/apache/hudi/issues/9849#issuecomment-1759281595 @danny0405 Could u help? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[I] [SUPPORT]Hudi file compaction in JobManager blocks flink job checkpoint even using offline compaction [hudi]

2023-10-12 Thread via GitHub
cmmp6 opened a new issue, #9849: URL: https://github.com/apache/hudi/issues/9849 **Describe the problem you faced** 1. using Flink job to consume kafka data and upsert to Hudi (MOR table, bucket index). 2. flink job checkpoint is blocked and timeout finally. 3. find file compaction

Re: [I] [SUPPORT] Unstable Execution Time and Many RequestHandler WARN Logs [hudi]

2023-10-12 Thread via GitHub
danny0405 commented on issue #8100: URL: https://github.com/apache/hudi/issues/8100#issuecomment-1759271106 Got it, you can have a try. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] [HUDI-2141] Support flink stream write metrics [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9118: URL: https://github.com/apache/hudi/pull/9118#issuecomment-1759269289 ## CI report: * f6d7dd97c73898206da91b17144326a7dbbffae8 UNKNOWN * c62db1fdf94ee2c1f9b9e539f7a4b1bb866beb7e UNKNOWN * 4b4d15361b3096e27cc3c54c3347c2cf9224c895 Azure:

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9841: URL: https://github.com/apache/hudi/pull/9841#issuecomment-1759255846 ## CI report: * 7cc75814a8d31fe885ce7538c963a40691d06a5c UNKNOWN * 4e95a02ebb3d482ae0c6bf1c94ec96e8a6658674 Azure:

Re: [PR] [HUDI-6809] Optimizing the judgment of generating clustering plans [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9585: URL: https://github.com/apache/hudi/pull/9585#issuecomment-1759254119 ## CI report: * 1e4eb20f9bc3fad10ae7869c5132b671cbcbf344 Azure:

Re: [PR] [HUDI-6872] Test out of box schema evolution for deltastreamer [hudi]

2023-10-12 Thread via GitHub
codope commented on code in PR #9743: URL: https://github.com/apache/hudi/pull/9743#discussion_r1356499698 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/AvroConversionUtils.scala: ## @@ -202,7 +202,16 @@ object AvroConversionUtils { } case

Re: [PR] [HUDI-2141] Support flink stream write metrics [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9118: URL: https://github.com/apache/hudi/pull/9118#issuecomment-1759253106 ## CI report: * f6d7dd97c73898206da91b17144326a7dbbffae8 UNKNOWN * c62db1fdf94ee2c1f9b9e539f7a4b1bb866beb7e UNKNOWN * 4b4d15361b3096e27cc3c54c3347c2cf9224c895 UNKNOWN

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9841: URL: https://github.com/apache/hudi/pull/9841#issuecomment-1759188189 ## CI report: * 7cc75814a8d31fe885ce7538c963a40691d06a5c UNKNOWN * 4e95a02ebb3d482ae0c6bf1c94ec96e8a6658674 Azure:

Re: [PR] [HUDI-6809] Optimizing the judgment of generating clustering plans [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9585: URL: https://github.com/apache/hudi/pull/9585#issuecomment-1759187310 ## CI report: * 1e4eb20f9bc3fad10ae7869c5132b671cbcbf344 Azure:

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9841: URL: https://github.com/apache/hudi/pull/9841#issuecomment-1759173867 ## CI report: * 7cc75814a8d31fe885ce7538c963a40691d06a5c UNKNOWN * 4e95a02ebb3d482ae0c6bf1c94ec96e8a6658674 Azure:

Re: [PR] [HUDI-2141] Support flink stream write metrics [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9118: URL: https://github.com/apache/hudi/pull/9118#issuecomment-1759172124 ## CI report: * f6d7dd97c73898206da91b17144326a7dbbffae8 UNKNOWN * c62db1fdf94ee2c1f9b9e539f7a4b1bb866beb7e UNKNOWN Bot commands @hudi-bot supports the

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9841: URL: https://github.com/apache/hudi/pull/9841#issuecomment-1759159632 ## CI report: * 7cc75814a8d31fe885ce7538c963a40691d06a5c UNKNOWN * 4e95a02ebb3d482ae0c6bf1c94ec96e8a6658674 Azure:

Re: [PR] [HUDI-6786] HoodieFileGroupReader integration [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9819: URL: https://github.com/apache/hudi/pull/9819#issuecomment-1759159511 ## CI report: * a4985db0ce22fb4b4f2518ed70bd96890024a08b UNKNOWN * 3402439aad67a67eaf72fe0dff27cfaf8d1bb80e Azure:

Re: [PR] [HUDI-2141] Support flink stream write metrics [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9118: URL: https://github.com/apache/hudi/pull/9118#issuecomment-1759157812 ## CI report: * f6d7dd97c73898206da91b17144326a7dbbffae8 UNKNOWN * c62db1fdf94ee2c1f9b9e539f7a4b1bb866beb7e UNKNOWN Bot commands @hudi-bot supports the

Re: [PR] [HUDI-6927] CDC file clean not work [hudi]

2023-10-12 Thread via GitHub
zhuanshenbsj1 commented on PR #9841: URL: https://github.com/apache/hudi/pull/9841#issuecomment-1759151735 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [I] [SUPPORT] Unstable Execution Time and Many RequestHandler WARN Logs [hudi]

2023-10-12 Thread via GitHub
lovemylover042 commented on issue #8100: URL: https://github.com/apache/hudi/issues/8100#issuecomment-1759119020 > You can always refresh the fs view but the refreshing itself is costly. But it better than every task init local file system view. That would be very slowly. -- This

Re: [PR] [HUDI-2141] Support flink stream write metrics [hudi]

2023-10-12 Thread via GitHub
danny0405 commented on PR #9118: URL: https://github.com/apache/hudi/pull/9118#issuecomment-1759104229 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [I] [SUPPORT] Unstable Execution Time and Many RequestHandler WARN Logs [hudi]

2023-10-12 Thread via GitHub
danny0405 commented on issue #8100: URL: https://github.com/apache/hudi/issues/8100#issuecomment-1759099553 You can always refresh the fs view but the refreshing itself is costly. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

Re: [I] [SUPPORT] Unstable Execution Time and Many RequestHandler WARN Logs [hudi]

2023-10-12 Thread via GitHub
lovemylover042 commented on issue #8100: URL: https://github.com/apache/hudi/issues/8100#issuecomment-1759086656 > Did you ever try the latest release, the fs view should perform better. Sorry but i had just upgraded to 0.10.1 from 0.8.0 recently. I will concern upgrading to 0.14 at

Re: [I] [SUPPORT] Enable metadata table, Spark write mor table duplicate data [hudi]

2023-10-12 Thread via GitHub
liujinhui1994 commented on issue #9714: URL: https://github.com/apache/hudi/issues/9714#issuecomment-1759014656 @ad1happy2go That's really strange -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] [HUDI-6786] HoodieFileGroupReader integration [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9819: URL: https://github.com/apache/hudi/pull/9819#issuecomment-1759011099 ## CI report: * a4985db0ce22fb4b4f2518ed70bd96890024a08b UNKNOWN * 85bf27abe36ef2a6500ed323e64d6598649c95c2 Azure:

Re: [PR] [HUDI-6786] HoodieFileGroupReader integration [hudi]

2023-10-12 Thread via GitHub
hudi-bot commented on PR #9819: URL: https://github.com/apache/hudi/pull/9819#issuecomment-1758999054 ## CI report: * a4985db0ce22fb4b4f2518ed70bd96890024a08b UNKNOWN * 85bf27abe36ef2a6500ed323e64d6598649c95c2 Azure:

<    1   2