[GitHub] [hudi] hudi-bot commented on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975214170 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975209724 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 Azure:

[GitHub] [hudi] dongkelun commented on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
dongkelun commented on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975213728 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] hudi-bot commented on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975211077 ## CI report: * 2be004c0215c5e99579e0b554e3c57a33d3d4776 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975209646 ## CI report: * 02651d763e6bf79a001e0bc24be280519acd2462 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975180315 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975209724 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975072189 ## CI report: * 02651d763e6bf79a001e0bc24be280519acd2462 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975209646 ## CI report: * 02651d763e6bf79a001e0bc24be280519acd2462 Azure:

[GitHub] [hudi] veenaypatil commented on issue #4017: [SUPPORT] ETL failure , Caused by: java.io.FileNotFoundException: No such file or directory

2021-11-21 Thread GitBox
veenaypatil commented on issue #4017: URL: https://github.com/apache/hudi/issues/4017#issuecomment-975202982 @xushiyan we were on 2.3.2 version on older cluster, on the new one it is 3.0.2 where it worked. I am closing this issue as the ETL is working is working after migrating to 3.x

[GitHub] [hudi] veenaypatil closed issue #4017: [SUPPORT] ETL failure , Caused by: java.io.FileNotFoundException: No such file or directory

2021-11-21 Thread GitBox
veenaypatil closed issue #4017: URL: https://github.com/apache/hudi/issues/4017 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot commented on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975193320 ## CI report: * 2bcd66d567ba9fab68a33d0419ed9d6e707ff168 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975188012 ## CI report: * 2bcd66d567ba9fab68a33d0419ed9d6e707ff168 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975186138 ## CI report: * 403dbd73c6e9c6a6c645e5ef26a7c92d0e19e629 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975188012 ## CI report: * 2bcd66d567ba9fab68a33d0419ed9d6e707ff168 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975186138 ## CI report: * 403dbd73c6e9c6a6c645e5ef26a7c92d0e19e629 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975184667 ## CI report: * 403dbd73c6e9c6a6c645e5ef26a7c92d0e19e629 Azure:

[GitHub] [hudi] nikita-sheremet-clearscale commented on issue #4062: [SUPPORT] How debug hive sync?

2021-11-21 Thread GitBox
nikita-sheremet-clearscale commented on issue #4062: URL: https://github.com/apache/hudi/issues/4062#issuecomment-975185576 There are log statements in hudi sources like: `Starting commit :` but it does not appear in logs. Is there a way to turn it on? -- This is an automated message

[GitHub] [hudi] hudi-bot removed a comment on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-973711926 ## CI report: * 403dbd73c6e9c6a6c645e5ef26a7c92d0e19e629 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3857: URL: https://github.com/apache/hudi/pull/3857#issuecomment-975184667 ## CI report: * 403dbd73c6e9c6a6c645e5ef26a7c92d0e19e629 Azure:

[GitHub] [hudi] yihua commented on a change in pull request #3857: [HUDI-2332] Add clustering and compaction in Kafka Connect Sink

2021-11-21 Thread GitBox
yihua commented on a change in pull request #3857: URL: https://github.com/apache/hudi/pull/3857#discussion_r753972409 ## File path: hudi-client/hudi-java-client/src/main/java/org/apache/hudi/client/clustering/run/strategy/JavaExecutionStrategy.java ## @@ -0,0 +1,245 @@ +/* +

[GitHub] [hudi] hudi-bot removed a comment on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975156126 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975182486 ## CI report: * 76e79070afacbcf0691c8839de8cd1703b2c97ff Azure:

[jira] [Commented] (HUDI-2430) Make decimal compatible with hudi for flink writer

2021-11-21 Thread Shu Li Zheng (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17447213#comment-17447213 ] Shu Li Zheng commented on HUDI-2430: [~danny0405],hello danny,i found a compatilbe problem ,when i use

[GitHub] [hudi] hudi-bot removed a comment on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975179313 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975180315 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4064: URL: https://github.com/apache/hudi/pull/4064#issuecomment-975179313 ## CI report: * 1efb70bdd2759b28e1e2de07a0f08ac70a429702 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] dongkelun opened a new pull request #4064: [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer'

2021-11-21 Thread GitBox
dongkelun opened a new pull request #4064: URL: https://github.com/apache/hudi/pull/4064 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[jira] [Commented] (HUDI-2500) Spark datasource delete not working on Spark SQL created table

2021-11-21 Thread Yann Byron (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17447205#comment-17447205 ] Yann Byron commented on HUDI-2500: -- [~xushiyan]  if write by dataframe without database and table

[GitHub] [hudi] xiarixiaoyao commented on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
xiarixiaoyao commented on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975161713 @vinothchandar @leesf @alexeykudinkin could we merge this patch to master? this patch can solve most of the problems in #4026 and #4060 -- This is an automated message

[GitHub] [hudi] xiarixiaoyao commented on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
xiarixiaoyao commented on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975160541 @leesf we can build indexes for Dataskipping Manually。 step1: we can use ZCurveOptimizeHelper.getMinMaxValue to get min-max statistics info for current table ste2:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975137993 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975156126 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975113891 ## CI report: * db846abe427bccb49dc4e32229bf7866766c5aba Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975148069 ## CI report: * 564251f48f363cadd09273a0692c0922b562592e Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975135488 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975137993 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975120249 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975135488 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975117985 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975120249 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4063: URL: https://github.com/apache/hudi/pull/4063#issuecomment-975117985 ## CI report: * ec415ec26674d5c0454cd7102a8c6f434e7aa8e8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] rmahindra123 opened a new pull request #4063: [HUDI-1290] Add Debezium Source for deltastreamer

2021-11-21 Thread GitBox
rmahindra123 opened a new pull request #4063: URL: https://github.com/apache/hudi/pull/4063 ## What is the purpose of the pull request Add Debezium Source for deltastreamer, allowing deltastreamer users to ingest the debezium change log records from kafka for Postgres DB. ##

[GitHub] [hudi] hudi-bot commented on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975113891 ## CI report: * db846abe427bccb49dc4e32229bf7866766c5aba Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975061051 ## CI report: * db846abe427bccb49dc4e32229bf7866766c5aba Azure:

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2021-11-21 Thread Rajesh Mahindra (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Mahindra updated HUDI-1290: -- Status: Patch Available (was: In Progress) > Implement Debezium avro source for Delta Streamer

[GitHub] [hudi] hudi-bot commented on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975109801 ## CI report: * 7dcf6c5957115a3fb4bf84decd21d6b2792be857 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975060958 ## CI report: * e95361f7e109251511059817b7bc12591cd1671a Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975070619 ## CI report: * d9567364d2535e68fbbc49e97bc2ee10fd2f5efd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975072189 ## CI report: * 02651d763e6bf79a001e0bc24be280519acd2462 Azure:

[jira] [Updated] (HUDI-1870) Move spark avro serialization class into hudi repo

2021-11-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1870: - Labels: pull-request-available sev:critical (was: sev:critical) > Move spark avro serialization

[GitHub] [hudi] hudi-bot commented on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975070619 ## CI report: * d9567364d2535e68fbbc49e97bc2ee10fd2f5efd Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4022: [HUDI-1870] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975014921 ## CI report: * d9567364d2535e68fbbc49e97bc2ee10fd2f5efd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975061051 ## CI report: * db846abe427bccb49dc4e32229bf7866766c5aba Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975058512 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975058276 ## CI report: * e95361f7e109251511059817b7bc12591cd1671a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975060958 ## CI report: * e95361f7e109251511059817b7bc12591cd1671a Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975053218 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975058512 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-974773010 ## CI report: * e95361f7e109251511059817b7bc12591cd1671a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-975058276 ## CI report: * e95361f7e109251511059817b7bc12591cd1671a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975053218 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975049092 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-974623589 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4054: [MINOR] Optimize imports and delete useless or duplicate imports

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4054: URL: https://github.com/apache/hudi/pull/4054#issuecomment-975049092 ## CI report: * 6ee1ea4b43d9da455c7b79dbbc93bd23200240ec Azure:

[GitHub] [hudi] boneanxs commented on pull request #4014: [HUDI-2779] Cache BaseDir if HudiTableNotFound Exception thrown

2021-11-21 Thread GitBox
boneanxs commented on pull request #4014: URL: https://github.com/apache/hudi/pull/4014#issuecomment-975041971 @codope, @nsivabalan, gentle ping... Could you pls take a look, also cc @vinothchandar , maybe you can give us more inputs as you implemented this PathFilter :D -- This is an

[jira] [Commented] (HUDI-2799) Fix the classloader of flink write task

2021-11-21 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17447178#comment-17447178 ] Danny Chen commented on HUDI-2799: -- Fixed via master branch: 8281cbf7624c3a4eb90bf58671daf76843d00819 >

[jira] [Resolved] (HUDI-2799) Fix the classloader of flink write task

2021-11-21 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-2799. -- > Fix the classloader of flink write task > --- > >

[hudi] branch master updated (2533a9c -> 8281cbf)

2021-11-21 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 2533a9c [MINOR] Fix typos (#4053) add 8281cbf [HUDI-2799] Fix the classloader of flink write task (#4042)

[GitHub] [hudi] danny0405 merged pull request #4042: [HUDI-2799] Fix the classloader of flink write task

2021-11-21 Thread GitBox
danny0405 merged pull request #4042: URL: https://github.com/apache/hudi/pull/4042 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot commented on pull request #3519: [DO NOT MERGE] 0.9.0 release patch for flink

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3519: URL: https://github.com/apache/hudi/pull/3519#issuecomment-975016697 ## CI report: * d022aa7a5bd94492c7c3e96dc5b1288268520087 UNKNOWN * ba6a7000bdde9fa7a786a93912aa9cda05e00d21 UNKNOWN * 621ac47b3c6bc2726bd3b999f4d711f600bf8d60

[GitHub] [hudi] hudi-bot removed a comment on pull request #3519: [DO NOT MERGE] 0.9.0 release patch for flink

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3519: URL: https://github.com/apache/hudi/pull/3519#issuecomment-975009550 ## CI report: * d022aa7a5bd94492c7c3e96dc5b1288268520087 UNKNOWN * ba6a7000bdde9fa7a786a93912aa9cda05e00d21 UNKNOWN *

[GitHub] [hudi] hudi-bot removed a comment on pull request #4022: [MINOR] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975012824 ## CI report: * d716f0d7104e13727e769d859767ef343cd45407 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4022: [MINOR] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975014921 ## CI report: * d9567364d2535e68fbbc49e97bc2ee10fd2f5efd Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4022: [MINOR] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-972246932 ## CI report: * d716f0d7104e13727e769d859767ef343cd45407 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4022: [MINOR] Add CI build task for spark 3.0.x

2021-11-21 Thread GitBox
hudi-bot commented on pull request #4022: URL: https://github.com/apache/hudi/pull/4022#issuecomment-975012824 ## CI report: * d716f0d7104e13727e769d859767ef343cd45407 Azure:

[GitHub] [hudi] mincwang commented on pull request #4056: [HUDI-2808] Supports deduplication for streaming write

2021-11-21 Thread GitBox
mincwang commented on pull request #4056: URL: https://github.com/apache/hudi/pull/4056#issuecomment-975011368 cc @danny0405 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[hudi] branch asf-site updated: [MINOR] Fix RocketMQ logo in landing page (#4061)

2021-11-21 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new c57cc91 [MINOR] Fix RocketMQ logo in

[GitHub] [hudi] yanghua merged pull request #4061: [MINOR] Fix RocketMQ logo in landing page

2021-11-21 Thread GitBox
yanghua merged pull request #4061: URL: https://github.com/apache/hudi/pull/4061 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3519: [DO NOT MERGE] 0.9.0 release patch for flink

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3519: URL: https://github.com/apache/hudi/pull/3519#issuecomment-968507785 ## CI report: * d022aa7a5bd94492c7c3e96dc5b1288268520087 UNKNOWN * ba6a7000bdde9fa7a786a93912aa9cda05e00d21 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3519: [DO NOT MERGE] 0.9.0 release patch for flink

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3519: URL: https://github.com/apache/hudi/pull/3519#issuecomment-975009550 ## CI report: * d022aa7a5bd94492c7c3e96dc5b1288268520087 UNKNOWN * ba6a7000bdde9fa7a786a93912aa9cda05e00d21 UNKNOWN * 870dd79b2dd00f9cf5ff3bed715fcfc35c122d09

[GitHub] [hudi] hudi-bot removed a comment on pull request #3991: [HUDI-2737] Use earliest instant for async compaction and clustering jobs

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3991: URL: https://github.com/apache/hudi/pull/3991#issuecomment-974922055 ## CI report: * 6a1b37be3d0e4bd361040b4e1dc59bd3274eb5c3 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3991: [HUDI-2737] Use earliest instant for async compaction and clustering jobs

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3991: URL: https://github.com/apache/hudi/pull/3991#issuecomment-974954149 ## CI report: * 2a40db3adb4f3f972f45a84e892164d78b6121b1 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3991: [HUDI-2737] Use earliest instant for async compaction and clustering jobs

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3991: URL: https://github.com/apache/hudi/pull/3991#issuecomment-974922055 ## CI report: * 6a1b37be3d0e4bd361040b4e1dc59bd3274eb5c3 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3991: [HUDI-2737] Use earliest instant for async compaction and clustering jobs

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3991: URL: https://github.com/apache/hudi/pull/3991#issuecomment-974921664 ## CI report: * 6a1b37be3d0e4bd361040b4e1dc59bd3274eb5c3 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3991: [HUDI-2737] Use earliest instant for async compaction and clustering jobs

2021-11-21 Thread GitBox
hudi-bot commented on pull request #3991: URL: https://github.com/apache/hudi/pull/3991#issuecomment-974921664 ## CI report: * 6a1b37be3d0e4bd361040b4e1dc59bd3274eb5c3 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3991: [HUDI-2737] Use earliest instant for async compaction and clustering jobs

2021-11-21 Thread GitBox
hudi-bot removed a comment on pull request #3991: URL: https://github.com/apache/hudi/pull/3991#issuecomment-973670617 ## CI report: * 6a1b37be3d0e4bd361040b4e1dc59bd3274eb5c3 Azure:

[GitHub] [hudi] nikita-sheremet-clearscale opened a new issue #4062: [SUPPORT] How debug hyve sync?

2021-11-21 Thread GitBox
nikita-sheremet-clearscale opened a new issue #4062: URL: https://github.com/apache/hudi/issues/4062 **To Reproduce** Steps to reproduce the behavior: 1. read parquet files from s3 prefix 2. write parquet files to s3 prefix with enable-hive-sync option **Expected

[GitHub] [hudi] Limess edited a comment on issue #3933: [SUPPORT] Large amount of disk spill on initial upsert/bulk insert

2021-11-21 Thread GitBox
Limess edited a comment on issue #3933: URL: https://github.com/apache/hudi/issues/3933#issuecomment-974883342 Thanks! We're using bulk insert for this job and are happy with the performance vs regular upsert. Re: parallelism, we bumped this up after: 1. Reading

[GitHub] [hudi] Limess commented on issue #3933: [SUPPORT] Large amount of disk spill on initial upsert/bulk insert

2021-11-21 Thread GitBox
Limess commented on issue #3933: URL: https://github.com/apache/hudi/issues/3933#issuecomment-974883342 Thanks! We're using bulk insert for this job and are happy with the performance vs regular upsert. Re: parallelism, we bumped this up after: 1. Reading

[GitHub] [hudi] nsivabalan commented on issue #3854: [SUPPORT] Lower performance using 0.9.0 vs 0.8.0

2021-11-21 Thread GitBox
nsivabalan commented on issue #3854: URL: https://github.com/apache/hudi/issues/3854#issuecomment-974870498 Hey hi. Can you give it a try with open source across two versions. @umehrot2 : Can you chime in wrt EMR spark versions. Is there any performance patches expected for hudi 0.9.0

[GitHub] [hudi] nsivabalan commented on issue #3394: [SUPPORT] Question on hudi's default behaviour for UPSERT

2021-11-21 Thread GitBox
nsivabalan commented on issue #3394: URL: https://github.com/apache/hudi/issues/3394#issuecomment-974819625 can you try setting `hoodie.datasource.write.precombine.field`. It should get applied to `hoodie.payload.ordering.field`. -- This is an automated message from the Apache Git

[GitHub] [hudi] leesf commented on a change in pull request #4053: [MINOR] Fix typos

2021-11-21 Thread GitBox
leesf commented on a change in pull request #4053: URL: https://github.com/apache/hudi/pull/4053#discussion_r753736718 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/HiveIncrementalPuller.java ## @@ -106,14 +106,14 @@ private Connection connection;

[GitHub] [hudi] leesf commented on a change in pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
leesf commented on a change in pull request #4013: URL: https://github.com/apache/hudi/pull/4013#discussion_r753737063 ## File path: hudi-common/src/main/java/org/apache/hudi/common/model/HoodieColumnRangeMetadata.java ## @@ -30,16 +28,21 @@ private final String

[GitHub] [hudi] mincwang commented on pull request #3703: [HUDI-2480] FileSlice after pending compaction-requested instant-time…

2021-11-21 Thread GitBox
mincwang commented on pull request #3703: URL: https://github.com/apache/hudi/pull/3703#issuecomment-974790661 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] nsivabalan commented on issue #4043: [SUPPORT] java.lang.ClassCastException: java.lang.String cannot be cast to org.apache.spark.sql.Row error when writing particular source data after

2021-11-21 Thread GitBox
nsivabalan commented on issue #4043: URL: https://github.com/apache/hudi/issues/4043#issuecomment-974824144 May I know whats the new column you are adding just in writer2? In desc you are describing as `_hoodie_deleted_date`, but I don't see any such field in your target table schema. may

[GitHub] [hudi] garyli1019 commented on pull request #3703: [HUDI-2480] FileSlice after pending compaction-requested instant-time…

2021-11-21 Thread GitBox
garyli1019 commented on pull request #3703: URL: https://github.com/apache/hudi/pull/3703#issuecomment-974789158 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [hudi] alexeykudinkin edited a comment on pull request #4013: [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs

2021-11-21 Thread GitBox
alexeykudinkin edited a comment on pull request #4013: URL: https://github.com/apache/hudi/pull/4013#issuecomment-974745454 @xiarixiaoyao thanks for addressing the issues! After our testing we've also tried to squash some bugs in https://github.com/apache/hudi/pull/4026 and

[GitHub] [hudi] nikita-sheremet-clearscale edited a comment on issue #4044: [SUPPORT] Question on hudi's insert statment taking too long

2021-11-21 Thread GitBox
nikita-sheremet-clearscale edited a comment on issue #4044: URL: https://github.com/apache/hudi/issues/4044#issuecomment-974849011 @xushiyan Many thanks for the quick reply!!! Hudi config is: ``` hoodie.datasource.hive_sync.database -> hudi

[GitHub] [hudi] xushiyan commented on issue #3905: [SUPPORT] Transform from kafka complains about table not found when using transformer.sql

2021-11-21 Thread GitBox
xushiyan commented on issue #3905: URL: https://github.com/apache/hudi/issues/3905#issuecomment-974696334 @JB-data I don't think it goes to hive; it simply creates a spark sql table using that current spark session. -- This is an automated message from the Apache Git Service. To respond

  1   2   3   >