[GitHub] [hudi] hudi-bot removed a comment on pull request #4984: [HUDI-3583] Fix MarkerBasedRollbackStrategy NoSuchElementException

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4984: URL: https://github.com/apache/hudi/pull/4984#issuecomment-1064863475 ## CI report: * 015f7f0e07d3f0efbd8d3a728f802fc5572a8f52 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4984: [HUDI-3583] Fix MarkerBasedRollbackStrategy NoSuchElementException

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4984: URL: https://github.com/apache/hudi/pull/4984#issuecomment-1064865517 ## CI report: * 015f7f0e07d3f0efbd8d3a728f802fc5572a8f52 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4984: [HUDI-3583] Fix MarkerBasedRollbackStrategy NoSuchElementException

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4984: URL: https://github.com/apache/hudi/pull/4984#issuecomment-1061697101 ## CI report: * 015f7f0e07d3f0efbd8d3a728f802fc5572a8f52 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4984: [HUDI-3583] Fix MarkerBasedRollbackStrategy NoSuchElementException

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4984: URL: https://github.com/apache/hudi/pull/4984#issuecomment-1064863475 ## CI report: * 015f7f0e07d3f0efbd8d3a728f802fc5572a8f52 Azure:

[GitHub] [hudi] danny0405 commented on a change in pull request #5018: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause of deduplicateRecords method in FlinkWriteH

2022-03-10 Thread GitBox
danny0405 commented on a change in pull request #5018: URL: https://github.com/apache/hudi/pull/5018#discussion_r824461742 ## File path: hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/table/action/commit/FlinkWriteHelper.java ## @@ -113,5 +114,10 @@ public static

[GitHub] [hudi] hudi-bot commented on pull request #4872: [HUDI-3475] Support run compaction / clustering job in Service

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4872: URL: https://github.com/apache/hudi/pull/4872#issuecomment-1064853531 ## CI report: * 0fd561ae050f39c022862eae351c73b323a61e05 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4872: [HUDI-3475] Support run compaction / clustering job in Service

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4872: URL: https://github.com/apache/hudi/pull/4872#issuecomment-1064851908 ## CI report: * 0fd561ae050f39c022862eae351c73b323a61e05 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4872: [HUDI-3475] Support run compaction / clustering job in Service

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4872: URL: https://github.com/apache/hudi/pull/4872#issuecomment-1064851908 ## CI report: * 0fd561ae050f39c022862eae351c73b323a61e05 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4872: [HUDI-3475] Support run compaction / clustering job in Service

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4872: URL: https://github.com/apache/hudi/pull/4872#issuecomment-1048383460 ## CI report: * 0fd561ae050f39c022862eae351c73b323a61e05 Azure:

[GitHub] [hudi] wangxianghu edited a comment on pull request #4969: [HUDI-3569] Introduce ChainedJsonKafkaSourePostProcessor to support setting multi processors at one time

2022-03-10 Thread GitBox
wangxianghu edited a comment on pull request #4969: URL: https://github.com/apache/hudi/pull/4969#issuecomment-1064849682 hi @nsivabalan can we add this processor ? it is very useful in scenarios with diversified data requirements. In our comany, we have use this feature to add

[GitHub] [hudi] wangxianghu edited a comment on pull request #4969: [HUDI-3569] Introduce ChainedJsonKafkaSourePostProcessor to support setting multi processors at one time

2022-03-10 Thread GitBox
wangxianghu edited a comment on pull request #4969: URL: https://github.com/apache/hudi/pull/4969#issuecomment-1064849682 hi @nsivabalan can we add this processor ? it is very useful in scenarios with diversified data requirements. In our comany, we have use this feature to add

[GitHub] [hudi] wangxianghu commented on pull request #4969: [HUDI-3569] Introduce ChainedJsonKafkaSourePostProcessor to support setting multi processors at one time

2022-03-10 Thread GitBox
wangxianghu commented on pull request #4969: URL: https://github.com/apache/hudi/pull/4969#issuecomment-1064849682 hi @nsivabalan can we add this processor ? it is very useful in scenarios with diversified data requirements. In our comany, we have use this feature to add multiple

[GitHub] [hudi] prashantwason commented on a change in pull request #4640: [HUDI-3225] [RFC-45] for async metadata indexing

2022-03-10 Thread GitBox
prashantwason commented on a change in pull request #4640: URL: https://github.com/apache/hudi/pull/4640#discussion_r824447974 ## File path: rfc/rfc-45/rfc-45.md ## @@ -0,0 +1,264 @@ + + +# RFC-45: Asynchronous Metadata Indexing + +## Proposers + +- @codope +- @manojpec + +##

[GitHub] [hudi] hudi-bot commented on pull request #5013: [HUDI-3593] Restore TypedProperties and flush checksum in table config

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5013: URL: https://github.com/apache/hudi/pull/5013#issuecomment-1064845719 ## CI report: * a2e2b2ecd3ffe2974fac5e6472c2ab273f4d13c4 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5013: [HUDI-3593] Restore TypedProperties and flush checksum in table config

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #5013: URL: https://github.com/apache/hudi/pull/5013#issuecomment-1064844072 ## CI report: * a2e2b2ecd3ffe2974fac5e6472c2ab273f4d13c4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5013: [HUDI-3593] Restore TypedProperties and flush checksum in table config

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5013: URL: https://github.com/apache/hudi/pull/5013#issuecomment-1064844072 ## CI report: * a2e2b2ecd3ffe2974fac5e6472c2ab273f4d13c4 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5013: [HUDI-3593] Restore TypedProperties and flush checksum in table config

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #5013: URL: https://github.com/apache/hudi/pull/5013#issuecomment-1064395857 ## CI report: * a2e2b2ecd3ffe2974fac5e6472c2ab273f4d13c4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1064843916 ## CI report: * 018bb851445f7eabaa0bd4cc2b362f269d6fec59 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1064842219 ## CI report: * 018bb851445f7eabaa0bd4cc2b362f269d6fec59 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5019: [HUDI-3575] Use HoodieTestDataGenerator#TRIP_SCHEMA as example schema in TestSchemaPostProcessor

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5019: URL: https://github.com/apache/hudi/pull/5019#issuecomment-1064842384 ## CI report: * 3b6b326bb3650689e8ad78504ccaca3df2700998 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5019: [HUDI-3575] Use HoodieTestDataGenerator#TRIP_SCHEMA as example schema in TestSchemaPostProcessor

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #5019: URL: https://github.com/apache/hudi/pull/5019#issuecomment-1064840735 ## CI report: * 3b6b326bb3650689e8ad78504ccaca3df2700998 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1064842219 ## CI report: * 018bb851445f7eabaa0bd4cc2b362f269d6fec59 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1055496401 ## CI report: * 018bb851445f7eabaa0bd4cc2b362f269d6fec59 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5019: [HUDI-3575] Use HoodieTestDataGenerator#TRIP_SCHEMA as example schema in TestSchemaPostProcessor

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5019: URL: https://github.com/apache/hudi/pull/5019#issuecomment-1064840735 ## CI report: * 3b6b326bb3650689e8ad78504ccaca3df2700998 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3575) Use HoodieTestDataGenerator#TRIP_SCHEMA as example schema in TestSchemaPostProcessor

2022-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3575: - Labels: pull-request-available (was: ) > Use HoodieTestDataGenerator#TRIP_SCHEMA as example

[GitHub] [hudi] wangxianghu opened a new pull request #5019: [HUDI-3575] Use HoodieTestDataGenerator#TRIP_SCHEMA as example schema in TestSchemaPostProcessor

2022-03-10 Thread GitBox
wangxianghu opened a new pull request #5019: URL: https://github.com/apache/hudi/pull/5019 ## What is the purpose of the pull request *Use standard test schema in our UT instead of a shema from a specific enterprise data* ## Brief change log ## Verify this pull request

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064838694 ## CI report: * 4a9c78781cc4efcf3f13d6f12836b6fc3e738878 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064827082 ## CI report: * 4a9c78781cc4efcf3f13d6f12836b6fc3e738878 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4888: [HUDI-3396][Stacked on 4877] Refactoring `MergeOnReadRDD` to avoid duplication, fetch only projected columns

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4888: URL: https://github.com/apache/hudi/pull/4888#issuecomment-1064831876 ## CI report: * b07cca5112163e153385c690203603b74542ace6 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4888: [HUDI-3396][Stacked on 4877] Refactoring `MergeOnReadRDD` to avoid duplication, fetch only projected columns

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4888: URL: https://github.com/apache/hudi/pull/4888#issuecomment-1064748224 ## CI report: * e0afa9f1de90411220a6c1d25c0c9e43f09f6baf Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064827082 ## CI report: * 4a9c78781cc4efcf3f13d6f12836b6fc3e738878 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064825631 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064825631 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064806027 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[jira] [Commented] (HUDI-3607) Support backend switch in HoodieFlinkStreamer

2022-03-10 Thread Jira
[ https://issues.apache.org/jira/browse/HUDI-3607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504744#comment-17504744 ] 刘方奇 commented on HUDI-3607: --- [~wangxianghu] Could you help to take a glance? Can assign it to me. > Support

[GitHub] [hudi] guanziyue commented on a change in pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
guanziyue commented on a change in pull request #4264: URL: https://github.com/apache/hudi/pull/4264#discussion_r824428011 ## File path: hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/SparkMergeHelper.java ## @@ -101,13 +101,13 @@ public void

[GitHub] [hudi] guanziyue commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
guanziyue commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064821297 > @guanziyue thank you for taking the time to troubleshoot this concurrency issues and implement the fix! > > I echo @vinothchandar concerns and i think we're taking a

[hudi] branch master updated (83cff3a -> 18cdad9)

2022-03-10 Thread garyli
This is an automated email from the ASF dual-hosted git repository. garyli pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 83cff3a [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema (#4972) add

[GitHub] [hudi] garyli1019 merged pull request #4326: [HUDI-2999] [RFC-42] RFC for consistent hashing index

2022-03-10 Thread GitBox
garyli1019 merged pull request #4326: URL: https://github.com/apache/hudi/pull/4326 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4982: [HUDI-3567] Refactor HoodieCommonUtils to make code more reasonable

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4982: URL: https://github.com/apache/hudi/pull/4982#issuecomment-1064816693 ## CI report: * 282ca401f8e2a93d7703f592041b854959291d41 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4982: [HUDI-3567] Refactor HoodieCommonUtils to make code more reasonable

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4982: URL: https://github.com/apache/hudi/pull/4982#issuecomment-1064820630 ## CI report: * 282ca401f8e2a93d7703f592041b854959291d41 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5015: [HUDI-3513] Make sure Column Stats does not fail in case it fails to load previous Index Table state

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #5015: URL: https://github.com/apache/hudi/pull/5015#issuecomment-1064739346 ## CI report: * 16c497f48a922830b3fbcb833bca203c292158da Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5015: [HUDI-3513] Make sure Column Stats does not fail in case it fails to load previous Index Table state

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5015: URL: https://github.com/apache/hudi/pull/5015#issuecomment-1064819300 ## CI report: * 16c497f48a922830b3fbcb833bca203c292158da Azure:

[GitHub] [hudi] huberylee commented on pull request #4982: [HUDI-3567] Refactor HoodieCommonUtils to make code more reasonable

2022-03-10 Thread GitBox
huberylee commented on pull request #4982: URL: https://github.com/apache/hudi/pull/4982#issuecomment-1064819324 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Created] (HUDI-3607) Support backend switch in HoodieFlinkStreamer

2022-03-10 Thread Jira
刘方奇 created HUDI-3607: - Summary: Support backend switch in HoodieFlinkStreamer Key: HUDI-3607 URL: https://issues.apache.org/jira/browse/HUDI-3607 Project: Apache Hudi Issue Type: Improvement

[GitHub] [hudi] hudi-bot removed a comment on pull request #4982: [HUDI-3567] Refactor HoodieCommonUtils to make code more reasonable

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4982: URL: https://github.com/apache/hudi/pull/4982#issuecomment-1064734674 ## CI report: * 282ca401f8e2a93d7703f592041b854959291d41 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4982: [HUDI-3567] Refactor HoodieCommonUtils to make code more reasonable

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4982: URL: https://github.com/apache/hudi/pull/4982#issuecomment-1064816693 ## CI report: * 282ca401f8e2a93d7703f592041b854959291d41 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4971: [HUDI-3556] Re-use rollback instant for rolling back of clustering and compaction if rollback failed mid-way

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4971: URL: https://github.com/apache/hudi/pull/4971#issuecomment-1064813820 ## CI report: * 74ace6ca3f717a41d54047bb44ea52fedb94e1ce Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4971: [HUDI-3556] Re-use rollback instant for rolling back of clustering and compaction if rollback failed mid-way

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4971: URL: https://github.com/apache/hudi/pull/4971#issuecomment-1064815294 ## CI report: * 74ace6ca3f717a41d54047bb44ea52fedb94e1ce Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4971: [HUDI-3556] Re-use rollback instant for rolling back of clustering and compaction if rollback failed mid-way

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4971: URL: https://github.com/apache/hudi/pull/4971#issuecomment-1064813820 ## CI report: * 74ace6ca3f717a41d54047bb44ea52fedb94e1ce Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4971: [HUDI-3556] Re-use rollback instant for rolling back of clustering and compaction if rollback failed mid-way

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4971: URL: https://github.com/apache/hudi/pull/4971#issuecomment-1064772913 ## CI report: * 74ace6ca3f717a41d54047bb44ea52fedb94e1ce Azure:

[GitHub] [hudi] boneanxs edited a comment on pull request #4999: [HUDI-3592] Fix NPE of DefaultHoodieRecordPayload if Property is empty

2022-03-10 Thread GitBox
boneanxs edited a comment on pull request #4999: URL: https://github.com/apache/hudi/pull/4999#issuecomment-1064809186 @nsivabalan @xushiyan @XuQianJin-Stars could you pls review this? -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [hudi] boneanxs commented on pull request #4999: [HUDI-3592] Fix NPE of DefaultHoodieRecordPayload if Property is empty

2022-03-10 Thread GitBox
boneanxs commented on pull request #4999: URL: https://github.com/apache/hudi/pull/4999#issuecomment-1064809186 @nsivabalan @xushiyan could you pls review this? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] hudi-bot commented on pull request #5018: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause of deduplicateRecords method in FlinkWriteHelper out of

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5018: URL: https://github.com/apache/hudi/pull/5018#issuecomment-1064807946 ## CI report: * b9e437b2c2942ba29945d1d21c7e214e350e4333 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5018: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause of deduplicateRecords method in FlinkWriteHelper

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #5018: URL: https://github.com/apache/hudi/pull/5018#issuecomment-1064806485 ## CI report: * b9e437b2c2942ba29945d1d21c7e214e350e4333 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #5018: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause of deduplicateRecords method in FlinkWriteHelper out of

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5018: URL: https://github.com/apache/hudi/pull/5018#issuecomment-1064806485 ## CI report: * b9e437b2c2942ba29945d1d21c7e214e350e4333 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] wxplovecc commented on pull request #4981: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause o…

2022-03-10 Thread GitBox
wxplovecc commented on pull request #4981: URL: https://github.com/apache/hudi/pull/4981#issuecomment-1064806140 https://github.com/apache/hudi/pull/5018 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064806027 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064804622 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[GitHub] [hudi] wxplovecc opened a new pull request #5018: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause of deduplicateRecords method in FlinkWriteHelper out of

2022-03-10 Thread GitBox
wxplovecc opened a new pull request #5018: URL: https://github.com/apache/hudi/pull/5018 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1064804622 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-1044252671 ## CI report: * 6f55461f206b4608607bc8ce706d9fa451dd2ab7 Azure:

[jira] [Closed] (HUDI-184) Integrate Hudi with Apache Flink

2022-03-10 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-184. - Resolution: Implemented This feature has been tracked via https://issues.apache.org/jira/browse/HUDI-1521 >

[jira] [Reopened] (HUDI-184) Integrate Hudi with Apache Flink

2022-03-10 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang reopened HUDI-184: --- > Integrate Hudi with Apache Flink > > > Key: HUDI-184 >

[GitHub] [hudi] guanziyue commented on a change in pull request #4913: [HUDI-1517] create marker file for every log file

2022-03-10 Thread GitBox
guanziyue commented on a change in pull request #4913: URL: https://github.com/apache/hudi/pull/4913#discussion_r824411709 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieAppendHandle.java ## @@ -113,22 +116,37 @@ // Header metadata for

[GitHub] [hudi] hudi-bot commented on pull request #5017: [HUDI-3606] Add `org.objenesis:objenesis` to hudi-timeline-server-bundle pom

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5017: URL: https://github.com/apache/hudi/pull/5017#issuecomment-1064800761 ## CI report: * d1211dd592bcb9e3df60b80b9585d2eda9f0b8ab Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5017: [HUDI-3606] Add `org.objenesis:objenesis` to hudi-timeline-server-bundle pom

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #5017: URL: https://github.com/apache/hudi/pull/5017#issuecomment-1064799451 ## CI report: * d1211dd592bcb9e3df60b80b9585d2eda9f0b8ab UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #5017: [HUDI-3606] Add `org.objenesis:objenesis` to hudi-timeline-server-bundle pom

2022-03-10 Thread GitBox
hudi-bot commented on pull request #5017: URL: https://github.com/apache/hudi/pull/5017#issuecomment-1064799451 ## CI report: * d1211dd592bcb9e3df60b80b9585d2eda9f0b8ab UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3606) ClassNotFoundException: org.objenesis.strategy.InstantiatorStrategy

2022-03-10 Thread cdmikechen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cdmikechen updated HUDI-3606: - Description: When using *hudi-timeline-server-bundle* in hadoop server (3.2.2), hudi will occasionally

[jira] [Updated] (HUDI-3606) ClassNotFoundException: org.objenesis.strategy.InstantiatorStrategy

2022-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3606: - Labels: pull-request-available (was: ) > ClassNotFoundException:

[GitHub] [hudi] cdmikechen opened a new pull request #5017: [HUDI-3606] Add `org.objenesis:objenesis` to hudi-timeline-server-bundle pom

2022-03-10 Thread GitBox
cdmikechen opened a new pull request #5017: URL: https://github.com/apache/hudi/pull/5017 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[jira] [Closed] (HUDI-609) Implement a Flink specific HoodieIndex

2022-03-10 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-609. - Resolution: Won't Do > Implement a Flink specific HoodieIndex > -- > >

[jira] [Closed] (HUDI-608) Implement a flink datastream execution context

2022-03-10 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-608. - Resolution: Won't Do > Implement a flink datastream execution context >

[jira] [Closed] (HUDI-184) Integrate Hudi with Apache Flink

2022-03-10 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-184. - Resolution: Won't Do > Integrate Hudi with Apache Flink > > >

[jira] [Created] (HUDI-3606) ClassNotFoundException: org.objenesis.strategy.InstantiatorStrategy

2022-03-10 Thread cdmikechen (Jira)
cdmikechen created HUDI-3606: Summary: ClassNotFoundException: org.objenesis.strategy.InstantiatorStrategy Key: HUDI-3606 URL: https://issues.apache.org/jira/browse/HUDI-3606 Project: Apache Hudi

[GitHub] [hudi] hudi-bot commented on pull request #4877: [HUDI-3457][Stacked on 4818] Refactored Spark DataSource Relations to avoid code duplication

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4877: URL: https://github.com/apache/hudi/pull/4877#issuecomment-1064793172 ## CI report: * 2940f46a133ca3142f7ebb26b8c6f20583d7f395 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4877: [HUDI-3457][Stacked on 4818] Refactored Spark DataSource Relations to avoid code duplication

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4877: URL: https://github.com/apache/hudi/pull/4877#issuecomment-1064717467 ## CI report: * d875e412abc29bf6a0e8a6fa7bef747ded15d60b Azure:

[GitHub] [hudi] wxplovecc closed pull request #4654: [HUDI-3286] duplicate records when flink task restart with index.bootstrap=true

2022-03-10 Thread GitBox
wxplovecc closed pull request #4654: URL: https://github.com/apache/hudi/pull/4654 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Closed] (HUDI-3522) Introduce DropColumnSchemaPostProcessor to support drop columns from schema

2022-03-10 Thread Xianghu Wang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianghu Wang closed HUDI-3522. -- Resolution: Fixed Resolved via master : 83cff3afee15e129034eb51e68a1734c55d85da2 > Introduce

[GitHub] [hudi] wxplovecc closed pull request #4981: [HUDI-3559] fix flink Bucket Index with COW table type `NoSuchElementException` cause o…

2022-03-10 Thread GitBox
wxplovecc closed pull request #4981: URL: https://github.com/apache/hudi/pull/4981 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[hudi] branch master updated (9dc6df5 -> 83cff3a)

2022-03-10 Thread wangxianghu
This is an automated email from the ASF dual-hosted git repository. wangxianghu pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 9dc6df5 [HUDI-3595] Fixing NULL schema provider for empty batch (#5002) add 83cff3a [HUDI-3522] Introduce

[GitHub] [hudi] wangxianghu merged pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

2022-03-10 Thread GitBox
wangxianghu merged pull request #4972: URL: https://github.com/apache/hudi/pull/4972 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4996: [HUDI-3594][Stacked on 4948] Supporting Composite Expressions over Data Table Columns in Data Skipping flow

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4996: URL: https://github.com/apache/hudi/pull/4996#issuecomment-1064734709 ## CI report: * 25578be3436f3a95af26f99368dd581efc5062e0 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4996: [HUDI-3594][Stacked on 4948] Supporting Composite Expressions over Data Table Columns in Data Skipping flow

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4996: URL: https://github.com/apache/hudi/pull/4996#issuecomment-1064776265 ## CI report: * 9de43c5d691fa4a4f383a4647ddefa4798fa127d Azure:

[jira] [Comment Edited] (HUDI-3593) AsyncClustering failed because of ConcurrentModificationException

2022-03-10 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504719#comment-17504719 ] shibei edited comment on HUDI-3593 at 3/11/22, 5:04 AM: Another failure

[jira] [Comment Edited] (HUDI-3593) AsyncClustering failed because of ConcurrentModificationException

2022-03-10 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504719#comment-17504719 ] shibei edited comment on HUDI-3593 at 3/11/22, 5:03 AM: Another failure

[GitHub] [hudi] xushiyan commented on a change in pull request #4962: [HUDI-3355] Issue with out of order commits in the timeline when ingestion writers using SparkAllowUpdateStrategy

2022-03-10 Thread GitBox
xushiyan commented on a change in pull request #4962: URL: https://github.com/apache/hudi/pull/4962#discussion_r824386018 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/utils/TransactionUtils.java ## @@ -137,4 +165,20 @@ throw new

[GitHub] [hudi] hudi-bot commented on pull request #4948: [HUDI-3514] Rebase Data Skipping flow to rely on MT Column Stats index

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4948: URL: https://github.com/apache/hudi/pull/4948#issuecomment-1064773990 ## CI report: * 14366cac6e233cb85ee94307a7f62f6184ed5b34 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4948: [HUDI-3514] Rebase Data Skipping flow to rely on MT Column Stats index

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4948: URL: https://github.com/apache/hudi/pull/4948#issuecomment-1064707297 ## CI report: * 4421752bef3dd3b53cd896f7d3ca23bb49d22034 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4971: [HUDI-3556] Re-use rollback instant for rolling back of clustering and compaction if rollback failed mid-way

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4971: URL: https://github.com/apache/hudi/pull/4971#issuecomment-1064705899 ## CI report: * 8e89371fed3d147b43959a73e3e6a33cfaefd32c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4971: [HUDI-3556] Re-use rollback instant for rolling back of clustering and compaction if rollback failed mid-way

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4971: URL: https://github.com/apache/hudi/pull/4971#issuecomment-1064772913 ## CI report: * 74ace6ca3f717a41d54047bb44ea52fedb94e1ce Azure:

[jira] [Comment Edited] (HUDI-3593) AsyncClustering failed because of ConcurrentModificationException

2022-03-10 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504719#comment-17504719 ] shibei edited comment on HUDI-3593 at 3/11/22, 4:10 AM: {code:java} at

[jira] [Commented] (HUDI-3593) AsyncClustering failed because of ConcurrentModificationException

2022-03-10 Thread shibei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17504719#comment-17504719 ] shibei commented on HUDI-3593: -- Another failure   {code:java} at

[GitHub] [hudi] hudi-bot removed a comment on pull request #4489: [HUDI-3135] Fix Delete partitions with metadata table and fix show partitions in spark sql

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4489: URL: https://github.com/apache/hudi/pull/4489#issuecomment-1064705595 ## CI report: * e74a30e1b9f4395780cfe412d3574dabe2ae9f57 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4489: [HUDI-3135] Fix Delete partitions with metadata table and fix show partitions in spark sql

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4489: URL: https://github.com/apache/hudi/pull/4489#issuecomment-1064751722 ## CI report: * d17343318be38b5a9b0953004700aa72f4fed689 Azure:

[GitHub] [hudi] melin opened a new issue #5016: [SUPPORT] Add AS OF syntax support

2022-03-10 Thread GitBox
melin opened a new issue #5016: URL: https://github.com/apache/hudi/issues/5016 Use sql to query the specified version data ``` SELECT * FROM default.people10m VERSION AS OF 0; SELECT * FROM default.people10m TIMESTAMP AS OF '2019-01-29 00:37:58'; ``` -- This is an

[GitHub] [hudi] hudi-bot commented on pull request #4888: [HUDI-3396][Stacked on 4877] Refactoring `MergeOnReadRDD` to avoid duplication, fetch only projected columns

2022-03-10 Thread GitBox
hudi-bot commented on pull request #4888: URL: https://github.com/apache/hudi/pull/4888#issuecomment-1064748224 ## CI report: * e0afa9f1de90411220a6c1d25c0c9e43f09f6baf Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4888: [HUDI-3396][Stacked on 4877] Refactoring `MergeOnReadRDD` to avoid duplication, fetch only projected columns

2022-03-10 Thread GitBox
hudi-bot removed a comment on pull request #4888: URL: https://github.com/apache/hudi/pull/4888#issuecomment-1064723457 ## CI report: * e0afa9f1de90411220a6c1d25c0c9e43f09f6baf Azure:

[hudi] branch master updated (fa5e750 -> 9dc6df5)

2022-03-10 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from fa5e750 [HUDI-3586] Add Trino Queries in integration tests (#4988) add 9dc6df5 [HUDI-3595] Fixing NULL

[GitHub] [hudi] nsivabalan merged pull request #5002: [HUDI-3595] Fixing NULL schema provider for empty batch

2022-03-10 Thread GitBox
nsivabalan merged pull request #5002: URL: https://github.com/apache/hudi/pull/5002 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

  1   2   3   4   5   >