[jira] [Updated] (HUDI-3435) Do not throw exception when instant to rollback does not exist in metadata table active timeline

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3435: - Story Points: 1 > Do not throw exception when instant to rollback does not exist in metadata > table

[jira] [Updated] (HUDI-3453) Metadata table throws NPE when scheduling compaction plan

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3453: - Story Points: 2 > Metadata table throws NPE when scheduling compaction plan >

[GitHub] [hudi] danny0405 commented on issue #4979: [SUPPORT] After the changelog mode is enabled, the - U operation records long ago have not been deleted

2022-03-07 Thread GitBox
danny0405 commented on issue #4979: URL: https://github.com/apache/hudi/issues/4979#issuecomment-1061498111 Yes, the spark engine has not adapter to the `__hoodie_operation` metadata field yet. -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] hudi-bot commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061497426 ## CI report: * 06dd2353e935d1541b969cbb686911446e16e448 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061470391 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[jira] [Commented] (HUDI-2677) Add DFS based message queue for flink writer

2022-03-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17502775#comment-17502775 ] Danny Chen commented on HUDI-2677: -- Another patch via master for release 0.11:

[jira] [Updated] (HUDI-2677) Add DFS based message queue for flink writer

2022-03-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2677: - Fix Version/s: 0.11.0 > Add DFS based message queue for flink writer >

[hudi] branch master updated (b6bdb46 -> fe53bd2)

2022-03-07 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from b6bdb46 [MINOR][HUDI-3460]Fix HoodieDataSourceITCase add fe53bd2 [HUDI-2677] Add DFS based message queue for

[GitHub] [hudi] danny0405 merged pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
danny0405 merged pull request #4961: URL: https://github.com/apache/hudi/pull/4961 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] waywtdcc edited a comment on issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc edited a comment on issue #4683: URL: https://github.com/apache/hudi/issues/4683#issuecomment-1061486790 > will wait for a week and close out the github issue if we don't hear back from you. Sorry, I didn't notice. This is the problem of hive query. The timestamp type

[GitHub] [hudi] waywtdcc closed issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc closed issue #4683: URL: https://github.com/apache/hudi/issues/4683 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] waywtdcc commented on issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc commented on issue #4683: URL: https://github.com/apache/hudi/issues/4683#issuecomment-1061486790 > will wait for a week and close out the github issue if we don't hear back from you. Sorry, I didn't notice. This is the problem of hive query. The timestamp type field

[GitHub] [hudi] waywtdcc removed a comment on issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc removed a comment on issue #4683: URL: https://github.com/apache/hudi/issues/4683#issuecomment-1061486598 > @waywtdcc 堆栈跟踪显示“不支持的类型:可选的 int96 ts”,它并非源自 hudi。看起来像 hive ql 指定 bigint 的问题?你尝试过不同的数据类型,比如 TIMESTAMP`ts`吗?这也是`ts`来自`users_cdc_hive`还是`user_cdc17_ro`?您可能需要仔细检查两个表的模式并确保未使用

[GitHub] [hudi] waywtdcc commented on issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc commented on issue #4683: URL: https://github.com/apache/hudi/issues/4683#issuecomment-1061486598 > @waywtdcc 堆栈跟踪显示“不支持的类型:可选的 int96 ts”,它并非源自 hudi。看起来像 hive ql 指定 bigint 的问题?你尝试过不同的数据类型,比如 TIMESTAMP`ts`吗?这也是`ts`来自`users_cdc_hive`还是`user_cdc17_ro`?您可能需要仔细检查两个表的模式并确保未使用 int96。

[GitHub] [hudi] waywtdcc opened a new issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc opened a new issue #4683: URL: https://github.com/apache/hudi/issues/4683 **To Reproduce** ` CREATE TABLE `hudi.user_cdc17`( ) ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' STORED AS INPUTFORMAT

[jira] [Updated] (HUDI-3559) NoSuchElementException when use BUCKET index in flink cow table

2022-03-07 Thread konwu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] konwu updated HUDI-3559: Description: *Environment:* ```sql CREATE TABLE test_source (  userid int, ts TIMESTAMP(3) ) WITH (  'connector'

[GitHub] [hudi] boneanxs commented on pull request #4954: [HUDI-3561] Avoid including whole `MultipleSparkJobExecutionStrategy` object into the closure for Spark to serialize

2022-03-07 Thread GitBox
boneanxs commented on pull request #4954: URL: https://github.com/apache/hudi/pull/4954#issuecomment-1061481475 Hi guys, I also met this exception when enable async clustering in a HoodieSparkStreaming job, not the same as the stacktrace this issue hit, following is the stacktrace I met,

[GitHub] [hudi] hudi-bot removed a comment on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061401955 ## CI report: * 239df2ffee6cfcc9f617d50b74bbc0a850447233 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061475599 ## CI report: * 8cae5088607eb4e091630774d16d4f1ce7b130f9 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061401987 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061470391 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] scxwhite commented on pull request #4926: [HUDI-3566]add thread factory in BoundedInMemoryExecutor

2022-03-07 Thread GitBox
scxwhite commented on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1061465457 @leesf please review again. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] nleena123 opened a new issue #4980: unable reprocess the data with old schema through Apache hudi ,Caused by: HoodieUpsertException: Error upserting bucketType UPDATE for partition :34

2022-03-07 Thread GitBox
nleena123 opened a new issue #4980: URL: https://github.com/apache/hudi/issues/4980 Hi All, I am unable to re-process the data with old schema through apache Hudi. Getting below exception while running the job.

[jira] [Created] (HUDI-3580) [RFC-TBD] Support minor compaction for MOR tables

2022-03-07 Thread Surya Prasanna Yalla (Jira)
Surya Prasanna Yalla created HUDI-3580: -- Summary: [RFC-TBD] Support minor compaction for MOR tables Key: HUDI-3580 URL: https://issues.apache.org/jira/browse/HUDI-3580 Project: Apache Hudi

[GitHub] [hudi] waywtdcc opened a new issue #4979: [SUPPORT] After the changelog mode is enabled, the previous - U operation records are not deleted

2022-03-07 Thread GitBox
waywtdcc opened a new issue #4979: URL: https://github.com/apache/hudi/issues/4979 After the changelog mode is turned on, the flick synchronizes the changelog mode, but the - U operation record of a long time ago has not been deleted

[GitHub] [hudi] waywtdcc closed issue #4683: [SUPPORT] Hive ro table read error

2022-03-07 Thread GitBox
waywtdcc closed issue #4683: URL: https://github.com/apache/hudi/issues/4683 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

2022-03-07 Thread GitBox
wangxianghu commented on pull request #4972: URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061443825 hi @XuQianJin-Stars, could you please help review this as well ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] waywtdcc commented on pull request #4874: [HUDI-3487] The global index is enabled regardless of changlog

2022-03-07 Thread GitBox
waywtdcc commented on pull request #4874: URL: https://github.com/apache/hudi/pull/4874#issuecomment-1061433397 > The changelog mode has retraction message itself, so no need to additional retraction for the deletions of different partition. > > The right solution for spark source

[GitHub] [hudi] hudi-bot commented on pull request #4977: [HUDI-2871] Decouple metrics dependencies from hudi-client-common

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4977: URL: https://github.com/apache/hudi/pull/4977#issuecomment-1061432784 ## CI report: * e97d0cef7d647047dcd23d42cf2c3ba33c8271a7 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4977: [HUDI-2871] Decouple metrics dependencies from hudi-client-common

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4977: URL: https://github.com/apache/hudi/pull/4977#issuecomment-1061393430 ## CI report: * e97d0cef7d647047dcd23d42cf2c3ba33c8271a7 Azure:

[GitHub] [hudi] FelixKJose edited a comment on issue #4891: Clustering not working on large table and partitions

2022-03-07 Thread GitBox
FelixKJose edited a comment on issue #4891: URL: https://github.com/apache/hudi/issues/4891#issuecomment-1061421066 @codope @suryaprasanna Thank you for the detailed information. Couple of questions: 1. Let's say my each partitions (date) are large partitions (eg. 6.5 TB

[GitHub] [hudi] FelixKJose commented on issue #4891: Clustering not working on large table and partitions

2022-03-07 Thread GitBox
FelixKJose commented on issue #4891: URL: https://github.com/apache/hudi/issues/4891#issuecomment-1061421066 @codope @suryaprasanna Thank you for the detailed information. Couple of questions: 1. Let's say my each partitions (date) are large partitions (eg. 6.5 TB uncompressed

[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4972: URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061067561 ## CI report: * 750e07438a19600c55d37e7107f88ba04416e218 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4972: URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061419844 ## CI report: * 750e07438a19600c55d37e7107f88ba04416e218 Azure:

[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

2022-03-07 Thread GitBox
wangxianghu commented on pull request #4972: URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061418575 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] hudi-bot commented on pull request #4958: [HUDI-3558] [Stacked 3123/3085] Consistent bucket index: bucket resizing (split) & concurrent write during resizing

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4958: URL: https://github.com/apache/hudi/pull/4958#issuecomment-1061410462 ## CI report: * 7bf1ce3392b26c8a8ea6b414a289acec2fccfa6e Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4958: [HUDI-3558] [Stacked 3123/3085] Consistent bucket index: bucket resizing (split) & concurrent write during resizing

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4958: URL: https://github.com/apache/hudi/pull/4958#issuecomment-1061407644 ## CI report: * 7bf1ce3392b26c8a8ea6b414a289acec2fccfa6e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4848: URL: https://github.com/apache/hudi/pull/4848#issuecomment-1061410266 ## CI report: * fa193b7961e309d335cb24f5d35102bfa80111a7 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4848: URL: https://github.com/apache/hudi/pull/4848#issuecomment-1061373965 ## CI report: * ff1f746fc4826a6432ec2078ae3e6c8536a038f1 Azure:

[GitHub] [hudi] YuweiXiao commented on pull request #4958: [HUDI-3558] [Stacked 3123/3085] Consistent bucket index: bucket resizing (split) & concurrent write during resizing

2022-03-07 Thread GitBox
YuweiXiao commented on pull request #4958: URL: https://github.com/apache/hudi/pull/4958#issuecomment-1061410016 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] awpengfei opened a new issue #4978: [SUPPORT] Wrong table path when using Hive to query xxx_rt table before the first compaction

2022-03-07 Thread GitBox
awpengfei opened a new issue #4978: URL: https://github.com/apache/hudi/issues/4978 **Describe the problem you faced** When using Hive to query xxx_rt table,if there is no parquet file but only log files, we get a wrong table path. But when the parquet files are generated, the table

[GitHub] [hudi] hudi-bot commented on pull request #4958: [HUDI-3558] [Stacked 3123/3085] Consistent bucket index: bucket resizing (split) & concurrent write during resizing

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4958: URL: https://github.com/apache/hudi/pull/4958#issuecomment-1061407644 ## CI report: * 7bf1ce3392b26c8a8ea6b414a289acec2fccfa6e Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4958: [HUDI-3558] [Stacked 3123/3085] Consistent bucket index: bucket resizing (split) & concurrent write during resizing

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4958: URL: https://github.com/apache/hudi/pull/4958#issuecomment-1061375143 ## CI report: * 004b689405ece55f44c902102acaf8cdcce338e7 Azure:

[jira] [Assigned] (HUDI-3577) NPE in HoodieTimelineArchiver

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3577: Assignee: Raymond Xu > NPE in HoodieTimelineArchiver > - > >

[jira] [Updated] (HUDI-3577) NPE in HoodieTimelineArchiver

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3577: - Sprint: Hudi-Sprint-Mar-07 > NPE in HoodieTimelineArchiver > - > >

[jira] [Closed] (HUDI-2965) Fix layout optimization to appropriately handle nested columns references

2022-03-07 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin closed HUDI-2965. - Assignee: Alexey Kudinkin Resolution: Invalid This is actually not possible (currently),

[jira] [Assigned] (HUDI-3166) Implement new HoodieIndex based on metadata indices

2022-03-07 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit reassigned HUDI-3166: - Assignee: Sagar Sumit (was: Manoj Govindassamy) > Implement new HoodieIndex based on metadata

[jira] [Assigned] (HUDI-3317) Partition specific pointed lookup/reading strategy for metadata table

2022-03-07 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit reassigned HUDI-3317: - Assignee: Sagar Sumit (was: Manoj Govindassamy) > Partition specific pointed lookup/reading

[jira] [Assigned] (HUDI-3288) Partition specific compaction strategy for the metadata table

2022-03-07 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit reassigned HUDI-3288: - Assignee: Sagar Sumit (was: Manoj Govindassamy) > Partition specific compaction strategy for

[GitHub] [hudi] hudi-bot commented on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061401955 ## CI report: * 239df2ffee6cfcc9f617d50b74bbc0a850447233 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061397515 ## CI report: * 239df2ffee6cfcc9f617d50b74bbc0a850447233 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061401987 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061399684 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] zhangyue19921010 commented on issue #4803: [SUPPORT] Clustering throwing exception

2022-03-07 Thread GitBox
zhangyue19921010 commented on issue #4803: URL: https://github.com/apache/hudi/issues/4803#issuecomment-1061400775 Ack. Sorry, almost forget this issue. Will do more research and response ASAP -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [hudi] XuQianJin-Stars removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
XuQianJin-Stars removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061399577 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] hudi-bot removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061396176 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061399684 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] XuQianJin-Stars commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
XuQianJin-Stars commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061399577 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] hudi-bot commented on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061397515 ## CI report: * 239df2ffee6cfcc9f617d50b74bbc0a850447233 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061396105 ## CI report: * 63d8674f838502dd0ff99e58ef1384bb5ee4ab12 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061396105 ## CI report: * 63d8674f838502dd0ff99e58ef1384bb5ee4ab12 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061394696 ## CI report: * d0aeb916976adab68cef15f5978d4d8ac4d0b532 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061396176 ## CI report: * 5ece1ed59acf8199f0a8bfdf7c443c422ed52c12 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4961: [HUDI-2677] Add DFS based message queue for flink writer[part3]

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4961: URL: https://github.com/apache/hudi/pull/4961#issuecomment-1061375166 ## CI report: * 63d8674f838502dd0ff99e58ef1384bb5ee4ab12 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4976: fix NPE when run schdule using spark-sql if the commits time < hoodie.compact.inline.max.delta.commits

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4976: URL: https://github.com/apache/hudi/pull/4976#issuecomment-1061394716 ## CI report: * eee4d7b357550b2f3aa184783d4d30e983ae44b1 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4976: fix NPE when run schdule using spark-sql if the commits time < hoodie.compact.inline.max.delta.commits

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4976: URL: https://github.com/apache/hudi/pull/4976#issuecomment-1061361127 ## CI report: * eee4d7b357550b2f3aa184783d4d30e983ae44b1 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061375190 ## CI report: * d0aeb916976adab68cef15f5978d4d8ac4d0b532 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4970: [HUDI-3574] Improve maven module configs for different spark profiles

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4970: URL: https://github.com/apache/hudi/pull/4970#issuecomment-1061394696 ## CI report: * d0aeb916976adab68cef15f5978d4d8ac4d0b532 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4977: [HUDI-2871] Decouple metrics dependencies from hudi-client-common

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4977: URL: https://github.com/apache/hudi/pull/4977#issuecomment-1061393430 ## CI report: * e97d0cef7d647047dcd23d42cf2c3ba33c8271a7 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4977: [HUDI-2871] Decouple metrics dependencies from hudi-client-common

2022-03-07 Thread GitBox
hudi-bot removed a comment on pull request #4977: URL: https://github.com/apache/hudi/pull/4977#issuecomment-1061392129 ## CI report: * e97d0cef7d647047dcd23d42cf2c3ba33c8271a7 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4977: [HUDI-2871] Decouple metrics dependencies from hudi-client-common

2022-03-07 Thread GitBox
hudi-bot commented on pull request #4977: URL: https://github.com/apache/hudi/pull/4977#issuecomment-1061392129 ## CI report: * e97d0cef7d647047dcd23d42cf2c3ba33c8271a7 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-512) Support for Index functions on columns to generate logical or micro partitioning

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-512: Sprint: Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31, Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22,

[jira] [Updated] (HUDI-2752) The MOR DELETE block breaks the event time sequence of CDC

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2752: - Sprint: Hudi-Sprint-Jan-3, Hudi-Sprint-Jan-10, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

[jira] [Updated] (HUDI-3411) Incorrect Record Key Field property Handling

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3411: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > Incorrect Record Key Field

[jira] [Updated] (HUDI-3571) Add failure injection tests for spark datasource

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3571: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > Add failure injection tests

[jira] [Updated] (HUDI-3404) Auto adjust configs for metadata table enablement or turn off metadata table if necessary

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3404: - Sprint: Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

[jira] [Updated] (HUDI-1370) Scoping work needed to support bootstrapped data table and RFC-15 together

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1370: - Sprint: Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31, Hudi-Sprint-Feb-7,

[jira] [Updated] (HUDI-3075) Docs for Debezium source

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3075: - Sprint: Hudi-Sprint-Jan-3, Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24,

[jira] [Updated] (HUDI-2695) Documentation

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2695: - Sprint: Hudi-Sprint-Jan-3, Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24,

[jira] [Updated] (HUDI-2606) Ensure query engines not access MDT if disabled

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2606: - Sprint: Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was:

[jira] [Updated] (HUDI-3396) Make sure Spark reads only Projected Columns for both MOR/COW

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3396: - Sprint: Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

[jira] [Updated] (HUDI-3457) Refactor Spark Relations to avoid code duplication

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3457: - Sprint: Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was:

[jira] [Updated] (HUDI-3556) re-use rollback instant for compaction and clustering

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3556: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > re-use rollback instant for

[jira] [Updated] (HUDI-2757) Support AWS Glue API for metastore sync

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2757: - Sprint: Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

[jira] [Updated] (HUDI-3514) Leverage MT Column-stats Index in HoodieFileIndex

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3514: - Sprint: Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Feb-22,

[jira] [Updated] (HUDI-3382) Support removal of bloom and column stats indexes

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3382: - Sprint: Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

[jira] [Updated] (HUDI-2439) Refactor table.action.commit package (CommitActionExecutors) in hudi-client module

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2439: - Sprint: Hudi-Sprint-Jan-18, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01,

[jira] [Updated] (HUDI-3284) Restore hudi-presto-bundle changes and upgrade presto version in docker setup

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3284: - Sprint: Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31, Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14,

[jira] [Updated] (HUDI-3374) metadata index for secondary keys

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3374: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > metadata index for secondary

[jira] [Updated] (HUDI-2871) Decouple metrics dependencies from hudi-client-common

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2871: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > Decouple metrics dependencies

[jira] [Updated] (HUDI-3494) MOR compaction and archive settings may prevent metadata table compaction

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3494: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > MOR compaction and archive

[jira] [Updated] (HUDI-3207) Hudi Trino connector PR review

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3207: - Sprint: Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31,

[jira] [Updated] (HUDI-1180) Upgrade HBase to 2.x

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1180: - Sprint: Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31, Hudi-Sprint-Feb-7,

[jira] [Updated] (HUDI-3560) Add docker image for spark3 hadoop3 and hive3

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3560: - Sprint: Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Mar-01) > Add docker image for spark3

[jira] [Updated] (HUDI-3513) Clustering fails with updating col stats when disabling metadata table

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3513: - Sprint: Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07 (was: Hudi-Sprint-Feb-22,

[jira] [Updated] (HUDI-3208) Come up with rollout plan for enabling metadata table by default in 0.11

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3208: - Sprint: Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31,

[jira] [Updated] (HUDI-3074) Docs for Z-order

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3074: - Sprint: Hudi-Sprint-Jan-3, Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24,

[jira] [Updated] (HUDI-2973) Rewrite/re-publish RFC for Data skipping index

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2973: - Sprint: Hudi-Sprint-Jan-18, Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31, Hudi-Sprint-Feb-7,

[jira] [Updated] (HUDI-2961) Async table services can race with metadata table updates

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2961: - Sprint: Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31, Hudi-Sprint-Feb-7, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

[jira] [Updated] (HUDI-3258) Support multiple metadata index partitions - bloom and column stats

2022-03-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3258: - Sprint: Hudi-Sprint-Feb-7, Hudi-Sprint-Feb-14, Hudi-Sprint-Feb-22, Hudi-Sprint-Mar-01, Hudi-Sprint-Mar-07

  1   2   3   4   5   6   7   >