[GitHub] [hudi] alexeykudinkin commented on pull request #5166: [MINOR] Fix dates as per UTC in TestDataSkippingUtils

2022-03-29 Thread GitBox
alexeykudinkin commented on pull request #5166: URL: https://github.com/apache/hudi/pull/5166#issuecomment-1082253783 Sorry for the trouble, folks: when adding those tests I simply ignored the fact that these are in my local TZ. Thanks for fixing this, @codope -- This is an automated me

[GitHub] [hudi] nsivabalan merged pull request #4955: [HUDI-3549] Removing dependency on "spark-avro"

2022-03-29 Thread GitBox
nsivabalan merged pull request #4955: URL: https://github.com/apache/hudi/pull/4955 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubs

[jira] [Updated] (HUDI-3745) Add support for spark data-source reader options in S3EventsHoodieIncrSource

2022-03-29 Thread Harshal Patil (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harshal Patil updated HUDI-3745: Description: S3EventsHoodieIncrSource reader supports different file formats. For each of these so

[jira] [Updated] (HUDI-3745) Add support for spark data-source reader options in S3EventsHoodieIncrSource

2022-03-29 Thread Harshal Patil (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harshal Patil updated HUDI-3745: Description: S3EventsHoodieIncrSource reader supports different file formats. For each of these so

[jira] [Created] (HUDI-3745) Add support for spark data-source reader options in S3EventsHoodieIncrSource

2022-03-29 Thread Harshal Patil (Jira)
Harshal Patil created HUDI-3745: --- Summary: Add support for spark data-source reader options in S3EventsHoodieIncrSource Key: HUDI-3745 URL: https://issues.apache.org/jira/browse/HUDI-3745 Project: Apach

[jira] [Assigned] (HUDI-3745) Add support for spark data-source reader options in S3EventsHoodieIncrSource

2022-03-29 Thread Harshal Patil (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harshal Patil reassigned HUDI-3745: --- Assignee: Harshal Patil > Add support for spark data-source reader options in S3EventsHoodieI

[jira] [Updated] (HUDI-3732) Fix validation of rollback to avoid any pending clustering instants

2022-03-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3732: -- Status: Patch Available (was: In Progress) > Fix validation of rollback to avoid any pe

[jira] [Updated] (HUDI-3733) Add argument for HoodieFailedWritesCleaningPolicy to restore in hudi-cli

2022-03-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3733: -- Status: Patch Available (was: In Progress) > Add argument for HoodieFailedWritesCleanin

[jira] [Updated] (HUDI-3732) Fix validation of rollback to avoid any pending clustering instants

2022-03-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3732: -- Status: In Progress (was: Open) > Fix validation of rollback to avoid any pending clust

[jira] [Updated] (HUDI-3733) Add argument for HoodieFailedWritesCleaningPolicy to restore in hudi-cli

2022-03-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3733: -- Status: In Progress (was: Open) > Add argument for HoodieFailedWritesCleaningPolicy to

[hudi] branch master updated (fcb003e -> 0802510)

2022-03-29 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from fcb003e [HUDI-3731] Fixing Column Stats Index record Merging sequence missing `columnName` (#5159) add 0802510

[GitHub] [hudi] xushiyan merged pull request #5147: [HUDI-2520] Fix drop partition issue when sync to hive

2022-03-29 Thread GitBox
xushiyan merged pull request #5147: URL: https://github.com/apache/hudi/pull/5147

[jira] [Updated] (HUDI-3738) Perf comparison between parquet and hudi for COW snapshot and MOR read optimized

2022-03-29 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3738: -- Status: In Progress (was: Open) > Perf comparison between parquet and hudi for COW snapshot and

[jira] [Updated] (HUDI-2875) Concurrent call to HoodieMergeHandler cause parquet corruption

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2875: - Priority: Critical (was: Major) > Concurrent call to HoodieMergeHandler cause parquet corruption > --

[jira] [Updated] (HUDI-2875) Concurrent call to HoodieMergeHandler cause parquet corruption

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2875: - Component/s: writer-core > Concurrent call to HoodieMergeHandler cause parquet corruption > --

[jira] [Updated] (HUDI-3653) Clean up Column Stats Index introduced along with Spatial Curves Clustering

2022-03-29 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3653: -- Status: In Progress (was: Open) > Clean up Column Stats Index introduced along with Spatial Cur

[jira] [Updated] (HUDI-1370) Scoping work needed to support bootstrapped data table and RFC-15 together

2022-03-29 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-1370: -- Status: Patch Available (was: In Progress) > Scoping work needed to support bootstrapped data t

[jira] [Updated] (HUDI-3739) Fix translation of isNotNull predicates in Data Skipping

2022-03-29 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3739: -- Status: In Progress (was: Open) > Fix translation of isNotNull predicates in Data Skipping > --

[jira] [Updated] (HUDI-2780) Mor reads the log file and skips the complete block as a bad block, resulting in data loss

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2780: - Fix Version/s: 0.12.0 (was: 0.11.0) > Mor reads the log file and skips the complete

[jira] [Updated] (HUDI-2296) flink support ConsistencyGuard plugin

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2296: - Fix Version/s: 0.12.0 (was: 0.11.0) > flink support ConsistencyGuard plugin > ---

[GitHub] [hudi] hudi-bot removed a comment on pull request #5147: [HUDI-2520] Fix drop partition issue when sync to hive

2022-03-29 Thread GitBox
hudi-bot removed a comment on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1082054596 ## CI report: * 08e5869a9f55f0d2b8fc649fc08dd7c62f1d100a UNKNOWN * 4d1f5e5b46bf135f4096db2182b983e04c6f11a7 Azure: [SUCCESS](https://dev.azure.com/apache-hud

[GitHub] [hudi] hudi-bot commented on pull request #5147: [HUDI-2520] Fix drop partition issue when sync to hive

2022-03-29 Thread GitBox
hudi-bot commented on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1082217071 ## CI report: * 08e5869a9f55f0d2b8fc649fc08dd7c62f1d100a UNKNOWN * dd80471745d52869640ab73fc8cb8d9924e41604 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org

[jira] [Updated] (HUDI-2873) Support optimize data layout by sql and make the build more fast

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2873: - Fix Version/s: 0.12.0 (was: 0.11.0) > Support optimize data layout by sql and make

[jira] [Updated] (HUDI-2808) Supports deduplication for streaming write

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2808: - Fix Version/s: 0.12.0 (was: 0.11.0) > Supports deduplication for streaming write >

[jira] [Updated] (HUDI-2873) Support optimize data layout by sql and make the build more fast

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2873: - Issue Type: New Feature (was: Task) > Support optimize data layout by sql and make the build more fast >

[jira] [Updated] (HUDI-2873) Support optimize data layout by sql and make the build more fast

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2873: - Labels: (was: sev:high) > Support optimize data layout by sql and make the build more fast > ---

[jira] [Updated] (HUDI-2173) Enhancing DynamoDB based LockProvider

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2173: - Issue Type: New Feature (was: Task) > Enhancing DynamoDB based LockProvider > ---

[jira] [Commented] (HUDI-2173) Enhancing DynamoDB based LockProvider

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514261#comment-17514261 ] Raymond Xu commented on HUDI-2173: -- Hey [~dave_hagman], any update on this work? > Enhan

[jira] [Updated] (HUDI-1872) Move HoodieFlinkStreamer into hudi-utilities module

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1872: - Fix Version/s: 0.12.0 (was: 0.11.0) > Move HoodieFlinkStreamer into hudi-utilities

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): hudi-bot commented on pull request #3039: URL: https://github.com/apache/hudi/pull/3039#issuecomment-914648327 ## CI rep

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): hudi-bot edited a comment on pull request #3039: URL: https://github.com/apache/hudi/pull/3039#issuecomment-914648327 ##

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): hudi-bot edited a comment on pull request #3039: URL: https://github.com/apache/hudi/pull/3039#issuecomment-914648327 ##

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): n3nash commented on pull request #2923: URL: https://github.com/apache/hudi/pull/2923#issuecomment-872765957 @vaibhav-sinha

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): hudi-bot edited a comment on pull request #2923: URL: https://github.com/apache/hudi/pull/2923#issuecomment-865487972 ##

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): hudi-bot edited a comment on pull request #2923: URL: https://github.com/apache/hudi/pull/2923#issuecomment-865487972 ##

[jira] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864 ] Raymond Xu deleted comment on HUDI-1864: -- was (Author: githubbot): vaibhav-sinha commented on pull request #2923: URL: https://github.com/apache/hudi/pull/2923#issuecomment-872778115 @n3nash

[jira] [Updated] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1864: - Fix Version/s: 0.12.0 (was: 0.11.0) > Support for java.time.LocalDate in TimestampB

[jira] [Assigned] (HUDI-1864) Support for java.time.LocalDate in TimestampBasedAvroKeyGenerator

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1864: Assignee: sivabalan narayanan (was: Vaibhav Sinha) > Support for java.time.LocalDate in TimestampB

[jira] [Assigned] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1602: Assignee: (was: Nishith Agarwal) > Corrupted Avro schema extracted from parquet file >

[jira] [Updated] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1602: - Fix Version/s: 0.11.0 (was: 0.12.0) > Corrupted Avro schema extracted from parquet

[jira] [Commented] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514258#comment-17514258 ] Raymond Xu commented on HUDI-1602: -- We should triage this to see if it is still a problem with spark

[jira] [Updated] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1602: - Status: Open (was: Patch Available) > Corrupted Avro schema extracted from parquet file > ---

[jira] [Updated] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1602: - Fix Version/s: 0.12.0 (was: 0.11.0) > Corrupted Avro schema extracted from parquet

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1280: - Issue Type: New Feature (was: Improvement) > Add tool to capture earliest or latest offsets in kafka topi

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1280: - Fix Version/s: 0.12.0 (was: 0.11.0) > Add tool to capture earliest or latest offset

[jira] [Updated] (HUDI-2832) [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2832: - Fix Version/s: 0.12.0 (was: 0.11.0) > [Umbrella] [RFC-40] Implement SnowflakeSyncTo

[jira] [Closed] (HUDI-3731) Failure to merging Column Stats Records

2022-03-29 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-3731. - Resolution: Fixed > Failure to merging Column Stats Records > >

[jira] [Updated] (HUDI-2832) [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2832: - Priority: Blocker (was: Major) > [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to Snowf

[GitHub] [hudi] hudi-bot commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-29 Thread GitBox
hudi-bot commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1082206671 ## CI report: * 333da7447af7d602ffa3067a759cecc62e4365d8 UNKNOWN * a119a051dbe0a0921b6bd58fbb0f1bbd3d647fa8 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org

[GitHub] [hudi] hudi-bot removed a comment on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-29 Thread GitBox
hudi-bot removed a comment on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1082054115 ## CI report: * 333da7447af7d602ffa3067a759cecc62e4365d8 UNKNOWN * ba049626f758b2eec3c5a7d2d3974c10369cf829 Azure: [FAILURE](https://dev.azure.com/apache-hud

[GitHub] [hudi] alexeykudinkin commented on a change in pull request #5168: [HUDI-3729][SPARK] fixed the per regression by enable vectorizeReader for parquet file

2022-03-29 Thread GitBox
alexeykudinkin commented on a change in pull request #5168: URL: https://github.com/apache/hudi/pull/5168#discussion_r837758932 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieBaseRelation.scala ## @@ -290,11 +290,8 @@ abstract class

[jira] [Closed] (HUDI-3021) HoodieAppendHandle#appendDataAndDeleteBlocks writer object occurred NPE

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-3021. Resolution: Fixed > HoodieAppendHandle#appendDataAndDeleteBlocks writer object occurred NPE > --

[jira] [Updated] (HUDI-3021) HoodieAppendHandle#appendDataAndDeleteBlocks writer object occurred NPE

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3021: - Fix Version/s: (was: 0.11.0) > HoodieAppendHandle#appendDataAndDeleteBlocks writer object occurred NPE

[jira] [Updated] (HUDI-3026) HoodieAppendhandle may result in duplicate key for hbase index

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3026: - Fix Version/s: 0.12.0 (was: 0.11.0) > HoodieAppendhandle may result in duplicate ke

[jira] [Updated] (HUDI-3255) Add HoodieFlinkSink for flink datastream api

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3255: - Labels: (was: sev:normal) > Add HoodieFlinkSink for flink datastream api > -

[jira] [Updated] (HUDI-3251) Add HoodieFlinkSource for flink datastream api

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3251: - Fix Version/s: 0.12.0 (was: 0.11.0) > Add HoodieFlinkSource for flink datastream ap

[jira] [Updated] (HUDI-3255) Add HoodieFlinkSink for flink datastream api

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3255: - Fix Version/s: 0.12.0 (was: 0.11.0) > Add HoodieFlinkSink for flink datastream api

[jira] [Updated] (HUDI-3304) support partial update on mor table

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3304: - Fix Version/s: 0.12.0 (was: 0.11.0) > support partial update on mor table > --

[jira] [Updated] (HUDI-3304) support partial update on mor table

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3304: - Issue Type: New Feature (was: Improvement) > support partial update on mor table > -

[jira] [Updated] (HUDI-3305) Drop deprecated util HDFSParquetImporter

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3305: - Fix Version/s: 0.12.0 (was: 0.11.0) > Drop deprecated util HDFSParquetImporter > --

[jira] [Updated] (HUDI-3321) HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded key field name

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3321: - Component/s: code-quality > HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded key field

[jira] [Updated] (HUDI-3350) Create Engine-specific Implementations of `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3350: - Issue Type: Improvement (was: Task) > Create Engine-specific Implementations of `HoodieRecord` >

[jira] [Updated] (HUDI-3321) HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded key field name

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3321: - Issue Type: Improvement (was: Task) > HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded

[jira] [Updated] (HUDI-3321) HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded key field name

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3321: - Component/s: metadata > HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded key field > na

[jira] [Updated] (HUDI-3349) Revisit HoodieRecord API to be able to replace HoodieRecordPayload

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3349: - Issue Type: Improvement (was: Task) > Revisit HoodieRecord API to be able to replace HoodieRecordPayload

[jira] [Updated] (HUDI-3378) Rebase `HoodieCreateHandle` to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3378: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase `HoodieCreateHandle` to operate on `Ho

[jira] [Updated] (HUDI-3353) Rebase `HoodieFileWriter` to accept `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3353: - Issue Type: Improvement (was: Task) > Rebase `HoodieFileWriter` to accept `HoodieRecord` > --

[jira] [Updated] (HUDI-3354) Rebase `HoodieRealtimeRecordReader` to return `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3354: - Issue Type: Improvement (was: Task) > Rebase `HoodieRealtimeRecordReader` to return `HoodieRecord` >

[jira] [Updated] (HUDI-3351) Rebase Record combining semantic into `HoodieRecordCombiningEngine`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3351: - Issue Type: Improvement (was: Task) > Rebase Record combining semantic into `HoodieRecordCombiningEngine`

[jira] [Updated] (HUDI-3378) Rebase `HoodieCreateHandle` to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3378: - Issue Type: Improvement (was: Task) > Rebase `HoodieCreateHandle` to operate on `HoodieRecord` >

[jira] [Updated] (HUDI-3354) Rebase `HoodieRealtimeRecordReader` to return `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3354: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase `HoodieRealtimeRecordReader` to return

[jira] [Updated] (HUDI-3379) Rebase `HoodieAppendHandle` to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3379: - Issue Type: Improvement (was: Task) > Rebase `HoodieAppendHandle` to operate on `HoodieRecord` >

[jira] [Updated] (HUDI-3380) Rebase `HoodieDataBlock`s to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3380: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase `HoodieDataBlock`s to operate on `Hood

[jira] [Updated] (HUDI-3381) Rebase `HoodieMergeHandle` to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3381: - Issue Type: Improvement (was: Bug) > Rebase `HoodieMergeHandle` to operate on `HoodieRecord` > --

[jira] [Updated] (HUDI-3385) Implement Spark-specific `FileReader`s

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3385: - Issue Type: Improvement (was: Task) > Implement Spark-specific `FileReader`s > --

[jira] [Updated] (HUDI-3385) Implement Spark-specific `FileReader`s

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3385: - Fix Version/s: 0.12.0 (was: 0.11.0) > Implement Spark-specific `FileReader`s >

[jira] [Updated] (HUDI-3379) Rebase `HoodieAppendHandle` to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3379: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase `HoodieAppendHandle` to operate on `Ho

[jira] [Updated] (HUDI-3380) Rebase `HoodieDataBlock`s to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3380: - Issue Type: Improvement (was: Bug) > Rebase `HoodieDataBlock`s to operate on `HoodieRecord` > ---

[jira] [Updated] (HUDI-3384) Implement Spark-specific FileWriters

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3384: - Fix Version/s: 0.12.0 (was: 0.11.0) > Implement Spark-specific FileWriters > --

[jira] [Updated] (HUDI-3384) Implement Spark-specific FileWriters

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3384: - Issue Type: Improvement (was: Task) > Implement Spark-specific FileWriters >

[jira] [Updated] (HUDI-3381) Rebase `HoodieMergeHandle` to operate on `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3381: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase `HoodieMergeHandle` to operate on `Hoo

[jira] [Updated] (HUDI-3353) Rebase `HoodieFileWriter` to accept `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3353: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase `HoodieFileWriter` to accept `HoodieRe

[jira] [Updated] (HUDI-3349) Revisit HoodieRecord API to be able to replace HoodieRecordPayload

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3349: - Fix Version/s: 0.12.0 (was: 0.11.0) > Revisit HoodieRecord API to be able to replac

[jira] [Updated] (HUDI-3351) Rebase Record combining semantic into `HoodieRecordCombiningEngine`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3351: - Fix Version/s: 0.12.0 (was: 0.11.0) > Rebase Record combining semantic into `Hoodie

[jira] [Updated] (HUDI-3447) Fix HoodieIncr source checkpoint not progressing and add support from drop / cast columns

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3447: - Priority: Major (was: Critical) > Fix HoodieIncr source checkpoint not progressing and add support from d

[jira] [Updated] (HUDI-3350) Create Engine-specific Implementations of `HoodieRecord`

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3350: - Fix Version/s: 0.12.0 (was: 0.11.0) > Create Engine-specific Implementations of `Ho

[jira] [Updated] (HUDI-3447) Fix HoodieIncr source checkpoint not progressing and add support from drop / cast columns

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3447: - Fix Version/s: 0.12.0 (was: 0.11.0) > Fix HoodieIncr source checkpoint not progress

[jira] [Commented] (HUDI-3447) Fix HoodieIncr source checkpoint not progressing and add support from drop / cast columns

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514252#comment-17514252 ] Raymond Xu commented on HUDI-3447: -- [~harsh1231] looks like there are 2 separate tasks in

[jira] [Updated] (HUDI-3447) Fix HoodieIncr source checkpoint not progressing and add support from drop / cast columns

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3447: - Component/s: deltastreamer > Fix HoodieIncr source checkpoint not progressing and add support from drop /

[jira] [Commented] (HUDI-3487) The global index is enabled regardless of changlog

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514250#comment-17514250 ] Raymond Xu commented on HUDI-3487: -- [~waywtdcc] [~danny0405] do we target this for 0.11 ?

[jira] [Updated] (HUDI-3447) Fix HoodieIncr source checkpoint not progressing and add support from drop / cast columns

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3447: - Issue Type: New Feature (was: Bug) > Fix HoodieIncr source checkpoint not progressing and add support fro

[jira] [Updated] (HUDI-3487) The global index is enabled regardless of changlog

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3487: - Fix Version/s: 0.12.0 (was: 0.11.0) > The global index is enabled regardless of cha

[jira] [Updated] (HUDI-1258) Small file handling Merges can be handled without actual merging

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1258: - Fix Version/s: 0.12.0 (was: 0.11.0) > Small file handling Merges can be handled wit

[jira] [Updated] (HUDI-3512) Support Stats command based on Call Produce Command

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3512: - Fix Version/s: 0.12.0 (was: 0.11.0) > Support Stats command based on Call Produce C

[jira] [Updated] (HUDI-3518) Make HiveSchemaProvider support AWS Glue Catalog

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3518: - Fix Version/s: 0.12.0 (was: 0.11.0) > Make HiveSchemaProvider support AWS Glue Cata

[jira] [Updated] (HUDI-3518) Make HiveSchemaProvider support AWS Glue Catalog

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3518: - Issue Type: New Feature (was: Improvement) > Make HiveSchemaProvider support AWS Glue Catalog > -

[jira] [Updated] (HUDI-3523) Introduce AddColumnSchemaPostProcessor to support add columns to the end of a schema

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3523: - Issue Type: New Feature (was: Task) > Introduce AddColumnSchemaPostProcessor to support add columns to th

[jira] [Assigned] (HUDI-3551) Add OCS StorageScheme to support Oracle Cloud

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3551: Assignee: Carter Shanklin (was: Rajesh) > Add OCS StorageScheme to support Oracle Cloud >

[jira] [Updated] (HUDI-3523) Introduce AddColumnSchemaPostProcessor to support add columns to the end of a schema

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3523: - Fix Version/s: 0.12.0 (was: 0.11.0) > Introduce AddColumnSchemaPostProcessor to sup

[jira] [Updated] (HUDI-3551) Add OCS StorageScheme to support Oracle Cloud

2022-03-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3551: - Status: In Progress (was: Open) > Add OCS StorageScheme to support Oracle Cloud > ---
