[GitHub] [hudi] hudi-bot commented on pull request #4334: [HUDI-3011] Adding ability to read entire data with HoodieIncrSource with empty checkpoint

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4334: URL: https://github.com/apache/hudi/pull/4334#issuecomment-995354031 ## CI report: * 0d42f0084e7918d69816217c962aacc85259c9c8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3011) Add support to start incremental consumption from begin time rather than latest commit time with S3EventsHoodieIncrSource

2021-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3011: - Labels: core-flow-ds pull-request-available sev:normal (was: core-flow-ds sev:normal) > Add

[GitHub] [hudi] nsivabalan opened a new pull request #4334: [HUDI-3011] Adding ability to read entire data with HoodieIncrSource with empty checkpoint

2021-12-15 Thread GitBox
nsivabalan opened a new pull request #4334: URL: https://github.com/apache/hudi/pull/4334 ## What is the purpose of the pull request - HoodieIncremental source is used to read incrementally from a hudi table. When a deltastreamer is started to consume from source table and no

[GitHub] [hudi] hudi-bot commented on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995352818 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995351561 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995350274 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995351561 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] yanghua commented on issue #4229: [SUPPORT] Exception in thread "main" java.lang.IllegalArgumentException: Can't find primaryKey `uuid` in root

2021-12-15 Thread GitBox
yanghua commented on issue #4229: URL: https://github.com/apache/hudi/issues/4229#issuecomment-995350604 > Thank you all. Passing a primary is the way then. We are good to close for me. thanks, closing now. -- This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] yanghua closed issue #4229: [SUPPORT] Exception in thread "main" java.lang.IllegalArgumentException: Can't find primaryKey `uuid` in root

2021-12-15 Thread GitBox
yanghua closed issue #4229: URL: https://github.com/apache/hudi/issues/4229 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995336374 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995350274 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4300: [HUDI-2785] Add Trino setup in Docker Demo

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4300: URL: https://github.com/apache/hudi/pull/4300#issuecomment-995347403 ## CI report: * a31dff6167f0cebd6ac8b7c2f8815234d9fec344 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4300: [HUDI-2785] Add Trino setup in Docker Demo

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4300: URL: https://github.com/apache/hudi/pull/4300#issuecomment-995348836 ## CI report: * a31dff6167f0cebd6ac8b7c2f8815234d9fec344 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4300: [HUDI-2785] Add Trino setup in Docker Demo

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4300: URL: https://github.com/apache/hudi/pull/4300#issuecomment-993035697 ## CI report: * a31dff6167f0cebd6ac8b7c2f8815234d9fec344 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4300: [HUDI-2785] Add Trino setup in Docker Demo

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4300: URL: https://github.com/apache/hudi/pull/4300#issuecomment-995347403 ## CI report: * a31dff6167f0cebd6ac8b7c2f8815234d9fec344 Azure:

[hudi] branch asf-site updated: [DOCS] Fix the "Edit this page" config and add 6 cn docs. (#3859)

2021-12-15 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 914c47a [DOCS] Fix the "Edit this page"

[GitHub] [hudi] yihua merged pull request #3859: [DOCS] Fix the "Edit this page" config and add 6 cn docs.

2021-12-15 Thread GitBox
yihua merged pull request #3859: URL: https://github.com/apache/hudi/pull/3859

[GitHub] [hudi] yihua commented on pull request #3859: [DOCS] Fix the "Edit this page" config and add 6 cn docs.

2021-12-15 Thread GitBox
yihua commented on pull request #3859: URL: https://github.com/apache/hudi/pull/3859#issuecomment-995344882 > > @laurieliyang Could you check the failed build? Also, it looks like the commits you pushed don't include the changes resolving my comments. > > Hello, I have fixed the

[GitHub] [hudi] hudi-bot commented on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995336374 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995334814 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f UNKNOWN

[GitHub] [hudi] hudi-bot commented on pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-995334814 ## CI report: * 7bce55782061189c539ae7acb1e9b87ade7cdf7f UNKNOWN

[GitHub] [hudi] alexeykudinkin opened a new pull request #4333: Adding support for Parquet in MOR `LogBlock`s

2021-12-15 Thread GitBox
alexeykudinkin opened a new pull request #4333: URL: https://github.com/apache/hudi/pull/4333 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[GitHub] [hudi] yihua commented on a change in pull request #4078: [HUDI-2833] Clean up unused archive files instead of expanding indefinitely.

2021-12-15 Thread GitBox
yihua commented on a change in pull request #4078: URL: https://github.com/apache/hudi/pull/4078#discussion_r77013 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieCompactionConfig.java ## @@ -249,6 +249,19 @@ + "record

[GitHub] [hudi] yihua commented on pull request #4078: [HUDI-2833] Clean up unused archive files instead of expanding indefinitely.

2021-12-15 Thread GitBox
yihua commented on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-995316339 As discussed offline, we should warn users to avoid the config if they don't understand the mechanism. They should only use it if they know what they are doing. We can follow up

[GitHub] [hudi] hudi-bot commented on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-995314435 ## CI report: * 570a5116cdffd2a0aeac94051c6a8e00bc5a7363 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-995284995 ## CI report: * 2605eeb89e99f7c80621715c5d980661f6034fdc Azure:

[GitHub] [hudi] yihua commented on a change in pull request #4268: [2970] Adding tests for archival of replace commit actions

2021-12-15 Thread GitBox
yihua commented on a change in pull request #4268: URL: https://github.com/apache/hudi/pull/4268#discussion_r770127985 ## File path: hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestCOWDataSourceStorage.scala ## @@ -170,4 +175,73 @@ class

[GitHub] [hudi] manojpec commented on pull request #4259: [HUDI-2962] Local process lock provider to guard single writer process with async table operations

2021-12-15 Thread GitBox
manojpec commented on pull request #4259: URL: https://github.com/apache/hudi/pull/4259#issuecomment-995311560 @vinothchandar 1. Yes, this is release-notes worthy and needs to be documented 2. Yes, benefits all CI tests having lock providers currently configured and that are all

[hudi] branch master updated: [HUDI-2998] claiming rfc number for consistent hashing index (#4303)

2021-12-15 Thread yihua
yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new f5b07a7 [HUDI-2998] claiming rfc number for

[GitHub] [hudi] yihua merged pull request #4303: [HUDI-2998] Claiming RFC number for consistent hashing index

2021-12-15 Thread GitBox
yihua merged pull request #4303: URL: https://github.com/apache/hudi/pull/4303

[GitHub] [hudi] yihua commented on pull request #4303: [HUDI-2998] Claiming RFC number for consistent hashing index

2021-12-15 Thread GitBox
yihua commented on pull request #4303: URL: https://github.com/apache/hudi/pull/4303#issuecomment-995304863 Merging this as this only touches `rfc` folder.

[hudi] branch master updated: [HUDI-3028] Use blob storage to speed up CI downloads (#4331)

2021-12-15 Thread yihua
yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 27907de [HUDI-3028] Use blob storage to speed up

[GitHub] [hudi] yihua merged pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
yihua merged pull request #4331: URL: https://github.com/apache/hudi/pull/4331

[GitHub] [hudi] yihua commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
yihua commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995298431 IT passes so I'm going to merge this.

[jira] [Updated] (HUDI-3030) Make LocalLockProvider as the default when async table services are turned on

2021-12-15 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-3030: - Story Points: 6 (was: 8) > Make LocalLockProvider as the default when async table

[jira] [Updated] (HUDI-2962) Support JVM based local process lock provider implementation

2021-12-15 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2962: - Story Points: 8 > Support JVM based local process lock provider implementation >

[GitHub] [hudi] hudi-bot commented on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-995284995 ## CI report: * 2605eeb89e99f7c80621715c5d980661f6034fdc Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-995283319 ## CI report: * 2605eeb89e99f7c80621715c5d980661f6034fdc Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995250330 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-994141230 ## CI report: * 2605eeb89e99f7c80621715c5d980661f6034fdc Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995283403 ## CI report: * 202e3525f5fcf564ea511199ae15fd3c263a0086 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-995283319 ## CI report: * 2605eeb89e99f7c80621715c5d980661f6034fdc Azure:

[jira] [Commented] (HUDI-3031) TestHoodieDeltaStreamerWithMultiWriter time out due to async services and writer deadlock

2021-12-15 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17460314#comment-17460314 ] Manoj Govindassamy commented on HUDI-3031: --   {code:java}

[jira] [Commented] (HUDI-3031) TestHoodieDeltaStreamerWithMultiWriter time out due to async services and writer deadlock

2021-12-15 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17460313#comment-17460313 ] Manoj Govindassamy commented on HUDI-3031: -- Ref: https://issues.apache.org/jira/browse/HUDI-3029

[jira] [Created] (HUDI-3031) TestHoodieDeltaStreamerWithMultiWriter time out due to async services and writer deadlock

2021-12-15 Thread Manoj Govindassamy (Jira)
Manoj Govindassamy created HUDI-3031: Summary: TestHoodieDeltaStreamerWithMultiWriter time out due to async services and writer deadlock Key: HUDI-3031 URL: https://issues.apache.org/jira/browse/HUDI-3031

[jira] (HUDI-3011) Add support to start incremental consumption from begin time rather than latest commit time with S3EventsHoodieIncrSource

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3011 ] sivabalan narayanan deleted comment on HUDI-3011: --- was (Author: shivnarayan): Actually this is not an issue. Even reading with latest should fetch all the data. We don't ever delete the

[jira] [Updated] (HUDI-1850) Read on table fails if the first write to table failed

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1850: -- Labels: core-flow-ds pull-request-available release-blocker sev:high spark (was:

[jira] [Assigned] (HUDI-1850) Read on table fails if the first write to table failed

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1850: - Assignee: sivabalan narayanan > Read on table fails if the first write to table

[jira] [Updated] (HUDI-2983) Remove all Log4j2 transitive dependencies

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2983: -- Labels: pull-request-available sev:critical (was: pull-request-available sev:high) >

[jira] [Updated] (HUDI-2958) Automatically set spark.sql.parquet.writelegacyformat; When using bulkinsert to insert data which contains decimal Type.

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2958: -- Status: In Progress (was: Open) > Automatically set

[jira] [Updated] (HUDI-2958) Automatically set spark.sql.parquet.writelegacyformat; When using bulkinsert to insert data which contains decimal Type.

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2958: -- Status: Patch Available (was: In Progress) > Automatically set

[GitHub] [hudi] nsivabalan closed pull request #4332: [WIP] Fixing synchronization in TransactionManager

2021-12-15 Thread GitBox
nsivabalan closed pull request #4332: URL: https://github.com/apache/hudi/pull/4332

[GitHub] [hudi] hudi-bot commented on pull request #4332: [WIP] Fixing synchronization in TransactionManager

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4332: URL: https://github.com/apache/hudi/pull/4332#issuecomment-995256932 ## CI report: * 610850be5265d7a0a070dde9a92f943cd17e81ca UNKNOWN

[GitHub] [hudi] nsivabalan opened a new pull request #4332: [WIP] Fixing synchronization in TransactionManager

2021-12-15 Thread GitBox
nsivabalan opened a new pull request #4332: URL: https://github.com/apache/hudi/pull/4332 ## What is the purpose of the pull request There could be a potential deadlock with the way methods in TransactionManager are synchronized. Prior to this patch, all begin and end transaction

[GitHub] [hudi] hudi-bot commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995250330 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995248591 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995228560 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995248591 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995228560 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995204075 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] rubenssoto commented on pull request #4214: [HUDI-2928] Switching default Parquet's column encoding to zstd

2021-12-15 Thread GitBox
rubenssoto commented on pull request #4214: URL: https://github.com/apache/hudi/pull/4214#issuecomment-995226179 @alexeykudinkin but is it possible to use zstd in Hudi today, changing a config?

[GitHub] [hudi] hudi-bot commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995204075 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995202239 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe UNKNOWN

[jira] [Updated] (HUDI-3028) Spark binary download sometimes takes a long time in Azure CI IT tests

2021-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3028: - Labels: pull-request-available (was: ) > Spark binary download sometimes takes a long time in

[GitHub] [hudi] hudi-bot commented on pull request #4331: [HUDI-3028] Use blob storage to speed up CI downloads

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4331: URL: https://github.com/apache/hudi/pull/4331#issuecomment-995202239 ## CI report: * ff7fbef169ff9529b1ba6f27220ac0220c8bacbe UNKNOWN

[jira] [Assigned] (HUDI-3028) Spark binary download sometimes takes a long time in Azure CI IT tests

2021-12-15 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3028: Assignee: Raymond Xu > Spark binary download sometimes takes a long time in Azure CI IT tests >

[jira] [Created] (HUDI-3030) Make LocalLockProvider as the default when async table services are turned on

2021-12-15 Thread Manoj Govindassamy (Jira)
Manoj Govindassamy created HUDI-3030: Summary: Make LocalLockProvider as the default when async table services are turned on Key: HUDI-3030 URL: https://issues.apache.org/jira/browse/HUDI-3030

[GitHub] [hudi] hudi-bot removed a comment on pull request #4308: [HUDI-3008] Fixes bug in hoodieDeltaStreamer for nested partition lookup

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4308: URL: https://github.com/apache/hudi/pull/4308#issuecomment-995117146 ## CI report: * 7b565b9c55c651602636639115d67ad54ed1a369 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4308: [HUDI-3008] Fixes bug in hoodieDeltaStreamer for nested partition lookup

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4308: URL: https://github.com/apache/hudi/pull/4308#issuecomment-995182346 ## CI report: * 1be190032b1b6da5c07fc81bb2fdd42903a48185 Azure:

[jira] [Created] (HUDI-3029) TransactionManager synchronized begin/endTransaction() leading to deadlock

2021-12-15 Thread Manoj Govindassamy (Jira)
Manoj Govindassamy created HUDI-3029: Summary: TransactionManager synchronized begin/endTransaction() leading to deadlock Key: HUDI-3029 URL: https://issues.apache.org/jira/browse/HUDI-3029

[jira] [Updated] (HUDI-2780) Mor reads the log file and skips the complete block as a bad block, resulting in data loss

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2780: -- Status: Patch Available (was: In Progress) > Mor reads the log file and skips the

[jira] [Updated] (HUDI-2270) Remove corrupted clean action

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2270: -- Status: In Progress (was: Open) > Remove corrupted clean action >

[jira] [Updated] (HUDI-2675) Not an Avro data file

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2675: -- Status: In Progress (was: Open) > Not an Avro data file > - > >

[jira] [Updated] (HUDI-2780) Mor reads the log file and skips the complete block as a bad block, resulting in data loss

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2780: -- Status: In Progress (was: Open) > Mor reads the log file and skips the complete block

[jira] [Updated] (HUDI-2675) Not an Avro data file

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2675: -- Status: Patch Available (was: In Progress) > Not an Avro data file >

[jira] [Updated] (HUDI-2270) Remove corrupted clean action

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2270: -- Status: Patch Available (was: In Progress) > Remove corrupted clean action >

[hudi] branch asf-site updated: [HUDI-2543]: Added guides section (#3776)

2021-12-15 Thread sivabalan
sivabalan pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new af6d23d [HUDI-2543]: Added guides section

[GitHub] [hudi] nsivabalan merged pull request #3776: [HUDI-2543]: Added guides section

2021-12-15 Thread GitBox
nsivabalan merged pull request #3776: URL: https://github.com/apache/hudi/pull/3776

[GitHub] [hudi] nsivabalan commented on pull request #4203: [HUDI-2909] Handle logical type in TimestampBasedKeyGenerator

2021-12-15 Thread GitBox
nsivabalan commented on pull request #4203: URL: https://github.com/apache/hudi/pull/4203#issuecomment-995170388 @codope : is this good to review? If there is any pending work, I can take it up as I am focusing on all issues and jiras. Let me know.

[GitHub] [hudi] nsivabalan commented on a change in pull request #4253: [HUDI-2958] Automatically set spark.sql.parquet.writelegacyformat, when using bulkinsert to insert data which contains decimalTy

2021-12-15 Thread GitBox
nsivabalan commented on a change in pull request #4253: URL: https://github.com/apache/hudi/pull/4253#discussion_r769998324 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/java/org/apache/hudi/DataSourceUtils.java ## @@ -309,4 +311,15 @@ public static

[GitHub] [hudi] nsivabalan closed pull request #4312: [DO NOT MERGE][MINOR] Increase timeouts for IT tests

2021-12-15 Thread GitBox
nsivabalan closed pull request #4312: URL: https://github.com/apache/hudi/pull/4312

[GitHub] [hudi] nsivabalan closed pull request #4314: [DO NOT MERGE][MINOR] Remove HDFS safemode wait in ITTestHoodieDemo

2021-12-15 Thread GitBox
nsivabalan closed pull request #4314: URL: https://github.com/apache/hudi/pull/4314

[GitHub] [hudi] nsivabalan closed pull request #4313: [DO NOT MERGE][MINOR] Disable ITTestHoodieSanity

2021-12-15 Thread GitBox
nsivabalan closed pull request #4313: URL: https://github.com/apache/hudi/pull/4313

[GitHub] [hudi] nsivabalan closed pull request #4315: [DO NOT MERGE][MINOR] Add first few lines of setupDemo() to ITHoodieSanity

2021-12-15 Thread GitBox
nsivabalan closed pull request #4315: URL: https://github.com/apache/hudi/pull/4315

[GitHub] [hudi] nsivabalan closed pull request #4321: [WIP][DO_NOT_MERGE] Adding a delay to integ tests just after init

2021-12-15 Thread GitBox
nsivabalan closed pull request #4321: URL: https://github.com/apache/hudi/pull/4321

[GitHub] [hudi] nsivabalan closed pull request #4322: [WIP][DO_NOT_MERGE] Increasing timer for docker compose health checks

2021-12-15 Thread GitBox
nsivabalan closed pull request #4322: URL: https://github.com/apache/hudi/pull/4322

[GitHub] [hudi] nsivabalan closed pull request #4325: [WIP][DO_NOT_MERGE] Adding health check dependency on namenode for history server in docker compose

2021-12-15 Thread GitBox
nsivabalan closed pull request #4325: URL: https://github.com/apache/hudi/pull/4325

[GitHub] [hudi] hudi-bot removed a comment on pull request #4308: [HUDI-3008] Fixes bug in hoodieDeltaStreamer for nested partition lookup

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4308: URL: https://github.com/apache/hudi/pull/4308#issuecomment-995114144 ## CI report: * 7b565b9c55c651602636639115d67ad54ed1a369 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4308: [HUDI-3008] Fixes bug in hoodieDeltaStreamer for nested partition lookup

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4308: URL: https://github.com/apache/hudi/pull/4308#issuecomment-995117146 ## CI report: * 7b565b9c55c651602636639115d67ad54ed1a369 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4308: [HUDI-3008] Fixes bug in hoodieDeltaStreamer for nested partition lookup

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4308: URL: https://github.com/apache/hudi/pull/4308#issuecomment-995114144 ## CI report: * 7b565b9c55c651602636639115d67ad54ed1a369 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4308: [HUDI-3008] Fixes bug in hoodieDeltaStreamer for nested partition lookup

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4308: URL: https://github.com/apache/hudi/pull/4308#issuecomment-994957028 ## CI report: * 7b565b9c55c651602636639115d67ad54ed1a369 Azure:

[jira] [Resolved] (HUDI-3025) Integ tests are failing in azure CI with namenode going to safe mode

2021-12-15 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-3025. --- > Integ tests are failing in azure CI with namenode going to safe mode >

[GitHub] [hudi] hudi-bot commented on pull request #4330: [HUDI-3027] Update hudi-examples README.md

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4330: URL: https://github.com/apache/hudi/pull/4330#issuecomment-995110159 ## CI report: * dd2a94bb59ed2e8b1bdc2b95d19731a26c812445 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4330: [HUDI-3027] Update hudi-examples README.md

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4330: URL: https://github.com/apache/hudi/pull/4330#issuecomment-995044126 ## CI report: * dd2a94bb59ed2e8b1bdc2b95d19731a26c812445 Azure:

[hudi] branch master updated (9a2030a -> 3b89457)

2021-12-15 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 9a2030a [HUDI-3024] Add explicit write handler for flink (#4329) add 3b89457 [HUDI-3025] Add additional wait

[GitHub] [hudi] nsivabalan merged pull request #4328: [HUDI-3025] Add additional wait time for namenode availability during IT tests initiatialization

2021-12-15 Thread GitBox
nsivabalan merged pull request #4328: URL: https://github.com/apache/hudi/pull/4328

[GitHub] [hudi] hudi-bot removed a comment on pull request #4328: [HUDI-3025] Add additional wait time for namenode availability during IT tests initiatialization

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4328: URL: https://github.com/apache/hudi/pull/4328#issuecomment-994997061 ## CI report: * dd7af7f264b7edf07934898bd468a877a350f611 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4330: [HUDI-3027] Update hudi-examples README.md

2021-12-15 Thread GitBox
hudi-bot removed a comment on pull request #4330: URL: https://github.com/apache/hudi/pull/4330#issuecomment-995041827 ## CI report: * dd2a94bb59ed2e8b1bdc2b95d19731a26c812445 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4330: [HUDI-3027] Update hudi-examples README.md

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4330: URL: https://github.com/apache/hudi/pull/4330#issuecomment-995044126 ## CI report: * dd2a94bb59ed2e8b1bdc2b95d19731a26c812445 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4328: [HUDI-3025] Add additional wait time for namenode availability during IT tests initiatialization

2021-12-15 Thread GitBox
hudi-bot commented on pull request #4328: URL: https://github.com/apache/hudi/pull/4328#issuecomment-995044091 ## CI report: * 735771bb210fc0ac795074b12d520ceef133ead0 Azure:
