[GitHub] [hudi] hudi-bot commented on pull request #6516: [HUDI-4729] Fix fq can not be queried in pending compaction when query ro table with spark

2022-09-19 Thread GitBox
hudi-bot commented on PR #6516: URL: https://github.com/apache/hudi/pull/6516#issuecomment-1250906873 ## CI report: * 52bcb16f2611c4476ede864cadd79a32fb6e94fa Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1141

[GitHub] [hudi] hudi-bot commented on pull request #6717: [HUDI-4877] Fix org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct issue

2022-09-19 Thread GitBox
hudi-bot commented on PR #6717: URL: https://github.com/apache/hudi/pull/6717#issuecomment-1250897552 ## CI report: * 668b55c8b04a755f1a7b3d135bc8acea8c04e844 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1150

[GitHub] [hudi] hudi-bot commented on pull request #6714: Bump gson from 2.6.2 to 2.8.9 in /hudi-cli

2022-09-19 Thread GitBox
hudi-bot commented on PR #6714: URL: https://github.com/apache/hudi/pull/6714#issuecomment-1250897497 ## CI report: * fbe99e06d4a077fb6621a14345ba595bdf4f0421 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1149

[GitHub] [hudi] hudi-bot commented on pull request #6717: [HUDI-4877] Fix org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct issue

2022-09-19 Thread GitBox
hudi-bot commented on PR #6717: URL: https://github.com/apache/hudi/pull/6717#issuecomment-1250892543 ## CI report: * 668b55c8b04a755f1a7b3d135bc8acea8c04e844 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5920: [HUDI-4326] add updateTableSerDeInfo for HiveSyncTool

2022-09-19 Thread GitBox
hudi-bot commented on PR #5920: URL: https://github.com/apache/hudi/pull/5920#issuecomment-1250891247 ## CI report: * ae06beaf934497c0ecfe70bf285440ce1bc264e3 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1047

[GitHub] [hudi] eric9204 commented on pull request #6710: [HUDI-4810] Fix log4j imports to use bridge API

2022-09-19 Thread GitBox
eric9204 commented on PR #6710: URL: https://github.com/apache/hudi/pull/6710#issuecomment-1250890416 @xushiyan In the following four files, Log4j 2.x API is also used. I think it is unnecessary to change these files, because the lower version has no corresponding API. any suggestion?

[GitHub] [hudi] hudi-bot commented on pull request #5920: [HUDI-4326] add updateTableSerDeInfo for HiveSyncTool

2022-09-19 Thread GitBox
hudi-bot commented on PR #5920: URL: https://github.com/apache/hudi/pull/5920#issuecomment-1250886045 ## CI report: * ae06beaf934497c0ecfe70bf285440ce1bc264e3 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1047

[GitHub] [hudi] loukey-lj commented on a diff in pull request #6704: [HUDI-4869] Fix test for HUDI-4780

2022-09-19 Thread GitBox
loukey-lj commented on code in PR #6704: URL: https://github.com/apache/hudi/pull/6704#discussion_r974129124 ## hudi-common/src/test/java/org/apache/hudi/common/functional/TestHoodieLogFormat.java: ## @@ -124,12 +125,13 @@ public static void tearDownClass() { @BeforeEach

[jira] [Closed] (HUDI-4832) Hive Sync can potentially drop all partitions

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-4832. Resolution: Fixed > Hive Sync can potentially drop all partitions > ---

[hudi] branch master updated (1b2792cefa -> 8962946996)

2022-09-19 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 1b2792cefa [minor] following 3304, some code refactoring (#6713) add 8962946996 [HUDI-4832] Fix drop partition me

[GitHub] [hudi] xushiyan closed issue #6277: [SUPPORT] HiveSyncTool: missing partitions

2022-09-19 Thread GitBox
xushiyan closed issue #6277: [SUPPORT] HiveSyncTool: missing partitions URL: https://github.com/apache/hudi/issues/6277 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsub

[GitHub] [hudi] xushiyan merged pull request #6662: [HUDI-4832] Fix drop partition meta sync

2022-09-19 Thread GitBox
xushiyan merged PR #6662: URL: https://github.com/apache/hudi/pull/6662 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[jira] [Updated] (HUDI-3407) Make sure Restore operation is Not Concurrent w/ Writes in Multi-Writer scenario

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3407: - Sprint: (was: 2022/09/19) > Make sure Restore operation is Not Concurrent w/ Writes in Multi-Writer > s

[jira] [Updated] (HUDI-4631) Enhance retries for failed writes w/ write conflicts in a multi writer scenarios

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4631: - Sprint: (was: 2022/09/19) > Enhance retries for failed writes w/ write conflicts in a multi writer > sc

[jira] [Updated] (HUDI-4137) Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4137: - Sprint: (was: 2022/09/19) > Implement SnowflakeSyncTool to support Hudi to Snowflake Integration > -

[jira] [Updated] (HUDI-1885) Support Delete/Update Non-Pk Table

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1885: - Sprint: (was: 2022/09/19) > Support Delete/Update Non-Pk Table > -- > >

[jira] [Updated] (HUDI-4254) Refactor SnowflakeSyncTool and BigQuerySyncTool

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4254: - Sprint: (was: 2022/09/19) > Refactor SnowflakeSyncTool and BigQuerySyncTool > --

[jira] [Updated] (HUDI-3019) Upserts with Dataype promotion only to a subset of partition fails

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3019: - Sprint: (was: 2022/09/19) > Upserts with Dataype promotion only to a subset of partition fails > ---

[jira] [Updated] (HUDI-4112) Relax constraint in metadata table that rollback of a commit that got archived in MDT throws exception

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4112: - Sprint: (was: 2022/09/19) > Relax constraint in metadata table that rollback of a commit that got > arc

[jira] [Updated] (HUDI-3617) MOR compact improve

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3617: - Sprint: (was: 2022/09/19) > MOR compact improve > --- > > Key: HUDI-3617

[jira] [Updated] (HUDI-3786) how to deduce what MDT partitions to update on the write path w/ async indeing

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3786: - Sprint: (was: 2022/09/19) > how to deduce what MDT partitions to update on the write path w/ async indei

[jira] [Updated] (HUDI-3216) Support timestamp with microseconds precision

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3216: - Sprint: (was: 2022/09/19) > Support timestamp with microseconds precision >

[GitHub] [hudi] TJX2014 commented on pull request #6717: [HUDI-4877] Fix org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct issue

2022-09-19 Thread GitBox
TJX2014 commented on PR #6717: URL: https://github.com/apache/hudi/pull/6717#issuecomment-1250861727 Hi @danny0405 , sorry for this ping due to the test code seems not failed when org.apache.hudi.index.bucket.HoodieSimpleBucketIndex#loadPartitionBucketIdFileIdMapping is not corrected. --

[jira] [Updated] (HUDI-3467) Check shutdown logic with async compaction in Spark Structured Streaming

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3467: - Sprint: (was: 2022/09/19) > Check shutdown logic with async compaction in Spark Structured Streaming > -

[GitHub] [hudi] TJX2014 commented on pull request #6717: [HUDI-4877] Fix org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct issue

2022-09-19 Thread GitBox
TJX2014 commented on PR #6717: URL: https://github.com/apache/hudi/pull/6717#issuecomment-1250860964 Hi @danny0405 , sorry for this ping due to the test code seems not failed when org.apache.hudi.index.bucket.HoodieSimpleBucketIndex#loadPartitionBucketIdFileIdMapping is not corrected. --

[jira] [Updated] (HUDI-3818) hudi doesn't support bytes column as primary key

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3818: - Sprint: (was: 2022/09/19) > hudi doesn't support bytes column as primary key > -

[jira] [Updated] (HUDI-4877) org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct

2022-09-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4877: - Labels: pull-request-available (was: ) > org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex

[GitHub] [hudi] TJX2014 opened a new pull request, #6717: [HUDI-4877] Fix org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct issue

2022-09-19 Thread GitBox
TJX2014 opened a new pull request, #6717: URL: https://github.com/apache/hudi/pull/6717 ### Change Logs Make org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation work correct ### Impact Correct test case. **Risk level: none | low | medium | high**

[jira] [Updated] (HUDI-4497) Vet all critical code paths for double-checked locking

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4497: - Sprint: (was: 2022/09/19) > Vet all critical code paths for double-checked locking > ---

[jira] [Assigned] (HUDI-4777) Flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4777: Assignee: JinxinTang > Flink gen bucket index of mor table not consistent with spark lead to > dup

[jira] [Updated] (HUDI-4777) Flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4777: - Priority: Critical (was: Major) > Flink gen bucket index of mor table not consistent with spark lead to

[jira] [Updated] (HUDI-3717) Avoid double-listing w/in BaseHoodieTableFileIndex

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3717: - Sprint: (was: 2022/09/19) > Avoid double-listing w/in BaseHoodieTableFileIndex > ---

[jira] [Updated] (HUDI-4860) Presto/Trino Cannot parse partition value '\N' of type 'integer' for partition column

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4860: - Story Points: 3 > Presto/Trino Cannot parse partition value '\N' of type 'integer' for > partition column

[jira] [Updated] (HUDI-4860) Presto/Trino Cannot parse partition value '\N' of type 'integer' for partition column

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4860: - Priority: Blocker (was: Major) > Presto/Trino Cannot parse partition value '\N' of type 'integer' for >

[jira] [Updated] (HUDI-4860) Presto/Trino Cannot parse partition value '\N' of type 'integer' for partition column

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4860: - Fix Version/s: 0.12.1 > Presto/Trino Cannot parse partition value '\N' of type 'integer' for > partition

[jira] [Created] (HUDI-4877) org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct

2022-09-19 Thread JinxinTang (Jira)
JinxinTang created HUDI-4877: Summary: org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct Key: HUDI-4877 URL: https://issues.apache.org/jira/browse/HUDI-4877 Projec

[jira] [Updated] (HUDI-4860) Presto/Trino Cannot parse partition value '\N' of type 'integer' for partition column

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4860: - Labels: hudi-on-call (was: ) > Presto/Trino Cannot parse partition value '\N' of type 'integer' for > pa

[jira] [Updated] (HUDI-4281) Using hudi to build a large number of tables in spark on hive causes OOM

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4281: - Story Points: 1 > Using hudi to build a large number of tables in spark on hive causes OOM > -

[jira] [Updated] (HUDI-3892) Add HoodieReadClient with java

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3892: - Sprint: (was: 2022/09/19) > Add HoodieReadClient with java > -- > >

[jira] [Updated] (HUDI-4281) Using hudi to build a large number of tables in spark on hive causes OOM

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4281: - Story Points: 0.5 (was: 1) > Using hudi to build a large number of tables in spark on hive causes OOM > -

[jira] [Updated] (HUDI-3055) Make sure that Compression Codec configuration is respected across the board

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3055: - Sprint: (was: 2022/09/19) > Make sure that Compression Codec configuration is respected across the board

[jira] [Updated] (HUDI-4457) Make sure IT docker test return code non-zero when failed

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4457: - Sprint: (was: 2022/09/19) > Make sure IT docker test return code non-zero when failed >

[jira] [Updated] (HUDI-4716) Avoid bundle parquet in hadoop-mr

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4716: - Sprint: (was: 2022/09/19) > Avoid bundle parquet in hadoop-mr > - > >

[jira] [Updated] (HUDI-3881) Implement index syntax for spark sql

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3881: - Sprint: (was: 2022/09/19) > Implement index syntax for spark sql >

[jira] [Updated] (HUDI-3649) Add HoodieTableConfig defaults to HoodieWriteConfig

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3649: - Sprint: (was: 2022/09/19) > Add HoodieTableConfig defaults to HoodieWriteConfig > --

[jira] [Updated] (HUDI-4021) Support deferring compaction when there is an inflight delta commit

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4021: - Sprint: (was: 2022/09/19) > Support deferring compaction when there is an inflight delta commit > --

[jira] [Updated] (HUDI-4467) Port borrowed code from Spark 3.3

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4467: - Sprint: (was: 2022/09/19) > Port borrowed code from Spark 3.3 > - > >

[jira] [Updated] (HUDI-4789) Convert FileSystem usage in hudi connector to use TrinoFileSystem interface

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4789: - Sprint: (was: 2022/09/19) > Convert FileSystem usage in hudi connector to use TrinoFileSystem interface

[jira] [Updated] (HUDI-1593) Add support for "show restore" in hudi-cli

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1593: - Sprint: (was: 2022/09/19) > Add support for "show restore" in hudi-cli > ---

[jira] [Updated] (HUDI-2580) Ability to clean up dangling data files using hudi-cli

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2580: - Sprint: (was: 2022/09/19) > Ability to clean up dangling data files using hudi-cli > ---

[jira] [Updated] (HUDI-4778) Upgrade to 0.12.1 Hudi version

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4778: - Sprint: (was: 2022/09/19) > Upgrade to 0.12.1 Hudi version > -- > >

[jira] [Updated] (HUDI-3819) upgrade spring cve-2022-22965

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3819: - Sprint: (was: 2022/09/19) > upgrade spring cve-2022-22965 > - > >

[jira] [Updated] (HUDI-4666) Investigate Hudi CLI out of box support

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4666: - Sprint: (was: 2022/09/19) > Investigate Hudi CLI out of box support > -

[jira] [Updated] (HUDI-4689) Add documentation for all CLI commands

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4689: - Sprint: (was: 2022/09/19) > Add documentation for all CLI commands > ---

[jira] [Updated] (HUDI-1413) Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1413: - Sprint: (was: 2022/09/19) > Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sy

[jira] [Updated] (HUDI-1593) Add support for "show restore" in hudi-cli

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1593: - Priority: Critical (was: Blocker) > Add support for "show restore" in hudi-cli >

[jira] [Updated] (HUDI-2580) Ability to clean up dangling data files using hudi-cli

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2580: - Priority: Critical (was: Blocker) > Ability to clean up dangling data files using hudi-cli >

[jira] [Updated] (HUDI-3819) upgrade spring cve-2022-22965

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3819: - Priority: Critical (was: Blocker) > upgrade spring cve-2022-22965 > - > >

[jira] [Updated] (HUDI-4787) ITTestHoodieSanity#testRunHoodieJavaAppOnMultiPartitionKeysMORTable streaming test

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4787: - Labels: hudi-on-call (was: ) > ITTestHoodieSanity#testRunHoodieJavaAppOnMultiPartitionKeysMORTable stream

[jira] [Updated] (HUDI-4787) ITTestHoodieSanity#testRunHoodieJavaAppOnMultiPartitionKeysMORTable streaming test

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4787: - Sprint: (was: 2022/09/19) > ITTestHoodieSanity#testRunHoodieJavaAppOnMultiPartitionKeysMORTable streamin

[jira] [Updated] (HUDI-2369) Blog on bulk insert sort modes

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2369: - Sprint: (was: 2022/09/19) > Blog on bulk insert sort modes > -- > >

[jira] [Updated] (HUDI-4692) Clean up HoodieSparkSqlWriter

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4692: - Sprint: (was: 2022/09/19) > Clean up HoodieSparkSqlWriter > - > >

[jira] [Updated] (HUDI-4690) Remove code duplicated over from Spark

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4690: - Sprint: (was: 2022/09/19) > Remove code duplicated over from Spark > ---

[jira] [Updated] (HUDI-4503) Support table identifier with explicit catalog

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4503: - Sprint: 2022/08/08 (was: 2022/08/08, 2022/09/19) > Support table identifier with explicit catalog > -

[jira] [Updated] (HUDI-3654) Support basic actions based on hudi metastore server

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3654: - Sprint: (was: 2022/09/19) > Support basic actions based on hudi metastore server >

[jira] [Closed] (HUDI-4212) kafka-connect module: Unresolved dependency: 'jdk.tools:jdk.tools:jar:1.7'

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-4212. Resolution: Not A Bug > kafka-connect module: Unresolved dependency: 'jdk.tools:jdk.tools:jar:1.7' > ---

[GitHub] [hudi] xushiyan closed pull request #6473: [HUDI-4212] Exclude jdk.tools from kafka-connect in dev setup

2022-09-19 Thread GitBox
xushiyan closed pull request #6473: [HUDI-4212] Exclude jdk.tools from kafka-connect in dev setup URL: https://github.com/apache/hudi/pull/6473 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[jira] [Updated] (HUDI-4212) kafka-connect module: Unresolved dependency: 'jdk.tools:jdk.tools:jar:1.7'

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4212: - Fix Version/s: (was: 0.12.1) > kafka-connect module: Unresolved dependency: 'jdk.tools:jdk.tools:jar:1

[GitHub] [hudi] hudi-bot commented on pull request #6712: [HUDI-4770][Stacked on 4294] Adapt spark query engine to use secondary index when querying data

2022-09-19 Thread GitBox
hudi-bot commented on PR #6712: URL: https://github.com/apache/hudi/pull/6712#issuecomment-1250829084 ## CI report: * 3d41a1bd8e5386f900167c35611dafacd53c80a1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1148

[GitHub] [hudi] hudi-bot commented on pull request #6662: [HUDI-4832] Fix drop partition meta sync

2022-09-19 Thread GitBox
hudi-bot commented on PR #6662: URL: https://github.com/apache/hudi/pull/6662#issuecomment-1250828889 ## CI report: * 05774542e7e99184781292e69747f7b20de24e22 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1148

[GitHub] [hudi] hudi-bot commented on pull request #6516: [HUDI-4729] Fix fq can not be queried in pending compaction when query ro table with spark

2022-09-19 Thread GitBox
hudi-bot commented on PR #6516: URL: https://github.com/apache/hudi/pull/6516#issuecomment-1250828625 ## CI report: * 52bcb16f2611c4476ede864cadd79a32fb6e94fa Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1141

[jira] [Updated] (HUDI-4297) Test TestHoodieDeltaStreamerWithMultiWriter.testUpsertsContinuousModeWithMultipleWriters* is flaky

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4297: - Fix Version/s: (was: 0.12.1) > Test > TestHoodieDeltaStreamerWithMultiWriter.testUpsertsContinuousMod

[jira] [Updated] (HUDI-4297) Test TestHoodieDeltaStreamerWithMultiWriter.testUpsertsContinuousModeWithMultipleWriters* is flaky

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4297: - Sprint: (was: 2022/09/19) > Test > TestHoodieDeltaStreamerWithMultiWriter.testUpsertsContinuousModeWith

[jira] [Updated] (HUDI-4724) add function of skip the _rt suffix for read snapshot

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4724: - Sprint: (was: 2022/09/05) > add function of skip the _rt suffix for read snapshot >

[jira] [Updated] (HUDI-4297) Test TestHoodieDeltaStreamerWithMultiWriter.testUpsertsContinuousModeWithMultipleWriters* is flaky

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4297: - Labels: hudi-on-call pull-request-available (was: pull-request-available) > Test > TestHoodieDeltaStream

[jira] [Updated] (HUDI-4432) Checkpoint management for muti-writer scenario

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4432: - Sprint: (was: 2022/09/19) > Checkpoint management for muti-writer scenario > ---

[jira] [Updated] (HUDI-2873) Support optimize data layout by sql and make the build more fast

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2873: - Sprint: (was: 2022/09/19) > Support optimize data layout by sql and make the build more fast > -

[GitHub] [hudi] hudi-bot commented on pull request #6712: [HUDI-4770][Stacked on 4294] Adapt spark query engine to use secondary index when querying data

2022-09-19 Thread GitBox
hudi-bot commented on PR #6712: URL: https://github.com/apache/hudi/pull/6712#issuecomment-1250823558 ## CI report: * 3d41a1bd8e5386f900167c35611dafacd53c80a1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1148

[GitHub] [hudi] hudi-bot commented on pull request #6516: [HUDI-4729] Fix fq can not be queried in pending compaction when query ro table with spark

2022-09-19 Thread GitBox
hudi-bot commented on PR #6516: URL: https://github.com/apache/hudi/pull/6516#issuecomment-1250823196 ## CI report: * 52bcb16f2611c4476ede864cadd79a32fb6e94fa Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1141

[jira] [Updated] (HUDI-3648) Failed to execute rollback due to HoodieIOException: Could not delete instant

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3648: - Sprint: (was: 2022/09/19) > Failed to execute rollback due to HoodieIOException: Could not delete instan

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3067: - Sprint: (was: 2022/09/19) > "Table already exists" error with multiple writers and dynamodb > --

[jira] [Updated] (HUDI-4734) Add table config change validation in deltastreamer

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4734: - Sprint: (was: 2022/09/19) > Add table config change validation in deltastreamer > --

[jira] [Updated] (HUDI-2733) Adding Thrift support in HiveSyncTool

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2733: - Sprint: Cont' improve - 2021/01/24, Cont' improve - 2021/01/31, Cont' improve - 2022/02/07, Cont' impro

[jira] [Updated] (HUDI-4755) INSERT_OVERWRITE(/TABLE) in spark sql should not fail time travel queries for older timestamps

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4755: - Sprint: (was: 2022/09/19) > INSERT_OVERWRITE(/TABLE) in spark sql should not fail time travel queries fo

[jira] [Updated] (HUDI-4781) Allow omit metadata fields for hive sync

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4781: - Sprint: (was: 2022/09/19) > Allow omit metadata fields for hive sync > -

[jira] [Updated] (HUDI-3973) Implement GENERATE manifest command for Snowflake integration

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3973: - Sprint: (was: 2022/09/19) > Implement GENERATE manifest command for Snowflake integration >

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3067: - Sprint: 2022/09/19 > "Table already exists" error with multiple writers and dynamodb > ---

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3067: - Sprint: (was: 2022/09/05) > "Table already exists" error with multiple writers and dynamodb > --

[GitHub] [hudi] hudi-bot commented on pull request #6710: [HUDI-4810] Fix log4j imports to use bridge API

2022-09-19 Thread GitBox
hudi-bot commented on PR #6710: URL: https://github.com/apache/hudi/pull/6710#issuecomment-1250817773 ## CI report: * 74550dcaca4958449b17e348ac1387b334072b99 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1148

[jira] [Updated] (HUDI-4805) Update docs for workaround to make HBase working with HDFS on Hadoop 3

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4805: - Component/s: docs > Update docs for workaround to make HBase working with HDFS on Hadoop 3 > -

[jira] [Updated] (HUDI-3067) "Table already exists" error with multiple writers and dynamodb

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3067: - Labels: hudi-on-call (was: ) > "Table already exists" error with multiple writers and dynamodb >

[GitHub] [hudi] hudi-bot commented on pull request #6388: [HUDI-4617] Fix delete_record's preCombine logic when changelog disabled

2022-09-19 Thread GitBox
hudi-bot commented on PR #6388: URL: https://github.com/apache/hudi/pull/6388#issuecomment-1250817066 ## CI report: * cdad7b0ea85f8181ebf79a01f38edc561690f9f4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1075

[jira] [Updated] (HUDI-4734) Add table config change validation in deltastreamer

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4734: - Fix Version/s: 0.13.0 (was: 0.12.1) > Add table config change validation in deltast

[jira] [Updated] (HUDI-4734) Add table config change validation in deltastreamer

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4734: - Sprint: 2022/09/19 > Add table config change validation in deltastreamer > ---

[jira] [Updated] (HUDI-4734) Add table config change validation in deltastreamer

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4734: - Labels: hudi-on-call (was: ) > Add table config change validation in deltastreamer >

[jira] [Updated] (HUDI-4734) Add table config change validation in deltastreamer

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4734: - Sprint: (was: 2022/09/05) > Add table config change validation in deltastreamer > --

[jira] [Updated] (HUDI-4781) Allow omit metadata fields for hive sync

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4781: - Sprint: (was: 2022/09/05) > Allow omit metadata fields for hive sync > -

[jira] [Updated] (HUDI-4781) Allow omit metadata fields for hive sync

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4781: - Sprint: 2022/09/19 > Allow omit metadata fields for hive sync > >

[jira] [Updated] (HUDI-4781) Allow omit metadata fields for hive sync

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4781: - Priority: Minor (was: Major) > Allow omit metadata fields for hive sync > ---

[jira] [Updated] (HUDI-4781) Allow omit metadata fields for hive sync

2022-09-19 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4781: - Labels: hudi-on-call pull-request-available (was: pull-request-available) > Allow omit metadata fields fo

<    1   2   3   4   5   >