[GitHub] [hudi] hudi-bot commented on pull request #4245: [MINOR] remove unuse construction method

2021-12-07 Thread GitBox
hudi-bot commented on pull request #4245: URL: https://github.com/apache/hudi/pull/4245#issuecomment-988581448 ## CI report: * 6f4e9f5fd7387cc3ec4dfa8d7f7a83a3abcbd0c0 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4245: [MINOR] remove unuse construction method

2021-12-07 Thread GitBox
hudi-bot removed a comment on pull request #4245: URL: https://github.com/apache/hudi/pull/4245#issuecomment-988578456 ## CI report: * 0893724351e36c2c0e50d1971ab4a37134c30435 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #4245: [MINOR] remove unuse construction method

2021-12-07 Thread GitBox
hudi-bot commented on pull request #4245: URL: https://github.com/apache/hudi/pull/4245#issuecomment-988578456 ## CI report: * 0893724351e36c2c0e50d1971ab4a37134c30435 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4245: [MINOR] remove unuse construction method

2021-12-07 Thread GitBox
hudi-bot removed a comment on pull request #4245: URL: https://github.com/apache/hudi/pull/4245#issuecomment-988572701 ## CI report: * 0893724351e36c2c0e50d1971ab4a37134c30435 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #4245: [MINOR] remove unuse construction method

2021-12-07 Thread GitBox
hudi-bot commented on pull request #4245: URL: https://github.com/apache/hudi/pull/4245#issuecomment-988572701 ## CI report: * 0893724351e36c2c0e50d1971ab4a37134c30435 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4245: [MINOR] remove unuse construction method

2021-12-07 Thread GitBox
hudi-bot removed a comment on pull request #4245: URL: https://github.com/apache/hudi/pull/4245#issuecomment-988543706 ## CI report: * 0893724351e36c2c0e50d1971ab4a37134c30435 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] BruceKellan closed issue #4102: [SUPPORT] What should I do if I want to delete data in certain partitions?

2021-12-07 Thread GitBox
BruceKellan closed issue #4102: URL: https://github.com/apache/hudi/issues/4102

[GitHub] [hudi] wjcwin opened a new issue #4249: [SUPPORT]FLINK CDC WRITE HUDI, restart job get exception:org.apache.hudi.org.apache.avro.InvalidAvroMagicException: Not an Avro data file

2021-12-07 Thread GitBox
wjcwin opened a new issue #4249: URL: https://github.com/apache/hudi/issues/4249 Flink CDC writing to Hudi: restarting the job throws org.apache.hudi.org.apache.avro.InvalidAvroMagicException: Not an Avro data file. Logs: ``` org.apache.flink.util.FlinkException: Global failure trig

[GitHub] [hudi] yanenze opened a new issue #4248: [SUPPORT] flink write hudi problem

2021-12-07 Thread GitBox
yanenze opened a new issue #4248: URL: https://github.com/apache/hudi/issues/4248 When I set the Flink option (FlinkOptions.WRITE_PARQUET_MAX_FILE_SIZE) to 128, the config doesn't take effect: the program generates Parquet files far larger than 128 MB, so I read the source code and find

[GitHub] [hudi] BruceKellan opened a new issue #4247: [SUPPORT] Unsupport operation exception occur when using flink+hudi in bulk_insert mode

2021-12-07 Thread GitBox
BruceKellan opened a new issue #4247: URL: https://github.com/apache/hudi/issues/4247 **Describe the problem you faced** I am using Flink + Hudi to initialize a dataset from Hive, but an unsupported operation exception occurs like this; it seems the map type is not supported in bulk_insert mod

[GitHub] [hudi] hudi-bot commented on pull request #4246: [MINOR] Update DOAP with 0.10.0 Release

2021-12-07 Thread GitBox
hudi-bot commented on pull request #4246: URL: https://github.com/apache/hudi/pull/4246#issuecomment-988561713 ## CI report: * b40dde1704cd0c69ec1981bfd411e64bd46831a4 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4246: [MINOR] Update DOAP with 0.10.0 Release

2021-12-07 Thread GitBox
hudi-bot removed a comment on pull request #4246: URL: https://github.com/apache/hudi/pull/4246#issuecomment-988560327 ## CI report: * b40dde1704cd0c69ec1981bfd411e64bd46831a4 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4246: [MINOR] Update DOAP with 0.10.0 Release

2021-12-07 Thread GitBox
hudi-bot commented on pull request #4246: URL: https://github.com/apache/hudi/pull/4246#issuecomment-988560327 ## CI report: * b40dde1704cd0c69ec1981bfd411e64bd46831a4 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` r

[jira] [Updated] (HUDI-2637) Triage all bugs around Multi-writer and certify the tested flows

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2637: - Fix Version/s: 0.11.0 (was: 0.10.0) > Triage all bugs around Multi-writer and certi

[GitHub] [hudi] danny0405 opened a new pull request #4246: [MINOR] Update DOAP with 0.10.0 Release

2021-12-07 Thread GitBox
danny0405 opened a new pull request #4246: URL: https://github.com/apache/hudi/pull/4246 ## What is the purpo

[GitHub] [hudi] xuzifu666 commented on pull request #4171: Revert "[HUDI-2856] Bit cask disk map delete modified"

2021-12-07 Thread GitBox
xuzifu666 commented on pull request #4171: URL: https://github.com/apache/hudi/pull/4171#issuecomment-988556771 > I filed a ticket for further investigation: [HUDI-2944](https://issues.apache.org/jira/projects/HUDI/issues/HUDI-2944). Feel free to take it on. I reviewed the change from

[jira] [Updated] (HUDI-2877) Support flink catalog to help user use flink table conveniently

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2877: - Fix Version/s: 0.11.0 (was: 0.10.0) > Support flink catalog to help user use flink

[jira] [Updated] (HUDI-2418) Support HiveSchemaProvider

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2418: - Fix Version/s: 0.11.0 (was: 0.10.0) > Support HiveSchemaProvider > ---

[jira] [Updated] (HUDI-2900) Fix corrupt block end position

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2900: - Fix Version/s: 0.11.0 (was: 0.10.0) > Fix corrupt block end position >

[jira] [Updated] (HUDI-2081) Move TestHiveSyncTool to functional

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2081: - Fix Version/s: 0.11.0 (was: 0.10.0) > Move TestHiveSyncTool to functional > ---

[jira] [Updated] (HUDI-2595) Guard all writes to metadata table for a single writer datatable and async operations

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2595: - Fix Version/s: 0.11.0 (was: 0.10.0) > Guard all writes to metadata table for a sing

[jira] [Updated] (HUDI-2267) Test suite infra Automate with playbook

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2267: - Fix Version/s: 0.11.0 (was: 0.10.0) > Test suite infra Automate with playbook > ---

[jira] [Updated] (HUDI-2670) Fix broken relative links

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2670: - Fix Version/s: 0.11.0 (was: 0.10.0) > Fix broken relative links > -

[jira] [Updated] (HUDI-2753) Rollback plan does has to take in explicit arg from restore for rollback strategy

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2753: - Fix Version/s: 0.11.0 (was: 0.10.0) > Rollback plan does has to take in explicit ar

[jira] [Updated] (HUDI-1752) Add HoodieFlinkClient InsertOverwrite

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1752: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add HoodieFlinkClient InsertOverwrite > -

[jira] [Updated] (HUDI-2746) Do not bootstrap for flink insert overwrite

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2746: - Fix Version/s: 0.11.0 (was: 0.10.0) > Do not bootstrap for flink insert overwrite >

[jira] [Updated] (HUDI-2629) Improve Overview Page

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2629: - Fix Version/s: 0.11.0 (was: 0.10.0) > Improve Overview Page > -

[jira] [Updated] (HUDI-2742) Multiple S3EventsHoodieIncrSource from same S3 metadata table for different Hudi tables

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2742: - Fix Version/s: 0.11.0 (was: 0.10.0) > Multiple S3EventsHoodieIncrSource from same S

[jira] [Updated] (HUDI-2709) Add more options when initializing table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2709: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add more options when initializing table > --

[jira] [Updated] (HUDI-2728) Clean up concepts and consolidate from cwiki

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2728: - Fix Version/s: 0.11.0 (was: 0.10.0) > Clean up concepts and consolidate from cwiki

[jira] [Updated] (HUDI-2760) HoodieFlinkStreamer parameter bug

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2760: - Fix Version/s: 0.11.0 (was: 0.10.0) > HoodieFlinkStreamer parameter bug > -

[jira] [Updated] (HUDI-2766) Enable marker based rollback by default

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2766: - Fix Version/s: 0.11.0 (was: 0.10.0) > Enable marker based rollback by default > ---

[jira] [Updated] (HUDI-2725) Add precommit validators doc

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2725: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add precommit validators doc > --

[jira] [Updated] (HUDI-2761) IllegalArgException from timeline server when serving getLastestBaseFiles with multi-writer

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2761: - Fix Version/s: 0.11.0 (was: 0.10.0) > IllegalArgException from timeline server when

[jira] [Updated] (HUDI-2108) Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2108: - Fix Version/s: 0.11.0 (was: 0.10.0) > Flaky test: TestHoodieBackedMetadata.testOnly

[jira] [Updated] (HUDI-1975) Upgrade java-prometheus-client from 3.1.2 to 4.x

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1975: - Fix Version/s: 0.11.0 (was: 0.10.0) > Upgrade java-prometheus-client from 3.1.2 to

[jira] [Updated] (HUDI-2685) Support scheduling online compaction plan when there are no commit data

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2685: - Fix Version/s: 0.11.0 (was: 0.10.0) > Support scheduling online compaction plan whe

[jira] [Updated] (HUDI-2727) Add prometheus reporter docs

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2727: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add prometheus reporter docs > --

[jira] [Updated] (HUDI-2472) Tests failure follow up when metadata is enabled by default

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2472: - Fix Version/s: 0.11.0 (was: 0.10.0) > Tests failure follow up when metadata is enab

[jira] [Updated] (HUDI-2718) ExternalSpillableMap throws ArithmeticException when estimating the size of the payload

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2718: - Fix Version/s: 0.11.0 (was: 0.10.0) > ExternalSpillableMap throws ArithmeticExcepti

[jira] [Updated] (HUDI-2704) Create and publish RFC for metadata based bloom index

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2704: - Fix Version/s: 0.11.0 (was: 0.10.0) > Create and publish RFC for metadata based blo

[jira] [Updated] (HUDI-2698) Remove the table source options validation

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2698: - Fix Version/s: 0.11.0 (was: 0.10.0) > Remove the table source options validation >

[jira] [Updated] (HUDI-2607) Reorganize Hudi docs

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2607: - Fix Version/s: 0.11.0 (was: 0.10.0) > Reorganize Hudi docs > >

[jira] [Updated] (HUDI-2478) Handle failure mid-way during init buckets

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2478: - Fix Version/s: 0.11.0 (was: 0.10.0) > Handle failure mid-way during init buckets >

[jira] [Updated] (HUDI-2636) Make release notes discoverable

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2636: - Fix Version/s: 0.11.0 (was: 0.10.0) > Make release notes discoverable > ---

[jira] [Updated] (HUDI-2579) Deltastreamer checkpoint metadata is not merged from previous commit instant

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2579: - Fix Version/s: 0.11.0 (was: 0.10.0) > Deltastreamer checkpoint metadata is not merg

[jira] [Updated] (HUDI-2731) Clustering should work regardless of whether there are base files

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2731: - Fix Version/s: 0.11.0 (was: 0.10.0) > Clustering should work regardless of whether

[jira] [Updated] (HUDI-2756) Fix flink parquet writer decimal type conversion

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2756: - Fix Version/s: 0.11.0 (was: 0.10.0) > Fix flink parquet writer decimal type convers

[jira] [Updated] (HUDI-2716) InLineFS support for S3FS

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2716: - Fix Version/s: 0.11.0 (was: 0.10.0) > InLineFS support for S3FS > -

[jira] [Updated] (HUDI-2745) Record count does not match input after compaction is scheduled when running Hudi Kafka Connect sink

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2745: - Fix Version/s: 0.11.0 (was: 0.10.0) > Record count does not match input after compa

[jira] [Updated] (HUDI-2758) remove redundant code in the HoodieRealtimeInputFormatUtils.getRealtimeSplits

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2758: - Fix Version/s: 0.11.0 (was: 0.10.0) > remove redundant code in the HoodieRealtimeIn

[jira] [Updated] (HUDI-2891) Kafka-connect sink still uses timeline-server-based markers as default

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2891: - Fix Version/s: 0.11.0 (was: 0.10.0) > Kafka-connect sink still uses timeline-server

[jira] [Updated] (HUDI-2759) extract HoodieCatalogTable as a bridge between spark catalog table and hoodie table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2759: - Fix Version/s: 0.11.0 (was: 0.10.0) > extract HoodieCatalogTable as a bridge betwee

[jira] [Updated] (HUDI-2764) Address test failures after enabling virtual keys support for the metadata table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2764: - Fix Version/s: 0.11.0 (was: 0.10.0) > Address test failures after enabling virtual

[jira] [Updated] (HUDI-2861) Failed rollback is never re-attempted

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2861: - Fix Version/s: 0.11.0 (was: 0.10.0) > Failed rollback is never re-attempted > -

[jira] [Updated] (HUDI-2800) Clustering execution triggers HoodieCreateHandle twice during validateWriteResult isEmpty call

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2800: - Fix Version/s: 0.11.0 (was: 0.10.0) > Clustering execution triggers HoodieCreateHan

[jira] [Updated] (HUDI-1904) Make SchemaProvider spark free and move it to hudi-client-common

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1904: - Fix Version/s: 0.11.0 (was: 0.10.0) > Make SchemaProvider spark free and move it to

[jira] [Updated] (HUDI-765) Implement OrcReaderIterator

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-765: Fix Version/s: 0.11.0 (was: 0.10.0) > Implement OrcReaderIterator > --

[jira] [Updated] (HUDI-2396) Tracking larger changes in 0.9.0 to be called out in 0.10.0

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2396: - Fix Version/s: 0.11.0 (was: 0.10.0) > Tracking larger changes in 0.9.0 to be called

[jira] [Updated] (HUDI-718) java.lang.ClassCastException during upsert

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-718: Fix Version/s: 0.11.0 (was: 0.10.0) > java.lang.ClassCastException during upsert > ---

[jira] [Updated] (HUDI-1932) Hive Sync should not always update last_commit_time_sync

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1932: - Fix Version/s: 0.11.0 (was: 0.10.0) > Hive Sync should not always update last_commi

[jira] [Updated] (HUDI-2495) Difference in behavior between GenericRecord based key gen and Row based key gen

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2495: - Fix Version/s: 0.11.0 (was: 0.10.0) > Difference in behavior between GenericRecord

[jira] [Updated] (HUDI-2586) Design col stats partition in Metadata table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2586: - Fix Version/s: 0.11.0 (was: 0.10.0) > Design col stats partition in Metadata table

[jira] [Updated] (HUDI-1353) Incremental timeline support for pending clustering operations

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1353: - Fix Version/s: 0.11.0 (was: 0.10.0) > Incremental timeline support for pending clus

[jira] [Updated] (HUDI-2505) [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2505: - Fix Version/s: 0.11.0 (was: 0.10.0) > [UMBRELLA] Spark DataSource APIs and Spark SQ

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1290: - Fix Version/s: 0.11.0 (was: 0.10.0) > Implement Debezium avro source for Delta Stre

[jira] [Updated] (HUDI-1015) Audit all getAllPartitionPaths() calls and keep em out of fast path

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1015: - Fix Version/s: 0.11.0 (was: 0.10.0) > Audit all getAllPartitionPaths() calls and ke

[jira] [Updated] (HUDI-1491) Support partition pruning for MOR snapshot query

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1491: - Fix Version/s: 0.11.0 (was: 0.10.0) > Support partition pruning for MOR snapshot qu

[jira] [Updated] (HUDI-2471) Add support ignoring case in merge into

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2471: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add support ignoring case in merge into > ---

[jira] [Updated] (HUDI-2628) Fix Chinese Docs

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2628: - Fix Version/s: 0.11.0 (was: 0.10.0) > Fix Chinese Docs > > >

[jira] [Updated] (HUDI-1172) Use OverwriteWithLatestAvroPayload as default payload class everywhere

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1172: - Fix Version/s: 0.11.0 (was: 0.10.0) > Use OverwriteWithLatestAvroPayload as default

[jira] [Updated] (HUDI-847) Umbrella ticket for tuning default configs for 0.6.0

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-847: Fix Version/s: (was: 0.10.0) > Umbrella ticket for tuning default configs for 0.6.0 > ---

[jira] [Updated] (HUDI-2641) One inflight commit rolling back other concurrent inflight commits causing them to fail

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2641: - Fix Version/s: 0.11.0 (was: 0.10.0) > One inflight commit rolling back other concur

[jira] [Updated] (HUDI-2102) support hilbert curve for hudi

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2102: - Fix Version/s: 0.11.0 (was: 0.10.0) > support hilbert curve for hudi >

[jira] [Updated] (HUDI-1879) Spark DataSource tables/HoodieFileIndex issues for Merge On Read

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1879: - Fix Version/s: 0.11.0 (was: 0.10.0) > Spark DataSource tables/HoodieFileIndex issue

[jira] [Updated] (HUDI-2234) MERGE INTO works only ON primary key

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2234: - Fix Version/s: 0.11.0 (was: 0.10.0) > MERGE INTO works only ON primary key > --

[jira] [Updated] (HUDI-2314) Add DynamoDb based lock provider

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2314: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add DynamoDb based lock provider > --

[jira] [Updated] (HUDI-1877) clustering support for external index

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1877: - Fix Version/s: 0.11.0 (was: 0.10.0) > clustering support for external index > -

[jira] [Updated] (HUDI-2599) [Performance] Lower parallelism with snapshot query on COW tables in Presto

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2599: - Fix Version/s: 0.11.0 (was: 0.10.0) > [Performance] Lower parallelism with snapshot

[jira] [Updated] (HUDI-2287) Partition pruning not working on Hudi dataset

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2287: - Fix Version/s: 0.11.0 (was: 0.10.0) > Partition pruning not working on Hudi dataset

[jira] [Updated] (HUDI-1794) Generating a new instant time in HoodieActiveTimeline is not thread safe

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1794: - Fix Version/s: 0.11.0 (was: 0.10.0) > Generating a new instant time in HoodieActive

[jira] [Updated] (HUDI-1937) When clustering fail, generating unfinished replacecommit timeline.

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1937: - Fix Version/s: 0.11.0 (was: 0.10.0) > When clustering fail, generating unfinished r

[jira] [Updated] (HUDI-2663) Incorrect deletion of heartbeat files for inflight commits

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2663: - Fix Version/s: 0.11.0 (was: 0.10.0) > Incorrect deletion of heartbeat files for inf

[jira] [Updated] (HUDI-2526) Make spark.sql.parquet.writeLegacyFormat configurable

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2526: - Fix Version/s: 0.11.0 (was: 0.10.0) > Make spark.sql.parquet.writeLegacyFormat conf

[jira] [Updated] (HUDI-2362) Hudi external configuration file support

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2362: - Fix Version/s: 0.11.0 (was: 0.10.0) > Hudi external configuration file support > --

[jira] [Updated] (HUDI-2667) Avoid fs.exists() and fs.mkdirs() call to partitions in AbstractTablefileSystemView

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2667: - Fix Version/s: 0.11.0 (was: 0.10.0) > Avoid fs.exists() and fs.mkdirs() call to par

[jira] [Updated] (HUDI-1763) DefaultHoodieRecordPayload does not honor ordering value when records within multiple log files are merged

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1763: - Fix Version/s: 0.11.0 (was: 0.10.0) > DefaultHoodieRecordPayload does not honor ord

[jira] [Updated] (HUDI-2512) Multi-writer w/ DeltaStreamer and Spark datasource does not work

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2512: - Fix Version/s: 0.11.0 (was: 0.10.0) > Multi-writer w/ DeltaStreamer and Spark datas

[jira] [Updated] (HUDI-1609) Issues w/ using hive metastore by disabling jdbc

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1609: - Fix Version/s: 0.11.0 (was: 0.10.0) > Issues w/ using hive metastore by disabling j

[jira] [Updated] (HUDI-2455) Fail to build Spark caused by spark_avro dependency

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2455: - Fix Version/s: 0.11.0 (was: 0.10.0) > Fail to build Spark caused by spark_avro depe

[jira] [Updated] (HUDI-2332) Implement scheduling of compaction/ clustering for Kafka Connect

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2332: - Fix Version/s: 0.11.0 (was: 0.10.0) > Implement scheduling of compaction/ clusterin

[jira] [Updated] (HUDI-2672) Avoid empty commits and rollbacks when there is no event from the topic

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2672: - Fix Version/s: 0.11.0 (was: 0.10.0) > Avoid empty commits and rollbacks when there

[jira] [Updated] (HUDI-2567) Verify synchronous metadata patch w/ multi writers end to end

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2567: - Fix Version/s: 0.11.0 (was: 0.10.0) > Verify synchronous metadata patch w/ multi wr

[jira] [Updated] (HUDI-2712) rollback of a partially failed commit which has new partitions fails with metadata table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2712: - Fix Version/s: 0.11.0 (was: 0.10.0) > rollback of a partially failed commit which h

[jira] [Updated] (HUDI-2697) Minor changes about hbase index config.

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2697: - Fix Version/s: 0.11.0 (was: 0.10.0) > Minor changes about hbase index config. > --

[jira] [Updated] (HUDI-1283) Fill missing columns with default value when spark dataframe save to hudi table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1283: - Fix Version/s: 0.11.0 (was: 0.10.0) > Fill missing columns with default value when

[jira] [Updated] (HUDI-1444) fix the error when rollback commit that belong to a non partition table

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1444: - Fix Version/s: 0.11.0 (was: 0.10.0) > fix the error when rollback commit that belon

[jira] [Updated] (HUDI-2242) Add inference logic to few Hudi configs

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2242: - Fix Version/s: 0.11.0 (was: 0.10.0) > Add inference logic to few Hudi configs > ---

[jira] [Updated] (HUDI-1995) FIx typo in oss cn docs

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-1995: - Fix Version/s: 0.11.0 (was: 0.10.0) > FIx typo in oss cn docs > ---

[jira] [Updated] (HUDI-2717) Test and certify inline file system in S3 and hdfs

2021-12-07 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2717: - Fix Version/s: 0.11.0 (was: 0.10.0) > Test and certify inline file system in S3 and
