[jira] [Created] (HUDI-995) Add hudi-testutils module

2020-06-03 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-995: --- Summary: Add hudi-testutils module Key: HUDI-995 URL: https://issues.apache.org/jira/browse/HUDI-995 Project: Apache Hudi Issue Type: Sub-task Components:

[jira] [Created] (HUDI-994) Identify functional tests that are convertible to unit tests with mocks

2020-06-03 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-994: --- Summary: Identify functional tests that are convertible to unit tests with mocks Key: HUDI-994 URL: https://issues.apache.org/jira/browse/HUDI-994 Project: Apache Hudi

[jira] [Created] (HUDI-996) Use shared spark session provider

2020-06-03 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-996: --- Summary: Use shared spark session provider Key: HUDI-996 URL: https://issues.apache.org/jira/browse/HUDI-996 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-995) Add hudi-testutils module

2020-06-03 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Description: * add a new module {{hudi-testutils}} and add it to all other modules as test dep and remove 

[jira] [Updated] (HUDI-781) Re-design test utilities

2020-06-03 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-781: Status: Open (was: New) > Re-design test utilities > > > Key:

[jira] [Updated] (HUDI-896) Parallelize CI testing to reduce CI wait time

2020-06-03 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-896: Parent: HUDI-781 Issue Type: Sub-task (was: Improvement) > Parallelize CI testing to reduce CI wait

[jira] [Updated] (HUDI-994) Identify functional tests that are convertible to unit tests with mocks

2020-06-03 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-994: Description: * Identify convertible functional tests and re-implement by using mock * remove/merge

[jira] [Commented] (HUDI-781) Re-design test utilities

2020-06-08 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128778#comment-17128778 ] Raymond Xu commented on HUDI-781: - [~yanghua] [~yanghua] [~nishith29] [~garyli1019] Here is an execution

[jira] [Comment Edited] (HUDI-781) Re-design test utilities

2020-06-08 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128778#comment-17128778 ] Raymond Xu edited comment on HUDI-781 at 6/9/20, 2:36 AM: -- [~yanghua] [~vinoth]

[jira] [Comment Edited] (HUDI-781) Re-design test utilities

2020-06-08 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128778#comment-17128778 ] Raymond Xu edited comment on HUDI-781 at 6/9/20, 2:41 AM: -- [~yanghua] [~vinoth]

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Description: * Move test utils classes to hudi-common where appropriate, e.g. TestRawTripPayload,

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Summary: Organize test utils methods and classes (was: Add hudi-testutils module) > Organize test utils

[jira] [Assigned] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-995: --- Assignee: Raymond Xu > Organize test utils methods and classes >

[jira] [Assigned] (HUDI-996) Use shared spark session provider

2020-07-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-996: --- Assignee: (was: Raymond Xu) > Use shared spark session provider >

[jira] [Commented] (HUDI-996) Use shared spark session provider

2020-07-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166871#comment-17166871 ] Raymond Xu commented on HUDI-996: - pausing the work on tagging more functional tests to functional test

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Status: Open (was: New) > Organize test utils methods and classes > ---

[jira] [Commented] (HUDI-995) Organize test utils methods and classes

2020-07-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166868#comment-17166868 ] Raymond Xu commented on HUDI-995: - [~yanghua] yes, there'll be more incremental changes. Let me get back to

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Status: In Progress (was: Open) > Organize test utils methods and classes >

[jira] [Updated] (HUDI-73) Support vanilla Avro Kafka Source in HoodieDeltaStreamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-73?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-73: --- Labels: gsoc2021 mentor pull-request-available (was: pull-request-available) > Support vanilla Avro Kafka

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1280: - Labels: (was: gsoc2021 mentor) > Add tool to capture earliest or latest offsets in kafka topics >

[jira] [Updated] (HUDI-767) Support transformation when export to Hudi

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-767: Labels: gsoc2021 mentor (was: ) > Support transformation when export to Hudi >

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1280: - Labels: gsoc2021 mentor (was: ) > Add tool to capture earliest or latest offsets in kafka topics >

[jira] [Updated] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-791: Labels: gsoc2021 mentor pull-request-available (was: pull-request-available) > Replace null by Option in

[jira] [Updated] (HUDI-347) Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private methods

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-347: Labels: gsoc2021 mentor (was: ) > Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private

[jira] [Updated] (HUDI-233) Redo log statements using SLF4J

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-233: Labels: gsoc2021 mentor pull-request-available (was: pull-request-available) > Redo log statements using

[jira] [Updated] (HUDI-1034) Document info about test structure and guide

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1034: - Status: In Progress (was: Open) > Document info about test structure and guide >

[jira] [Updated] (HUDI-1034) Document info about test structure and guide

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1034: - Status: Open (was: New) > Document info about test structure and guide >

[jira] [Closed] (HUDI-1034) Document info about test structure and guide

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-1034. > Document info about test structure and guide > > >

[jira] [Resolved] (HUDI-1034) Document info about test structure and guide

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu resolved HUDI-1034. -- Resolution: Done > Document info about test structure and guide >

[jira] [Assigned] (HUDI-1034) Document info about test structure and guide

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1034: Assignee: Raymond Xu > Document info about test structure and guide >

[jira] [Updated] (HUDI-270) [UMBRELLA] Improve Hudi website UI and documentation

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-270: Labels: gsoc2021 mentor (was: ) > [UMBRELLA] Improve Hudi website UI and documentation >

[jira] [Updated] (HUDI-534) Explore a new way to fix import order

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-534: Component/s: Code Cleanup > Explore a new way to fix import order > - >

[jira] [Updated] (HUDI-534) Explore a new way to fix import order

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-534: Labels: gsoc2021 mentor (was: ) > Explore a new way to fix import order >

[jira] [Updated] (HUDI-304) Bring back spotless plugin

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-304: Labels: bug-bash-0.6.0 gsoc2021 help-wanted mentor pull-request-available (was: bug-bash-0.6.0 help-wanted

[jira] [Commented] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17228869#comment-17228869 ] Raymond Xu commented on HUDI-791: - hi [~garyli1019] saw the PR is merged. can we close this? > Replace

[jira] [Updated] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-791: Labels: pull-request-available (was: gsoc2021 mentor pull-request-available) > Replace null by Option in

[jira] [Updated] (HUDI-1001) Add implementation to translate source partition paths when doing metadata bootstrap

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1001: - Labels: gsoc2021 mentor (was: ) > Add implementation to translate source partition paths when doing

[jira] [Updated] (HUDI-246) Apache Pulsar data source for Hudi

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-246: Labels: gsoc2021 mentor (was: ) > Apache Pulsar data source for Hudi > -- >

[jira] [Commented] (HUDI-768) Support split/merge source datasets during export

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17228779#comment-17228779 ] Raymond Xu commented on HUDI-768: - [~vinoth] i wonder if we should close this as won't do. This sort of

[jira] [Assigned] (HUDI-904) Segregate metrics configs by reporter type

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-904: --- Assignee: (was: Raymond Xu) > Segregate metrics configs by reporter type >

[jira] [Assigned] (HUDI-1211) Test failures w/ some index tests (TestHoodieIndex)

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1211: Assignee: Raymond Xu > Test failures w/ some index tests (TestHoodieIndex) >

[jira] [Updated] (HUDI-904) Segregate metrics configs by reporter type

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-904: Labels: gsoc2021 mentor (was: ) > Segregate metrics configs by reporter type >

[jira] [Updated] (HUDI-1211) Test failures w/ some index tests (TestHoodieIndex)

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1211: - Status: Open (was: New) > Test failures w/ some index tests (TestHoodieIndex) >

[jira] [Assigned] (HUDI-1211) Test failures w/ some index tests (TestHoodieIndex)

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1211: Assignee: (was: Raymond Xu) > Test failures w/ some index tests (TestHoodieIndex) >

[jira] [Updated] (HUDI-488) Refactor Source classes in hudi-utilities

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-488: Labels: gsoc2021 mentor (was: ) > Refactor Source classes in hudi-utilities >

[jira] [Updated] (HUDI-74) Improve compaction support in HoodieDeltaStreamer & CLI

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-74: --- Labels: gsoc2021 mentor (was: ) > Improve compaction support in HoodieDeltaStreamer & CLI >

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1290: - Labels: gsoc2021 mentor (was: ) > Implement Debezium avro source for Delta Streamer >

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Labels: bug-bash-0.6.0 gsoc2021 mentor (was: bug-bash-0.6.0) > Improve deltastreamer error message when

[jira] [Updated] (HUDI-388) Support DDL / DML SparkSQL statements which useful for admins

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-388: Labels: gsoc2021 mentor (was: ) > Support DDL / DML SparkSQL statements which useful for admins >

[jira] [Updated] (HUDI-96) Use Command line options instead of positional arguments when launching spark applications from various CLI commands

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-96?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-96: --- Labels: gsoc2021 mentor newbie pull-request-available (was: newbie pull-request-available) > Use Command line

[jira] [Updated] (HUDI-67) Tool to convert sequence file based archived commits to log format #224

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-67: --- Labels: gsoc2021 mentor (was: ) > Tool to convert sequence file based archived commits to log format #224 >

[jira] [Closed] (HUDI-693) Add unit test for hudi-cli module

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-693. --- Resolution: Duplicate Closing due to being a duplicate > Add unit test for hudi-cli module >

[jira] [Updated] (HUDI-347) Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private methods

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-347: Labels: (was: gsoc2021 mentor) > Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private

[jira] [Updated] (HUDI-693) Add unit test for hudi-cli module

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-693: Status: Open (was: New) > Add unit test for hudi-cli module > - > >

[jira] [Updated] (HUDI-145) Limit the amount of partitions considered for GlobalBloomIndex

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-145: Labels: gsoc2021 mentor (was: ) > Limit the amount of partitions considered for GlobalBloomIndex >

[jira] [Commented] (HUDI-226) Hudi Website - Provide links to documentation corresponding to older release versions

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17228874#comment-17228874 ] Raymond Xu commented on HUDI-226: - [~vbalaji] Seems like this is done.

[jira] [Updated] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-791: Status: Closed (was: Patch Available) > Replace null by Option in Delta Streamer >

[jira] [Updated] (HUDI-233) Redo log statements using SLF4J

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-233: Labels: pull-request-available (was: gsoc2021 mentor pull-request-available) > Redo log statements using

[jira] [Updated] (HUDI-1001) Add implementation to translate source partition paths when doing metadata bootstrap

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1001: - Labels: (was: gsoc2021 mentor) > Add implementation to translate source partition paths when doing

[jira] [Updated] (HUDI-270) [UMBRELLA] Improve Hudi website UI and documentation

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-270: Labels: (was: gsoc2021 mentor) > [UMBRELLA] Improve Hudi website UI and documentation >

[jira] [Updated] (HUDI-246) Apache Pulsar data source for Hudi

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-246: Labels: (was: gsoc2021 mentor) > Apache Pulsar data source for Hudi > -- >

[jira] [Updated] (HUDI-73) Support vanilla Avro Kafka Source in HoodieDeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-73?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-73: --- Labels: pull-request-available (was: gsoc2021 mentor pull-request-available) > Support vanilla Avro Kafka

[jira] [Updated] (HUDI-60) [UMBRELLA] Beam IO module to support incremental tailing of Hoodie Hive/Spark tables

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Summary: [UMBRELLA] Beam IO module to support incremental tailing of Hoodie Hive/Spark tables (was: Beam IO

[jira] [Updated] (HUDI-67) Tool to convert sequence file based archived commits to log format #224

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-67: --- Labels: (was: gsoc2021 mentor) > Tool to convert sequence file based archived commits to log format #224 >

[jira] [Updated] (HUDI-388) Support DDL / DML SparkSQL statements which useful for admins

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-388: Labels: (was: gsoc2021 mentor) > Support DDL / DML SparkSQL statements which useful for admins >

[jira] [Updated] (HUDI-74) Improve compaction support in HoodieDeltaStreamer & CLI

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-74: --- Labels: (was: gsoc2021 mentor) > Improve compaction support in HoodieDeltaStreamer & CLI >

[jira] [Updated] (HUDI-304) Bring back spotless plugin

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-304: Labels: bug-bash-0.6.0 help-wanted pull-request-available (was: bug-bash-0.6.0 gsoc2021 help-wanted mentor

[jira] [Updated] (HUDI-534) Explore a new way to fix import order

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-534: Labels: (was: gsoc2021 mentor) > Explore a new way to fix import order >

[jira] [Updated] (HUDI-60) Beam IO module to support incremental tailing of Hoodie Hive/Spark tables #8

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Labels: gsoc2021 mentor (was: ) > Beam IO module to support incremental tailing of Hoodie Hive/Spark tables #8

[jira] [Updated] (HUDI-488) Refactor Source classes in hudi-utilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-488: Labels: (was: gsoc2021 mentor) > Refactor Source classes in hudi-utilities >

[jira] [Updated] (HUDI-1385) [UMBRELLA] Improve source ingestion support in DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1385: - Summary: [UMBRELLA] Improve source ingestion support in DeltaStreamer (was: [UMBRELLA] Improve source

[jira] [Created] (HUDI-1386) AWS kinesis data source for DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1386: Summary: AWS kinesis data source for DeltaStreamer Key: HUDI-1386 URL: https://issues.apache.org/jira/browse/HUDI-1386 Project: Apache Hudi Issue Type: New Feature

[jira] [Created] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1388: Summary: [UMBRELLA] Improve CLI features and usabilities Key: HUDI-1388 URL: https://issues.apache.org/jira/browse/HUDI-1388 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Labels: gsoc2021 mentor (was: ) > [UMBRELLA] Improve CLI features and usabilities >

[jira] [Updated] (HUDI-1237) [UMBRELLA] Checkstyle, formatting, warnings, spotless

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1237: - Component/s: (was: Testing) Code Cleanup > [UMBRELLA] Checkstyle, formatting,

[jira] [Updated] (HUDI-767) Support transformation when export to Hudi

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-767: Labels: (was: gsoc2021 mentor) > Support transformation when export to Hudi >

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Labels: (was: bug-bash-0.6.0) > Improve deltastreamer error message when case mismatch of commandline >

[jira] [Updated] (HUDI-74) Improve compaction support in HoodieDeltaStreamer & CLI

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-74: --- Component/s: CLI > Improve compaction support in HoodieDeltaStreamer & CLI >

[jira] [Created] (HUDI-1387) [UMBRELLA] Support Apache Calcite for querying Hudi datasets

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1387: Summary: [UMBRELLA] Support Apache Calcite for querying Hudi datasets Key: HUDI-1387 URL: https://issues.apache.org/jira/browse/HUDI-1387 Project: Apache Hudi

[jira] [Updated] (HUDI-145) Limit the amount of partitions considered for GlobalBloomIndex

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-145: Labels: (was: gsoc2021 mentor) > Limit the amount of partitions considered for GlobalBloomIndex >

[jira] [Created] (HUDI-1389) [UMBRELLA] Survey indexing technique for better query performance

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1389: Summary: [UMBRELLA] Survey indexing technique for better query performance Key: HUDI-1389 URL: https://issues.apache.org/jira/browse/HUDI-1389 Project: Apache Hudi

[jira] [Updated] (HUDI-1389) [UMBRELLA] Survey indexing technique for better query performance

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1389: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Survey indexing technique for better

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Improve CLI features and usabilities >

[jira] [Updated] (HUDI-1387) [UMBRELLA] Support Apache Calcite for querying Hudi datasets

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1387: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Support Apache Calcite for querying

[jira] [Updated] (HUDI-1385) [UMBRELLA] Improve source ingestion support in DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1385: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Improve source ingestion support in

[jira] [Updated] (HUDI-60) [UMBRELLA] Support Apache Beam for incremental tailing

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Support Apache Beam for incremental tailing >

[jira] [Updated] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1390: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Support schema inference for

[jira] [Updated] (HUDI-1237) [UMBRELLA] Checkstyle, formatting, warnings, spotless

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1237: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Checkstyle, formatting, warnings,

[jira] [Updated] (HUDI-904) Segregate metrics configs by reporter type

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-904: Labels: (was: gsoc2021 mentor) > Segregate metrics configs by reporter type >

[jira] [Created] (HUDI-1385) [UMBRELLA] Improve source support in DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1385: Summary: [UMBRELLA] Improve source support in DeltaStreamer Key: HUDI-1385 URL: https://issues.apache.org/jira/browse/HUDI-1385 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1290: - Labels: (was: gsoc2021 mentor) > Implement Debezium avro source for Delta Streamer >

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Component/s: (was: Utilities) Code Cleanup > Improve deltastreamer error message when

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Labels: bug-bash-0.6.0 (was: bug-bash-0.6.0 gsoc2021 mentor) > Improve deltastreamer error message when

[jira] [Updated] (HUDI-96) Use Command line options instead of positional arguments when launching spark applications from various CLI commands

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-96?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-96: --- Labels: newbie pull-request-available (was: gsoc2021 mentor newbie pull-request-available) > Use Command line

[jira] [Updated] (HUDI-60) Beam IO module to support incremental tailing of Hoodie Hive/Spark tables #8

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Description: (More details to be added) (was: https://github.com/uber/hudi/issues/8) > Beam IO module to

[jira] [Commented] (HUDI-1387) [UMBRELLA] Support Apache Calcite for querying Hudi datasets

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17229508#comment-17229508 ] Raymond Xu commented on HUDI-1387: -- [~vinoth] Made this under presto integration component. Shall we

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Component/s: Usability > [UMBRELLA] Improve CLI features and usabilities >

[jira] [Created] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1390: Summary: [UMBRELLA] Support schema inference for unstructured data Key: HUDI-1390 URL: https://issues.apache.org/jira/browse/HUDI-1390 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-1389) [UMBRELLA] Survey indexing technique for better query performance

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1389: - Component/s: Performance > [UMBRELLA] Survey indexing technique for better query performance >

[jira] [Updated] (HUDI-60) [UMBRELLA] Support Apache Beam for incremental tailing

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Summary: [UMBRELLA] Support Apache Beam for incremental tailing (was: [UMBRELLA] Beam IO module to support
