[jira] [Updated] (HUDI-246) Apache Pulsar data source for Hudi

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-246: Labels: gsoc2021 mentor (was: ) > Apache Pulsar data source for Hudi > -- >

[jira] [Updated] (HUDI-73) Support vanilla Avro Kafka Source in HoodieDeltaStreamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-73?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-73: --- Labels: gsoc2021 mentor pull-request-available (was: pull-request-available) > Support vanilla Avro Kafka Sour

[jira] [Commented] (HUDI-768) Support split/merge source datasets during export

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228779#comment-17228779 ] Raymond Xu commented on HUDI-768: - [~vinoth] i wonder if we should close this as won't do.

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1290: - Labels: gsoc2021 mentor (was: ) > Implement Debezium avro source for Delta Streamer > ---

[jira] [Updated] (HUDI-74) Improve compaction support in HoodieDeltaStreamer & CLI

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-74: --- Labels: gsoc2021 mentor (was: ) > Improve compaction support in HoodieDeltaStreamer & CLI > ---

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1280: - Labels: gsoc2021 mentor (was: ) > Add tool to capture earliest or latest offsets in kafka topics > -

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1280: - Labels: (was: gsoc2021 mentor) > Add tool to capture earliest or latest offsets in kafka topics > -

[jira] [Updated] (HUDI-304) Bring back spotless plugin

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-304: Labels: bug-bash-0.6.0 gsoc2021 help-wanted mentor pull-request-available (was: bug-bash-0.6.0 help-wanted p

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Labels: bug-bash-0.6.0 gsoc2021 mentor (was: bug-bash-0.6.0) > Improve deltastreamer error message when case

[jira] [Updated] (HUDI-67) Tool to convert sequence file based archived commits to log format #224

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-67: --- Labels: gsoc2021 mentor (was: ) > Tool to convert sequence file based archived commits to log format #224 > ---

[jira] [Updated] (HUDI-388) Support DDL / DML SparkSQL statements which useful for admins

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-388: Labels: gsoc2021 mentor (was: ) > Support DDL / DML SparkSQL statements which useful for admins > --

[jira] [Updated] (HUDI-96) Use Command line options instead of positional arguments when launching spark applications from various CLI commands

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-96?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-96: --- Labels: gsoc2021 mentor newbie pull-request-available (was: newbie pull-request-available) > Use Command line

[jira] [Updated] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-791: Labels: gsoc2021 mentor pull-request-available (was: pull-request-available) > Replace null by Option in De

[jira] [Updated] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-791: Labels: pull-request-available (was: gsoc2021 mentor pull-request-available) > Replace null by Option in De

[jira] [Commented] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228869#comment-17228869 ] Raymond Xu commented on HUDI-791: - hi [~garyli1019] saw the PR is merged. can we close this

[jira] [Updated] (HUDI-534) Explore a new way to fix import order

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-534: Component/s: Code Cleanup > Explore a new way to fix import order > - > >

[jira] [Updated] (HUDI-534) Explore a new way to fix import order

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-534: Labels: gsoc2021 mentor (was: ) > Explore a new way to fix import order > --

[jira] [Updated] (HUDI-347) Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private methods

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-347: Labels: gsoc2021 mentor (was: ) > Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private method

[jira] [Updated] (HUDI-233) Redo log statements using SLF4J

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-233: Labels: gsoc2021 mentor pull-request-available (was: pull-request-available) > Redo log statements using SL

[jira] [Commented] (HUDI-226) Hudi Website - Provide links to documentation corresponding to older release versions

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228874#comment-17228874 ] Raymond Xu commented on HUDI-226: - [~vbalaji] Seems like this is done. [https://hudi.apach

[jira] [Updated] (HUDI-145) Limit the amount of partitions considered for GlobalBloomIndex

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-145: Labels: gsoc2021 mentor (was: ) > Limit the amount of partitions considered for GlobalBloomIndex > -

[jira] [Updated] (HUDI-693) Add unit test for hudi-cli module

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-693: Status: Open (was: New) > Add unit test for hudi-cli module > - > >

[jira] [Closed] (HUDI-693) Add unit test for hudi-cli module

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-693. --- Resolution: Duplicate Closing due to being a duplicate > Add unit test for hudi-cli module > -

[jira] [Updated] (HUDI-347) Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private methods

2020-11-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-347: Labels: (was: gsoc2021 mentor) > Fix TestHoodieClientOnCopyOnWriteStorage Tests with modular private method

[jira] [Updated] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-791: Status: Closed (was: Patch Available) > Replace null by Option in Delta Streamer > -

[jira] [Updated] (HUDI-904) Segregate metrics configs by reporter type

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-904: Labels: (was: gsoc2021 mentor) > Segregate metrics configs by reporter type > -

[jira] [Updated] (HUDI-767) Support transformation when export to Hudi

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-767: Labels: (was: gsoc2021 mentor) > Support transformation when export to Hudi > -

[jira] [Updated] (HUDI-233) Redo log statements using SLF4J

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-233: Labels: pull-request-available (was: gsoc2021 mentor pull-request-available) > Redo log statements using SL

[jira] [Updated] (HUDI-270) [UMBRELLA] Improve Hudi website UI and documentation

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-270: Labels: (was: gsoc2021 mentor) > [UMBRELLA] Improve Hudi website UI and documentation > ---

[jira] [Updated] (HUDI-1001) Add implementation to translate source partition paths when doing metadata bootstrap

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1001: - Labels: (was: gsoc2021 mentor) > Add implementation to translate source partition paths when doing metad

[jira] [Updated] (HUDI-304) Bring back spotless plugin

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-304: Labels: bug-bash-0.6.0 help-wanted pull-request-available (was: bug-bash-0.6.0 gsoc2021 help-wanted mentor p

[jira] [Updated] (HUDI-534) Explore a new way to fix import order

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-534: Labels: (was: gsoc2021 mentor) > Explore a new way to fix import order > --

[jira] [Updated] (HUDI-60) Beam IO module to support incremental tailing of Hoodie Hive/Spark tables #8

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Labels: gsoc2021 mentor (was: ) > Beam IO module to support incremental tailing of Hoodie Hive/Spark tables #8

[jira] [Created] (HUDI-1385) [UMBRELLA] Improve source support in DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1385: Summary: [UMBRELLA] Improve source support in DeltaStreamer Key: HUDI-1385 URL: https://issues.apache.org/jira/browse/HUDI-1385 Project: Apache Hudi Issue Type: Impr

[jira] [Updated] (HUDI-488) Refactor Source classes in hudi-utilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-488: Labels: (was: gsoc2021 mentor) > Refactor Source classes in hudi-utilities > -

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1290: - Labels: (was: gsoc2021 mentor) > Implement Debezium avro source for Delta Streamer > ---

[jira] [Updated] (HUDI-1385) [UMBRELLA] Improve source ingestion support in DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1385: - Summary: [UMBRELLA] Improve source ingestion support in DeltaStreamer (was: [UMBRELLA] Improve source sup

[jira] [Updated] (HUDI-246) Apache Pulsar data source for Hudi

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-246: Labels: (was: gsoc2021 mentor) > Apache Pulsar data source for Hudi > -- >

[jira] [Updated] (HUDI-73) Support vanilla Avro Kafka Source in HoodieDeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-73?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-73: --- Labels: pull-request-available (was: gsoc2021 mentor pull-request-available) > Support vanilla Avro Kafka Sour

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Component/s: (was: Utilities) Code Cleanup > Improve deltastreamer error message when ca

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Labels: bug-bash-0.6.0 (was: bug-bash-0.6.0 gsoc2021 mentor) > Improve deltastreamer error message when case

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-735: Labels: (was: bug-bash-0.6.0) > Improve deltastreamer error message when case mismatch of commandline > ar

[jira] [Updated] (HUDI-74) Improve compaction support in HoodieDeltaStreamer & CLI

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-74: --- Component/s: CLI > Improve compaction support in HoodieDeltaStreamer & CLI > ---

[jira] [Created] (HUDI-1386) AWS kinesis data source for DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1386: Summary: AWS kinesis data source for DeltaStreamer Key: HUDI-1386 URL: https://issues.apache.org/jira/browse/HUDI-1386 Project: Apache Hudi Issue Type: New Feature

[jira] [Updated] (HUDI-96) Use Command line options instead of positional arguments when launching spark applications from various CLI commands

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-96?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-96: --- Labels: newbie pull-request-available (was: gsoc2021 mentor newbie pull-request-available) > Use Command line

[jira] [Updated] (HUDI-60) Beam IO module to support incremental tailing of Hoodie Hive/Spark tables #8

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Description: (More details to be added) (was: https://github.com/uber/hudi/issues/8) > Beam IO module to suppo

[jira] [Updated] (HUDI-60) [UMBRELLA] Beam IO module to support incremental tailing of Hoodie Hive/Spark tables

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Summary: [UMBRELLA] Beam IO module to support incremental tailing of Hoodie Hive/Spark tables (was: Beam IO mod

[jira] [Created] (HUDI-1387) [UMBRELLA] Support Apache Calcite for querying Hudi datasets

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1387: Summary: [UMBRELLA] Support Apache Calcite for querying Hudi datasets Key: HUDI-1387 URL: https://issues.apache.org/jira/browse/HUDI-1387 Project: Apache Hudi Issue

[jira] [Commented] (HUDI-1387) [UMBRELLA] Support Apache Calcite for querying Hudi datasets

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17229508#comment-17229508 ] Raymond Xu commented on HUDI-1387: -- [~vinoth] Made this under presto integration componen

[jira] [Created] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1388: Summary: [UMBRELLA] Improve CLI features and usabilities Key: HUDI-1388 URL: https://issues.apache.org/jira/browse/HUDI-1388 Project: Apache Hudi Issue Type: Improve

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Labels: gsoc2021 mentor (was: ) > [UMBRELLA] Improve CLI features and usabilities > -

[jira] [Updated] (HUDI-67) Tool to convert sequence file based archived commits to log format #224

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-67: --- Labels: (was: gsoc2021 mentor) > Tool to convert sequence file based archived commits to log format #224 > ---

[jira] [Updated] (HUDI-388) Support DDL / DML SparkSQL statements which useful for admins

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-388: Labels: (was: gsoc2021 mentor) > Support DDL / DML SparkSQL statements which useful for admins > --

[jira] [Updated] (HUDI-74) Improve compaction support in HoodieDeltaStreamer & CLI

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-74: --- Labels: (was: gsoc2021 mentor) > Improve compaction support in HoodieDeltaStreamer & CLI > ---

[jira] [Updated] (HUDI-145) Limit the amount of partitions considered for GlobalBloomIndex

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-145: Labels: (was: gsoc2021 mentor) > Limit the amount of partitions considered for GlobalBloomIndex > -

[jira] [Created] (HUDI-1389) [UMBRELLA] Survey indexing technique for better query performance

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1389: Summary: [UMBRELLA] Survey indexing technique for better query performance Key: HUDI-1389 URL: https://issues.apache.org/jira/browse/HUDI-1389 Project: Apache Hudi

[jira] [Created] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-10 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1390: Summary: [UMBRELLA] Support schema inference for unstructured data Key: HUDI-1390 URL: https://issues.apache.org/jira/browse/HUDI-1390 Project: Apache Hudi Issue Typ

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Component/s: Usability > [UMBRELLA] Improve CLI features and usabilities > ---

[jira] [Updated] (HUDI-1389) [UMBRELLA] Survey indexing technique for better query performance

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1389: - Component/s: Performance > [UMBRELLA] Survey indexing technique for better query performance > ---

[jira] [Updated] (HUDI-1237) [UMBRELLA] Checkstyle, formatting, warnings, spotless

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1237: - Component/s: (was: Testing) Code Cleanup > [UMBRELLA] Checkstyle, formatting, warning

[jira] [Updated] (HUDI-60) [UMBRELLA] Support Apache Beam for incremental tailing

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Summary: [UMBRELLA] Support Apache Beam for incremental tailing (was: [UMBRELLA] Beam IO module to support incr

[jira] [Updated] (HUDI-1389) [UMBRELLA] Survey indexing technique for better query performance

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1389: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Survey indexing technique for better que

[jira] [Updated] (HUDI-1388) [UMBRELLA] Improve CLI features and usabilities

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1388: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Improve CLI features and usabilities > -

[jira] [Updated] (HUDI-1387) [UMBRELLA] Support Apache Calcite for querying Hudi datasets

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1387: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Support Apache Calcite for querying Hudi

[jira] [Updated] (HUDI-1385) [UMBRELLA] Improve source ingestion support in DeltaStreamer

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1385: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Improve source ingestion support in Delt

[jira] [Updated] (HUDI-60) [UMBRELLA] Support Apache Beam for incremental tailing

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-60: --- Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Support Apache Beam for incremental tailing >

[jira] [Updated] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1390: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Support schema inference for unstructure

[jira] [Updated] (HUDI-1237) [UMBRELLA] Checkstyle, formatting, warnings, spotless

2020-11-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1237: - Labels: gsoc gsoc2021 mentor (was: gsoc2021 mentor) > [UMBRELLA] Checkstyle, formatting, warnings, spotle

[jira] [Commented] (HUDI-1387) [UMBRELLA] Support Apache Calcite for writing/querying Hudi datasets

2020-11-12 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17231039#comment-17231039 ] Raymond Xu commented on HUDI-1387: -- [~vinoth] ok sounds good. > [UMBRELLA] Support Apach

[jira] [Commented] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-14 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232147#comment-17232147 ] Raymond Xu commented on HUDI-1390: -- [~309637554] glad to hear that! we labelled a series

[jira] [Commented] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-15 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232381#comment-17232381 ] Raymond Xu commented on HUDI-1390: -- yes [~309637554] that is the intended use case. It'll

[jira] [Commented] (HUDI-1390) [UMBRELLA] Support schema inference for unstructured data

2020-11-16 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232904#comment-17232904 ] Raymond Xu commented on HUDI-1390: -- yes sounds good. > [UMBRELLA] Support schema inferen

[jira] [Updated] (HUDI-304) Bring back spotless plugin

2021-01-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-304: Labels: pull-request-available (was: bug-bash-0.6.0 help-wanted pull-request-available) > Bring back spotle

[jira] [Assigned] (HUDI-304) Bring back spotless plugin

2021-01-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-304: --- Assignee: Raymond Xu (was: leesf) > Bring back spotless plugin > --- > >

[jira] [Updated] (HUDI-304) Bring back spotless plugin

2021-01-09 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-304: Status: In Progress (was: Open) > Bring back spotless plugin > --- > >

[jira] [Commented] (HUDI-304) Bring back spotless plugin

2021-01-16 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17266549#comment-17266549 ] Raymond Xu commented on HUDI-304: - [~vinoth] Thanks, I've checked out flink's changes and m

[jira] [Assigned] (HUDI-393) Integrate with Azure Pipeline run the end to end tests

2021-01-16 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-393: --- Assignee: Raymond Xu > Integrate with Azure Pipeline run the end to end tests > --

[jira] [Commented] (HUDI-393) Integrate with Azure Pipeline run the end to end tests

2021-01-16 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17266551#comment-17266551 ] Raymond Xu commented on HUDI-393: - [~wangxianghu] I've started trying out some Azure config

[jira] [Created] (HUDI-3262) Integration test suite failure

2022-01-17 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-3262: Summary: Integration test suite failure Key: HUDI-3262 URL: https://issues.apache.org/jira/browse/HUDI-3262 Project: Apache Hudi Issue Type: Bug Components

[jira] [Comment Edited] (HUDI-3222) On-call team to triage GH issues, PRs, and JIRAs

2022-01-17 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17477396#comment-17477396 ] Raymond Xu edited comment on HUDI-3222 at 1/18/22, 5:34 AM: h4

[jira] [Comment Edited] (HUDI-3222) On-call team to triage GH issues, PRs, and JIRAs

2022-01-17 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17477396#comment-17477396 ] Raymond Xu edited comment on HUDI-3222 at 1/18/22, 5:38 AM: h4

[jira] [Comment Edited] (HUDI-3222) On-call team to triage GH issues, PRs, and JIRAs

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17477396#comment-17477396 ] Raymond Xu edited comment on HUDI-3222 at 1/18/22, 8:56 AM: h4

[jira] [Updated] (HUDI-2514) Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2514: - Status: In Progress (was: Open) > Add default hiveTableSerdeProperties for Spark SQL when sync Hive > ---

[jira] [Updated] (HUDI-2514) Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2514: - Status: Patch Available (was: In Progress) > Add default hiveTableSerdeProperties for Spark SQL when sync

[jira] [Updated] (HUDI-3222) On-call team to triage GH issues, PRs, and JIRAs

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3222: - Reviewers: Raymond Xu, sivabalan narayanan > On-call team to triage GH issues, PRs, and JIRAs > --

[jira] [Comment Edited] (HUDI-3222) On-call team to triage GH issues, PRs, and JIRAs

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17477396#comment-17477396 ] Raymond Xu edited comment on HUDI-3222 at 1/18/22, 3:49 PM: h4

[jira] [Updated] (HUDI-2837) The original hoodie.table.name should be maintained in Spark SQL

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2837: - Sprint: Cont' improve - 2021/01/18 > The original hoodie.table.name should be maintained in Spark SQL > -

[jira] [Updated] (HUDI-83) Map Timestamp type in spark to corresponding Timestamp type in Hive during Hive sync

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-83?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-83: --- Sprint: Cont' improve - 2021/01/18 > Map Timestamp type in spark to corresponding Timestamp type in Hive during

[jira] [Updated] (HUDI-1977) Fix Hudi-CLI show table spark-sql

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1977: - Sprint: Cont' improve - 2021/01/18 > Fix Hudi-CLI show table spark-sql > ---

[jira] [Updated] (HUDI-2732) Spark Datasource V2 integration RFC

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2732: - Priority: Blocker (was: Major) > Spark Datasource V2 integration RFC > -

[jira] [Updated] (HUDI-2732) Spark Datasource V2 integration RFC

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2732: - Fix Version/s: 0.11.0 > Spark Datasource V2 integration RFC > > >

[jira] [Updated] (HUDI-3161) Add Call Produce Command for spark sql

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3161: - Priority: Blocker (was: Major) > Add Call Produce Command for spark sql > ---

[jira] [Updated] (HUDI-2777) Data import performance deteriorates because multiple Spark jobs are started when data is written to disks.

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2777: - Sprint: Cont' improve - 2021/01/18 > Data import performance deteriorates because multiple Spark jobs are

[jira] [Updated] (HUDI-3204) spark on TimestampBasedKeyGenerator has no result when query by partition column

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3204: - Sprint: Cont' improve - 2021/01/18 > spark on TimestampBasedKeyGenerator has no result when query by part

[jira] [Updated] (HUDI-3240) ALTER TABLE rename breaks with managed table in Spark 2.4

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3240: - Sprint: Cont' improve - 2021/01/18 > ALTER TABLE rename breaks with managed table in Spark 2.4 >

[jira] [Updated] (HUDI-3237) ALTER TABLE column type change fails select query

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3237: - Sprint: Cont' improve - 2021/01/18 > ALTER TABLE column type change fails select query >

[jira] [Updated] (HUDI-3161) Add Call Produce Command for spark sql

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3161: - Fix Version/s: 0.11.0 > Add Call Produce Command for spark sql > -- >

[jira] [Updated] (HUDI-3213) compaction should not change the commit time

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3213: - Sprint: Cont' improve - 2021/01/18 > compaction should not change the commit time > -

[jira] [Updated] (HUDI-3200) File Index config affects partition fields shown in printSchema results

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3200: - Sprint: Cont' improve - 2021/01/18 > File Index config affects partition fields shown in printSchema resu

[jira] [Updated] (HUDI-3254) Introduce HoodieCatalog to manage tables for Spark Datasource V2

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3254: - Priority: Blocker (was: Major) > Introduce HoodieCatalog to manage tables for Spark Datasource V2 > -

<    1   2   3   4   5   6   7   8   9   10   >