[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSource and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-110: Fix Version/s: (was: 0.12.1) > Better defaults for Partition extractor for Spark DataSource and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: deltastreamer, spark, Usability >Reporter: Balaji Varadarajan >Priority: Major > Labels: user-support-issues > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSource and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-110: - Fix Version/s: 0.12.1 (was: 0.12.0) > Better defaults for Partition extractor for Spark DataSource and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: deltastreamer, spark, Usability >Reporter: Balaji Varadarajan >Priority: Major > Labels: user-support-issues > Fix For: 0.12.1 > > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSource and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-110: Fix Version/s: 0.12.0 (was: 0.11.0) > Better defaults for Partition extractor for Spark DataSource and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: deltastreamer, spark, Usability >Reporter: Balaji Varadarajan >Priority: Major > Labels: user-support-issues > Fix For: 0.12.0 > > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSource and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-110: - Priority: Major (was: Critical) > Better defaults for Partition extractor for Spark DataSource and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: deltastreamer, spark, Usability >Reporter: Balaji Varadarajan >Priority: Major > Labels: user-support-issues > Fix For: 0.11.0 > > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSource and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Forward Xu updated HUDI-110: Summary: Better defaults for Partition extractor for Spark DataSource and DeltaStreamer (was: Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer) > Better defaults for Partition extractor for Spark DataSource and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Critical > Labels: user-support-issues > Fix For: 0.11.0 > > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-110: Fix Version/s: 0.11.0 > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Critical > Labels: user-support-issues > Fix For: 0.11.0 > > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-110: Priority: Critical (was: Minor) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Critical > Labels: user-support-issues > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-110: - Labels: user-support-issues (was: ) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Minor > Labels: user-support-issues > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-110: Labels: (was: bug-bash-0.6.0 pull-request-available) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Assignee: Yanjia Gary Li >Priority: Minor > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-110: Labels: bug-bash-0.6.0 pull-request-available (was: bug-bash-0.6.0) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi (incubating) > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Assignee: Yanjia Gary Li >Priority: Minor > Labels: bug-bash-0.6.0, pull-request-available > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-110: Status: In Progress (was: Open) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi (incubating) > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Assignee: Yanjia Gary Li >Priority: Minor > Labels: bug-bash-0.6.0 > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-110: - Labels: bug-bash-0.6.0 (was: ) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi (incubating) > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Minor > Labels: bug-bash-0.6.0 > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-110: Status: New (was: Open) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi (incubating) > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Minor > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-110: Status: Open (was: New) > Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer > -- > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi (incubating) > Issue Type: Improvement > Components: DeltaStreamer, Spark Integration, Usability >Reporter: Balaji Varadarajan >Priority: Minor > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.3.4#803005)