[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HUDI-110:
--------------------------------
    Labels: bug-bash-0.6.0 pull-request-available  (was: bug-bash-0.6.0)

> Better defaults for Partition extractor for Spark DataSource and DeltaStreamer
> ------------------------------------------------------------------------------
>
>                 Key: HUDI-110
>                 URL: https://issues.apache.org/jira/browse/HUDI-110
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: DeltaStreamer, Spark Integration, Usability
>            Reporter: Balaji Varadarajan
>            Assignee: Yanjia Gary Li
>            Priority: Minor
>              Labels: bug-bash-0.6.0, pull-request-available
>
> Currently, SlashEncodedDayPartitionValueExtractor is the default partition
> extractor. This is not a common format outside Uber.
>
> Also, Spark DataSource provides a partitionBy clause, which has not been
> integrated into the Hudi Data Source. We need to investigate how we can
> leverage the partitionBy clause for partitioning.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
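
For readers unfamiliar with the format named in the issue, here is a minimal sketch of the difference between a slash-encoded day partition path (yyyy/mm/dd) and the Hive-style key=value layout that Spark's partitionBy writes. It is an illustration in plain Python, not Hudi's implementation: both function names are hypothetical, and the exact validation done by the real SlashEncodedDayPartitionValueExtractor is an assumption.

```python
from datetime import date
from typing import List


def extract_slash_encoded_day(partition_path: str) -> List[str]:
    """Hypothetical helper: map a 'yyyy/mm/dd' path to one 'yyyy-mm-dd' value."""
    parts = partition_path.split("/")
    if len(parts) != 3:
        raise ValueError(
            "Expected a yyyy/mm/dd partition path, got: " + partition_path
        )
    # int() raises ValueError for non-numeric segments such as 'dt=2020-05-17'.
    year, month, day = (int(p) for p in parts)
    # date() validates the calendar day; isoformat() yields 'yyyy-mm-dd'.
    return [date(year, month, day).isoformat()]


def extract_hive_style(partition_path: str) -> List[str]:
    """Hypothetical helper: read values from a 'key=value/key=value' path."""
    values = []
    for segment in partition_path.split("/"):
        _key, sep, value = segment.partition("=")
        if not sep:
            raise ValueError("Expected key=value, got: " + segment)
        values.append(value)
    return values


if __name__ == "__main__":
    print(extract_slash_encoded_day("2020/05/17"))
    print(extract_hive_style("region=us/dt=2020-05-17"))
```

A path laid out as region=us/dt=2020-05-17 is rejected by the slash-encoded day sketch, which is the kind of mismatch the issue is pointing at when it asks for a more common default.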