[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sagar Sumit updated HUDI-110: ----------------------------- Fix Version/s: 0.12.1 (was: 0.12.0) > Better defaults for Partition extractor for Spark DataSource and DeltaStreamer > ------------------------------------------------------------------------------ > > Key: HUDI-110 > URL: https://issues.apache.org/jira/browse/HUDI-110 > Project: Apache Hudi > Issue Type: Improvement > Components: deltastreamer, spark, Usability > Reporter: Balaji Varadarajan > Priority: Major > Labels: user-support-issues > Fix For: 0.12.1 > > > Currently > SlashEncodedDayPartitionValueExtractor is the default being used. This is not > a common format outside Uber. > > Also, Spark DataSource provides partitionedBy clauses which has not been > integrated for Hudi Data Source. We need to investigate how we can leverage > partitionBy clause for partitioning. -- This message was sent by Atlassian Jira (v8.20.10#820010)