[ https://issues.apache.org/jira/browse/SPARK-14678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-14678: ------------------------------------ Assignee: Shixiong Zhu (was: Apache Spark) > Add a file sink log to support versioning and compaction > -------------------------------------------------------- > > Key: SPARK-14678 > URL: https://issues.apache.org/jira/browse/SPARK-14678 > Project: Spark > Issue Type: Improvement > Components: SQL > Reporter: Shixiong Zhu > Assignee: Shixiong Zhu > > To use FileStreamSink in production, there are two requirements for > FileStreamSink's log: > 1.Versioning. A future Spark version should be able to read the metadata of > an old FileStreamSink. > 2. Compaction. As reading from many small files is usually pretty slow, we > should compact small metadata files into big files. > See the PR description for more details. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org