[ https://issues.apache.org/jira/browse/SPARK-16317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-16317: ------------------------------------ Assignee: Apache Spark > Add file filtering interface for FileFormat > ------------------------------------------- > > Key: SPARK-16317 > URL: https://issues.apache.org/jira/browse/SPARK-16317 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 2.0.0 > Reporter: Cheng Lian > Assignee: Apache Spark > Priority: Minor > > {{FileFormat}} data sources like Parquet and Avro (provided by spark-avro) > have customized file filtering logics. For example, Parquet needs to filter > out summary files, while Avro provides a Hadoop configuration option to > filter out all files whose names don't end with ".avro". > It would be nice to have a general file filtering interface in {{FileFormat}} > to handle similar requirements. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org