PathFilter for newAPIHadoopFile?

Eric Friedman Sun, 14 Sep 2014 19:38:43 -0700

Hi,

I have a directory structure with parquet+avro data in it. There are a
couple of administrative files (.foo and/or _foo) that I need to ignore
when processing this data or Spark tries to read them as containing parquet
content, which they do not.


How can I set a PathFilter on the FileInputFormat used to construct an RDD?

PathFilter for newAPIHadoopFile?

Reply via email to