Here is a tutorial on how to write your own file format in Hadoop: https://developer.yahoo.com/hadoop/tutorial/module5.html#fileformat
Once you have your own file format, you can use it in Spark the same way you used TextInputFormat in this post.
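For illustration, here is a rough sketch of what that might look like in Scala. It assumes the new Hadoop API (org.apache.hadoop.mapreduce) and newAPIHadoopFile. NonSplittableTextInputFormat, the application name, and the input path argument are hypothetical names made up for this example, not something taken from the tutorial.

```scala
import org.apache.hadoop.fs.Path
import org.apache.hadoop.io.{LongWritable, Text}
import org.apache.hadoop.mapreduce.JobContext
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical custom format: reads lines exactly like TextInputFormat,
// but never splits a file, so each file is handled by a single task.
class NonSplittableTextInputFormat extends TextInputFormat {
  override protected def isSplitable(context: JobContext, file: Path): Boolean = false
}

object CustomFormatExample {
  def main(args: Array[String]): Unit = {
    // The master URL is expected to come from spark-submit.
    val sc = new SparkContext(new SparkConf().setAppName("custom-input-format"))

    // Same call shape as with TextInputFormat; only the format class changes.
    val records = sc.newAPIHadoopFile(
      args(0),                               // input path (assumed to be passed in)
      classOf[NonSplittableTextInputFormat], // the custom InputFormat
      classOf[LongWritable],                 // key type: byte offset of the line
      classOf[Text],                         // value type: the line contents
      sc.hadoopConfiguration)

    // Hadoop reuses Writable instances, so copy each value to a String
    // before caching or collecting.
    val lines = records.map { case (_, line) => line.toString }
    println(lines.count())

    sc.stop()
  }
}
```

If your format produces different key or value types, the kClass and vClass arguments have to match the type parameters of your InputFormat.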