Hello everyone.
It seems the PySpark DataFrame reader is broken when reading multiple files.
sql.read.json("file1,file2") fails with java.io.IOException: No input
paths specified in job.
This used to work in Spark 1.4, and it still works with sc.textFile.
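
A possible workaround, sketched here and not verified against 1.5.1: since sc.textFile still accepts a comma-separated path list, the lines can be read that way and then parsed with SQLContext.jsonRDD (deprecated since 1.4, but present in the 1.x Python API). The helper name read_json_files is hypothetical, and the sketch assumes each input line holds one JSON object.

```python
def read_json_files(sc, sql_context, paths):
    """Load several JSON-lines files into a single DataFrame.

    sc is a SparkContext and sql_context a SQLContext (Spark 1.x).
    read_json_files is a hypothetical helper, not part of PySpark.
    """
    # sc.textFile accepts a comma-separated list of paths; this is the
    # code path reported above as still working.
    lines = sc.textFile(",".join(paths))
    # jsonRDD parses an RDD of strings, one JSON object per string.
    return sql_context.jsonRDD(lines)
```

With the names used above, the call would be df = read_json_files(sc, sql, ["file1", "file2"]).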
Blaž
Also missing is
http://s3.amazonaws.com/spark-related-packages/spark-1.5.1-bin-hadoop1.tgz
which breaks the spark-ec2 script.
On Mon, Oct 5, 2015 at 5:20 AM, Ted Yu wrote:
> hadoop1 package for Scala 2.10 wasn't in RC1 either:
> http://people.apache.org/~pwendell/spark-releases/spark-1.5.1-rc1-bin/