You can run hadoop checknative -a and see if bzip2 is detected correctly.
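
The check above can be scripted. This is only a minimal sketch: it assumes `hadoop` is on the PATH and that `checknative -a` prints one line per library in a form like "bzip2:   true /path/to/libbz2.so". The helper name bzip2_native_ok is made up for illustration.

```shell
# Hypothetical helper: reads `hadoop checknative -a` output on stdin and
# succeeds only if the bzip2 line reports "true".
# Assumes a line format like "bzip2:   true /lib64/libbz2.so.1".
bzip2_native_ok() {
    grep -Eq '^bzip2:[[:space:]]+true'
}

# Usage (commented out so the snippet has no side effects when sourced):
# hadoop checknative -a 2>&1 | bzip2_native_ok \
#     && echo "native bzip2 detected" \
#     || echo "native bzip2 NOT detected"
```

If bzip2 shows up as true on the driver host but the job still fails on a non-local master, the native directory probably also has to reach the executors, as Marcelo notes below. One way to do that is to pass --conf spark.executor.extraLibraryPath=$HADOOP_HOME/lib/native/ to spark-submit.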

-- 
Ruslan Dautkhanov

On Fri, Jun 26, 2015 at 10:18 AM, Marcelo Vanzin <van...@cloudera.com> wrote:

> What master are you using? If this is not a "local" master, you'll need to
> set LD_LIBRARY_PATH on the executors also (using
> spark.executor.extraLibraryPath).
>
> If you are using local, then I don't know what's going on.
>
> On Fri, Jun 26, 2015 at 1:39 AM, Arunabha Ghosh <arunabha...@gmail.com>
> wrote:
>
>> Hi,
>> I'm having trouble reading Bzip2 compressed sequence files after I
>> enabled hadoop native libraries in spark.
>>
>> Running
>> LD_LIBRARY_PATH=$HADOOP_HOME/lib/native/ $SPARK_HOME/bin/spark-submit
>> --class .... gives the following error
>>
>> 5/06/26 00:48:02 INFO CodecPool: Got brand-new decompressor [.bz2]
>> 15/06/26 00:48:02 ERROR Executor: Exception in task 3.0 in stage 0.0 (TID 3)
>> java.lang.UnsupportedOperationException
>>   at org.apache.hadoop.io.compress.bzip2.BZip2DummyDecompressor.decompress(BZip2DummyDecompressor.java:32)
>>   at org.apache.hadoop.io.compress.DecompressorStream.decompress(DecompressorStream.java:91)
>>   at org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:85)
>>   at java.io.DataInputStream.readFully(DataInputStream.java:195)
>>   at java.io.DataInputStream.readLong(DataInputStream.java:416)
>>
>> removing the LD_LIBRARY_PATH makes spark run fine but it gives the
>> following warning
>> WARN NativeCodeLoader: Unable to load native-hadoop library for your
>> platform... using builtin-java classes where applicable
>>
>> Has anyone else run into this issue ? Any help is welcome.
>>
>
> --
> Marcelo