Re: LZO configuration can not affect

2015-03-19 Thread Ted Yu
jeanlyn92:
I was not very clear in my previous reply: I meant to refer to the specific jar,
/home/hadoop/mylib/hadoop-lzo-SNAPSHOT.jar

But it looks like the distro includes hadoop-lzo-0.4.15.jar

Cheers
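
Putting the replies in this thread together, the fix both posters converge on is to name the jar file itself rather than its containing directory. A minimal spark-env.sh sketch along those lines follows; the paths are taken from the thread, and which jar is correct (the local snapshot or the bundled hadoop-lzo-0.4.15.jar) depends on the cluster, so treat the values as assumptions:

    # spark-env.sh sketch (Spark 1.x); paths are assumptions taken from this thread
    # native LZO libraries (.so) belong on the library path
    export SPARK_LIBRARY_PATH="$HADOOP_HOME/lib/native"
    # the jar must be named explicitly; listing only /home/hadoop/mylib is not enough
    export SPARK_CLASSPATH="/home/hadoop/mylib/hadoop-lzo-SNAPSHOT.jar:$SPARK_CLASSPATH"

Note this variant puts the jar on the classpath instead of on SPARK_LIBRARY_PATH as suggested below; SPARK_CLASSPATH was still honored (though deprecated) in Spark 1.x, and the library path is conventionally reserved for native libraries.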

On Thu, Mar 19, 2015 at 6:26 PM, jeanlyn92 wrote:

> That's not enough. The config must point to the specific jar file instead of
> the folder.
>
> 2015-03-19 21:27 GMT+08:00 Ted Yu:
>
>> If I read the screenshot correctly, the hadoop-lzo jar is under
>> /home/hadoop/mylib
>>
>> Cheers
>>
>>
>>
>> On Mar 19, 2015, at 5:37 AM, jeanlyn92 wrote:
>>
>> You should configure it as follows:
>> export
>> SPARK_LIBRARY_PATH="$HADOOP_HOME/lib/native:$HADOOP_HOME/share/hadoop/common/lib/hadoop-lzo-0.4.15.jar"
>>
>>
>> On 03/19/2015 05:25 PM, Ted Yu wrote:
>>
>> How did you generate the Hadoop-lzo jar?
>>
>> Thanks
>>
>>
>>
>>
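
The jar name hadoop-lzo-SNAPSHOT.jar suggests a locally built snapshot rather than a release artifact. For background, one common way such a jar is produced is by building the twitter/hadoop-lzo project from source; a hypothetical sketch, assuming Maven and the LZO development headers (e.g. lzo-devel) are installed:

    # hypothetical build of hadoop-lzo from source; the build also compiles the
    # native gplcompression library, which is why the LZO headers are needed
    git clone https://github.com/twitter/hadoop-lzo.git
    cd hadoop-lzo
    mvn clean package -DskipTests
    # the jar lands under target/, e.g. target/hadoop-lzo-<version>-SNAPSHOT.jar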
>> On Mar 17, 2015, at 2:36 AM, 唯我者 <878223...@qq.com> wrote:
>>
>> Hi, everybody:
>>    I have configured the LZO environment like this:
>> [two screenshot attachments showing the LZO configuration]
>>
>> But when I execute code in spark-shell, the error still comes out like this:
>>
>> scala> val hdfsfile=sc.textFile("/xiaoming/gps_info")
>>
>> scala> hdfsfile.map(_.split(","))
>>
>> scala> res0.collect
>> java.lang.RuntimeException: Error in configuring object
>> at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
>> at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
>> at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
>> at org.apache.spark.rdd.HadoopRDD.getInputFormat(HadoopRDD.scala:184)
>> at org.apache.spark.rdd.HadoopRDD.getPartitions(HadoopRDD.scala:197)
>> at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:222)
>> at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:220)
>> at scala.Option.getOrElse(Option.scala:120)
>> at org.apache.spark.rdd.RDD.partitions(RDD.scala:220)
>> at org.apache.spark.rdd.MappedRDD.getPartitions(MappedRDD.scala:28)
>> at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:222)
>> at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:220)
>> at scala.Option.getOrElse(Option.scala:120)
>> at org.apache.spark.rdd.RDD.partitions(RDD.scala:220)
>> at org.apache.spark.rdd.MappedRDD.getPartitions(MappedRDD.scala:28)
>> at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:222)
>> at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:220)
>> at scala.Option.getOrElse(Option.scala:120)
>> at org.apache.spark.rdd.RDD.partitions(RDD.scala:220)
>> at org.apache.spark.SparkContext.runJob(SparkContext.scala:1367)
>> at org.apache.spark.rdd.RDD.collect(RDD.scala:797)
>> at $iwC$$iwC$$iwC$$iwC.<init>(<console>:17)
>> at $iwC$$iwC$$iwC.<init>(<console>:22)
>> at $iwC$$iwC.<init>(<console>:24)
>> at $iwC.<init>(<console>:26)
>> at <init>(<console>:28)
>> at .<init>(<console>:32)
>> at .<clinit>(<console>)
>> at .<init>(<console>:7)
>> at .<clinit>(<console>)
>> at $print(<console>)
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:606)
>> at org.apache.spark.repl.SparkIMain$ReadEvalPrint.call(SparkIMain.scala:852)
>> at org.apache.spark.repl.SparkIMain$Request.loadAndRun(SparkIMain.scala:1125)
>> at org.apache.spark.repl.SparkIMain.loadAndRunReq$1(SparkIMain.scala:674)
>> at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:705)
>> at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:669)
>> at org.apache.spark.repl.SparkILoop.reallyInterpret$1(SparkILoop.scala:828)
>> at org.apache.spark.repl.SparkILoop.interpretStartingWith(SparkILoop.scala:873)
>> at org.apache.spark.repl.SparkILoop.command(SparkILoop.scala:785)
>> at org.apache.spark.repl.SparkILoop.processLine$1(SparkILoop.scala:628)
>> at org.apache.spark.repl.SparkILoop.innerLoop$1(SparkILoop.scala:636)
>> at org.apache.spark.repl.SparkILoop.loop(SparkILoop.scala:641)
>> at org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply$mcZ$sp(SparkILoop.scala:968)
>> at org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:916)
>> at org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:916)
>> at scala.tools.nsc.util.ScalaClassLoader$.savingContextLoader(ScalaClassLoader.scala:135)
>> at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:916)
>> at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:1011)
>> at org.apache.spark.repl.Main$.main(Main.scala:31)
>> at org.apache.spark.repl.Main.main(Main.scala)
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:606)
>> at org.apache.spark.deploy.SparkSubmit$.launch(SparkSubmit.scala:358)
>> at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:75)
>> at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
>> Caused by: java.lang.reflect.InvocationTargetException
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
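
An "Error in configuring object" thrown from ReflectionUtils.setJobConf at this point usually means the Hadoop configuration names a codec class, typically com.hadoop.compression.lzo.LzoCodec in core-site.xml's io.compression.codecs, that the driver or executors cannot load, which is consistent with the advice above that the jar itself was never on the classpath. A hedged alternative to editing spark-env.sh is to pass the jar and native path at launch; the options below exist in recent Spark 1.x releases, but the paths are assumptions taken from this thread:

    # sketch: put the specific jar on both driver and executor classpaths and
    # the native LZO libraries on the executor library path
    spark-shell \
      --driver-class-path /home/hadoop/mylib/hadoop-lzo-SNAPSHOT.jar \
      --conf spark.executor.extraClassPath=/home/hadoop/mylib/hadoop-lzo-SNAPSHOT.jar \
      --conf spark.executor.extraLibraryPath="$HADOOP_HOME/lib/native"

Either way, the point of the thread stands: a configuration that "can not affect" anything is usually one the JVM never saw, because it referenced a folder rather than the jar itself.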
