Re: Anyone wants to look at SPARK-1123?

2014-02-24 Thread Nan Zhu
I just misread the API doc, and forgot to pass the type information when 
calling this 

Best, 

-- 
Nan Zhu


On Monday, February 24, 2014 at 8:17 AM, Mridul Muralidharan wrote:

> Curious, what was the issue ?
> 
> - Mridul
> 
> On Sun, Feb 23, 2014 at 11:41 PM, Nan Zhu  (mailto:zhunanmcg...@gmail.com)> wrote:
> > OK, I know where I was wrong
> > 
> > 
> > Best,
> > 
> > --
> > Nan Zhu
> > Sent with Sparrow (http://www.sparrowmailapp.com/?sig)
> > 
> > 
> > On Sunday, February 23, 2014 at 12:50 PM, Nan Zhu wrote:
> > 
> > > String, it should be get the following helper function
> > > 
> > > private[spark] def getKeyClass() = implicitly[ClassTag[K]].runtimeClass
> > > 
> > > private[spark] def getValueClass() = implicitly[ClassTag[V]].runtimeClass
> > > 
> > > and this is what I run
> > > 
> > > scala> val a = sc.textFile("/Users/nanzhu/code/incubator-spark/LICENSE", 
> > > 2).map(line => ("a", "b"))
> > > 
> > > scala> a.saveAsNewAPIHadoopFile("/Users/nanzhu/code/output_rdd")
> > > java.lang.InstantiationException
> > > at 
> > > sun.reflect.InstantiationExceptionConstructorAccessorImpl.newInstance(InstantiationExceptionConstructorAccessorImpl.java:48)
> > > at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
> > > at java.lang.Class.newInstance(Class.java:374)
> > > at 
> > > org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:632)
> > > at 
> > > org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:590)
> > > at $iwC$$iwC$$iwC$$iwC.(:15)
> > > at $iwC$$iwC$$iwC.(:20)
> > > at $iwC$$iwC.(:22)
> > > at $iwC.(:24)
> > > at (:26)
> > > at .(:30)
> > > at .()
> > > at .(:7)
> > > at .()
> > > at $print()
> > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > > at 
> > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> > > at 
> > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> > > at java.lang.reflect.Method.invoke(Method.java:606)
> > > at 
> > > org.apache.spark.repl.SparkIMain$ReadEvalPrint.call(SparkIMain.scala:774)
> > > at 
> > > org.apache.spark.repl.SparkIMain$Request.loadAndRun(SparkIMain.scala:1042)
> > > at org.apache.spark.repl.SparkIMain.loadAndRunReq$1(SparkIMain.scala:611)
> > > at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:642)
> > > at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:606)
> > > at 
> > > org.apache.spark.repl.SparkILoop.reallyInterpret$1(SparkILoop.scala:790)
> > > at 
> > > org.apache.spark.repl.SparkILoop.interpretStartingWith(SparkILoop.scala:835)
> > > at org.apache.spark.repl.SparkILoop.command(SparkILoop.scala:747)
> > > at org.apache.spark.repl.SparkILoop.processLine$1(SparkILoop.scala:595)
> > > at org.apache.spark.repl.SparkILoop.innerLoop$1(SparkILoop.scala:602)
> > > at org.apache.spark.repl.SparkILoop.loop(SparkILoop.scala:605)
> > > at 
> > > org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply$mcZ$sp(SparkILoop.scala:928)
> > > at 
> > > org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
> > > at 
> > > org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
> > > at 
> > > scala.tools.nsc.util.ScalaClassLoader$.savingContextLoader(ScalaClassLoader.scala:135)
> > > at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:878)
> > > at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:970)
> > > at org.apache.spark.repl.Main$.main(Main.scala:31)
> > > at org.apache.spark.repl.Main.main(Main.scala)
> > > 
> > > 
> > > 
> > > 
> > > 
> > > 
> > > --
> > > Nan Zhu
> > > 
> > > 
> > > On Sunday, February 23, 2014 at 11:06 AM, Nick Pentreath wrote:
> > > 
> > > > Hi
> > > > 
> > > > What KeyClass and ValueClass are you trying to save as the keys/values 
> > > > of
> > > > your dataset?
> > > > 
> > > > 
> > > > 
> > > > On Sun, Feb 23, 2014 at 10:48 AM, Nan Zhu  > > > (mailto:zhunanmcg...@gmail.com)> wrote:
> > > > 
> > > > > Hi, all
> > > > > 
> > > > > I found the weird thing on saveAsNewAPIHadoopFile in
> > > > > PairRDDFunctions.scala when working on the other issue,
> > > > > 
> > > > > saveAsNewAPIHadoopFile throws java.lang.InstantiationException all 
> > > > > the time
> > > > > 
> > > > > I checked the commit history of the file, it seems that the API 
> > > > > exists for
> > > > > a long time, no one else found this? (that's the reason I'm confusing)
> > > > > 
> > > > > Best,
> > > > > 
> > > > > --
> > > > > Nan Zhu
> > > > > 
> > > > 
> > > > 
> > > 
> > > 
> > 
> > 
> 
> 
> 




Re: Anyone wants to look at SPARK-1123?

2014-02-24 Thread Mridul Muralidharan
Curious, what was the issue ?

- Mridul

On Sun, Feb 23, 2014 at 11:41 PM, Nan Zhu  wrote:
> OK, I know where I was wrong
>
>
> Best,
>
> --
> Nan Zhu
> Sent with Sparrow (http://www.sparrowmailapp.com/?sig)
>
>
> On Sunday, February 23, 2014 at 12:50 PM, Nan Zhu wrote:
>
>> String, it should be get the following helper function
>>
>> private[spark] def getKeyClass() = implicitly[ClassTag[K]].runtimeClass
>>
>> private[spark] def getValueClass() = implicitly[ClassTag[V]].runtimeClass
>>
>> and this is what I run
>>
>> scala> val a = sc.textFile("/Users/nanzhu/code/incubator-spark/LICENSE", 
>> 2).map(line => ("a", "b"))
>>
>> scala> a.saveAsNewAPIHadoopFile("/Users/nanzhu/code/output_rdd")
>> java.lang.InstantiationException
>> at 
>> sun.reflect.InstantiationExceptionConstructorAccessorImpl.newInstance(InstantiationExceptionConstructorAccessorImpl.java:48)
>> at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
>> at java.lang.Class.newInstance(Class.java:374)
>> at 
>> org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:632)
>> at 
>> org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:590)
>> at $iwC$$iwC$$iwC$$iwC.(:15)
>> at $iwC$$iwC$$iwC.(:20)
>> at $iwC$$iwC.(:22)
>> at $iwC.(:24)
>> at (:26)
>> at .(:30)
>> at .()
>> at .(:7)
>> at .()
>> at $print()
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at 
>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>> at 
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:606)
>> at org.apache.spark.repl.SparkIMain$ReadEvalPrint.call(SparkIMain.scala:774)
>> at org.apache.spark.repl.SparkIMain$Request.loadAndRun(SparkIMain.scala:1042)
>> at org.apache.spark.repl.SparkIMain.loadAndRunReq$1(SparkIMain.scala:611)
>> at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:642)
>> at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:606)
>> at org.apache.spark.repl.SparkILoop.reallyInterpret$1(SparkILoop.scala:790)
>> at 
>> org.apache.spark.repl.SparkILoop.interpretStartingWith(SparkILoop.scala:835)
>> at org.apache.spark.repl.SparkILoop.command(SparkILoop.scala:747)
>> at org.apache.spark.repl.SparkILoop.processLine$1(SparkILoop.scala:595)
>> at org.apache.spark.repl.SparkILoop.innerLoop$1(SparkILoop.scala:602)
>> at org.apache.spark.repl.SparkILoop.loop(SparkILoop.scala:605)
>> at 
>> org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply$mcZ$sp(SparkILoop.scala:928)
>> at 
>> org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
>> at 
>> org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
>> at 
>> scala.tools.nsc.util.ScalaClassLoader$.savingContextLoader(ScalaClassLoader.scala:135)
>> at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:878)
>> at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:970)
>> at org.apache.spark.repl.Main$.main(Main.scala:31)
>> at org.apache.spark.repl.Main.main(Main.scala)
>>
>>
>>
>>
>>
>>
>> --
>> Nan Zhu
>>
>>
>> On Sunday, February 23, 2014 at 11:06 AM, Nick Pentreath wrote:
>>
>> > Hi
>> >
>> > What KeyClass and ValueClass are you trying to save as the keys/values of
>> > your dataset?
>> >
>> >
>> >
>> > On Sun, Feb 23, 2014 at 10:48 AM, Nan Zhu > > (mailto:zhunanmcg...@gmail.com)> wrote:
>> >
>> > > Hi, all
>> > >
>> > > I found the weird thing on saveAsNewAPIHadoopFile in
>> > > PairRDDFunctions.scala when working on the other issue,
>> > >
>> > > saveAsNewAPIHadoopFile throws java.lang.InstantiationException all the 
>> > > time
>> > >
>> > > I checked the commit history of the file, it seems that the API exists 
>> > > for
>> > > a long time, no one else found this? (that's the reason I'm confusing)
>> > >
>> > > Best,
>> > >
>> > > --
>> > > Nan Zhu
>> > >
>> >
>> >
>> >
>> >
>>
>>
>


Re: Anyone wants to look at SPARK-1123?

2014-02-23 Thread Nan Zhu
OK, I know where I was wrong 


Best, 

-- 
Nan Zhu
Sent with Sparrow (http://www.sparrowmailapp.com/?sig)


On Sunday, February 23, 2014 at 12:50 PM, Nan Zhu wrote:

> String, it should be get the following helper function 
> 
> private[spark] def getKeyClass() = implicitly[ClassTag[K]].runtimeClass
> 
> private[spark] def getValueClass() = implicitly[ClassTag[V]].runtimeClass
> 
> and this is what I run 
> 
> scala> val a = sc.textFile("/Users/nanzhu/code/incubator-spark/LICENSE", 
> 2).map(line => ("a", "b"))
> 
> scala> a.saveAsNewAPIHadoopFile("/Users/nanzhu/code/output_rdd")
> java.lang.InstantiationException
> at 
> sun.reflect.InstantiationExceptionConstructorAccessorImpl.newInstance(InstantiationExceptionConstructorAccessorImpl.java:48)
> at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
> at java.lang.Class.newInstance(Class.java:374)
> at 
> org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:632)
> at 
> org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:590)
> at $iwC$$iwC$$iwC$$iwC.(:15)
> at $iwC$$iwC$$iwC.(:20)
> at $iwC$$iwC.(:22)
> at $iwC.(:24)
> at (:26)
> at .(:30)
> at .()
> at .(:7)
> at .()
> at $print()
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:606)
> at org.apache.spark.repl.SparkIMain$ReadEvalPrint.call(SparkIMain.scala:774)
> at org.apache.spark.repl.SparkIMain$Request.loadAndRun(SparkIMain.scala:1042)
> at org.apache.spark.repl.SparkIMain.loadAndRunReq$1(SparkIMain.scala:611)
> at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:642)
> at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:606)
> at org.apache.spark.repl.SparkILoop.reallyInterpret$1(SparkILoop.scala:790)
> at 
> org.apache.spark.repl.SparkILoop.interpretStartingWith(SparkILoop.scala:835)
> at org.apache.spark.repl.SparkILoop.command(SparkILoop.scala:747)
> at org.apache.spark.repl.SparkILoop.processLine$1(SparkILoop.scala:595)
> at org.apache.spark.repl.SparkILoop.innerLoop$1(SparkILoop.scala:602)
> at org.apache.spark.repl.SparkILoop.loop(SparkILoop.scala:605)
> at 
> org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply$mcZ$sp(SparkILoop.scala:928)
> at 
> org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
> at 
> org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
> at 
> scala.tools.nsc.util.ScalaClassLoader$.savingContextLoader(ScalaClassLoader.scala:135)
> at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:878)
> at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:970)
> at org.apache.spark.repl.Main$.main(Main.scala:31)
> at org.apache.spark.repl.Main.main(Main.scala)
> 
> 
> 
> 
> 
> 
> -- 
> Nan Zhu
> 
> 
> On Sunday, February 23, 2014 at 11:06 AM, Nick Pentreath wrote:
> 
> > Hi
> > 
> > What KeyClass and ValueClass are you trying to save as the keys/values of
> > your dataset?
> > 
> > 
> > 
> > On Sun, Feb 23, 2014 at 10:48 AM, Nan Zhu  > (mailto:zhunanmcg...@gmail.com)> wrote:
> > 
> > > Hi, all
> > > 
> > > I found the weird thing on saveAsNewAPIHadoopFile in
> > > PairRDDFunctions.scala when working on the other issue,
> > > 
> > > saveAsNewAPIHadoopFile throws java.lang.InstantiationException all the 
> > > time
> > > 
> > > I checked the commit history of the file, it seems that the API exists for
> > > a long time, no one else found this? (that's the reason I'm confusing)
> > > 
> > > Best,
> > > 
> > > --
> > > Nan Zhu
> > > 
> > 
> > 
> > 
> > 
> 
> 



Re: Anyone wants to look at SPARK-1123?

2014-02-23 Thread Nan Zhu
String, it should be get the following helper function 

private[spark] def getKeyClass() = implicitly[ClassTag[K]].runtimeClass

private[spark] def getValueClass() = implicitly[ClassTag[V]].runtimeClass

and this is what I run 

scala> val a = sc.textFile("/Users/nanzhu/code/incubator-spark/LICENSE", 
2).map(line => ("a", "b"))

scala> a.saveAsNewAPIHadoopFile("/Users/nanzhu/code/output_rdd")
java.lang.InstantiationException
at 
sun.reflect.InstantiationExceptionConstructorAccessorImpl.newInstance(InstantiationExceptionConstructorAccessorImpl.java:48)
at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
at java.lang.Class.newInstance(Class.java:374)
at 
org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:632)
at 
org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:590)
at $iwC$$iwC$$iwC$$iwC.(:15)
at $iwC$$iwC$$iwC.(:20)
at $iwC$$iwC.(:22)
at $iwC.(:24)
at (:26)
at .(:30)
at .()
at .(:7)
at .()
at $print()
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.spark.repl.SparkIMain$ReadEvalPrint.call(SparkIMain.scala:774)
at org.apache.spark.repl.SparkIMain$Request.loadAndRun(SparkIMain.scala:1042)
at org.apache.spark.repl.SparkIMain.loadAndRunReq$1(SparkIMain.scala:611)
at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:642)
at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:606)
at org.apache.spark.repl.SparkILoop.reallyInterpret$1(SparkILoop.scala:790)
at org.apache.spark.repl.SparkILoop.interpretStartingWith(SparkILoop.scala:835)
at org.apache.spark.repl.SparkILoop.command(SparkILoop.scala:747)
at org.apache.spark.repl.SparkILoop.processLine$1(SparkILoop.scala:595)
at org.apache.spark.repl.SparkILoop.innerLoop$1(SparkILoop.scala:602)
at org.apache.spark.repl.SparkILoop.loop(SparkILoop.scala:605)
at 
org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply$mcZ$sp(SparkILoop.scala:928)
at 
org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
at 
org.apache.spark.repl.SparkILoop$$anonfun$process$1.apply(SparkILoop.scala:878)
at 
scala.tools.nsc.util.ScalaClassLoader$.savingContextLoader(ScalaClassLoader.scala:135)
at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:878)
at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:970)
at org.apache.spark.repl.Main$.main(Main.scala:31)
at org.apache.spark.repl.Main.main(Main.scala)






-- 
Nan Zhu


On Sunday, February 23, 2014 at 11:06 AM, Nick Pentreath wrote:

> Hi
> 
> What KeyClass and ValueClass are you trying to save as the keys/values of
> your dataset?
> 
> 
> 
> On Sun, Feb 23, 2014 at 10:48 AM, Nan Zhu  (mailto:zhunanmcg...@gmail.com)> wrote:
> 
> > Hi, all
> > 
> > I found the weird thing on saveAsNewAPIHadoopFile in
> > PairRDDFunctions.scala when working on the other issue,
> > 
> > saveAsNewAPIHadoopFile throws java.lang.InstantiationException all the time
> > 
> > I checked the commit history of the file, it seems that the API exists for
> > a long time, no one else found this? (that's the reason I'm confusing)
> > 
> > Best,
> > 
> > --
> > Nan Zhu
> > 
> 
> 
> 




Re: Anyone wants to look at SPARK-1123?

2014-02-23 Thread Nick Pentreath
Hi

What KeyClass and ValueClass are you trying to save as the keys/values of
your dataset?



On Sun, Feb 23, 2014 at 10:48 AM, Nan Zhu  wrote:

> Hi, all
>
> I found the weird thing on saveAsNewAPIHadoopFile  in
> PairRDDFunctions.scala when working on the other issue,
>
> saveAsNewAPIHadoopFile throws java.lang.InstantiationException all the time
>
> I checked the commit history of the file, it seems that the API exists for
> a long time, no one else found this? (that's the reason I'm confusing)
>
> Best,
>
> --
> Nan Zhu
>
>


Anyone wants to look at SPARK-1123?

2014-02-23 Thread Nan Zhu
Hi, all  

I found the weird thing on saveAsNewAPIHadoopFile  in PairRDDFunctions.scala 
when working on the other issue,  

saveAsNewAPIHadoopFile throws java.lang.InstantiationException all the time

I checked the commit history of the file, it seems that the API exists for a 
long time, no one else found this? (that’s the reason I’m confusing)

Best,  

--  
Nan Zhu