Re: Unable to find proto buffer class error with RDD<protobuf>

2014-09-19 Thread Paul Wais
Well it looks like this is indeed a protobuf issue. Poked a little more with Kryo. Since protobuf messages are serializable, I tried just making Kryo use the JavaSerializer for my messages. The resulting stack trace made it look like protobuf GeneratedMessageLite is actually using the

Re: Unable to find proto buffer class error with RDD<protobuf>

2014-09-19 Thread Paul Wais
Derp, one caveat to my solution: I guess Spark doesn't use Kryo for Function serde :(
On Fri, Sep 19, 2014 at 12:44 AM, Paul Wais pw...@yelp.com wrote:
> Well it looks like this is indeed a protobuf issue. Poked a little more with Kryo. Since protobuf messages are serializable, I tried just

Unable to find proto buffer class error with RDD<protobuf>

2014-09-18 Thread Paul Wais
Dear List, I'm writing an application where I have RDDs of protobuf messages. When I run the app via bin/spark-submit with --master local --driver-class-path path/to/my/uber.jar, Spark is able to ser/deserialize the messages correctly. However, if I run WITHOUT --driver-class-path

Re: Unable to find proto buffer class error with RDD<protobuf>

2014-09-18 Thread Paul Wais
Well, it looks like Spark is just not loading my code into the driver/executors. E.g.:
List<String> foo = JavaRDD<MyMessage> bars.map(
    new Function<MyMessage, String>() {
      {
        System.err.println("classpath: " + System.getProperty("java.class.path"));
        CodeSource src =
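For anyone wanting to run the same kind of check, here is a minimal JDK-only sketch of a classpath probe. It is not the code from the message above (which is cut off in the archive); the class name `ClasspathProbe` and the helper `describe` are made up for illustration. It prints `java.class.path` and reports the code source and class loader for a given class, which is roughly what the snippet appears to be doing.

```java
import java.net.URL;
import java.security.CodeSource;

// Hypothetical helper, not part of Spark or of the original post.
final class ClasspathProbe {

    /** Describes where a class was loaded from and which loader loaded it. */
    static String describe(Class<?> type) {
        // The code source is null for classes loaded by the bootstrap loader.
        CodeSource src = type.getProtectionDomain().getCodeSource();
        URL location = (src == null) ? null : src.getLocation();
        // getClassLoader() returns null for the bootstrap loader.
        ClassLoader loader = type.getClassLoader();
        return type.getName()
                + " loaded from " + (location == null ? "<no code source>" : location)
                + " by " + (loader == null ? "<bootstrap loader>" : loader);
    }

    public static void main(String[] args) {
        System.err.println("classpath: " + System.getProperty("java.class.path"));
        System.err.println(describe(ClasspathProbe.class));
        System.err.println(describe(String.class));
    }
}
```

Calling `ClasspathProbe.describe(MyMessage.class)` from inside the mapped function (assuming `MyMessage` is the generated protobuf class) would show whether the class comes from the uber jar and which loader owns it on that JVM.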

Re: Unable to find proto buffer class error with RDD<protobuf>

2014-09-18 Thread Paul Wais
Ah, can one NOT create an RDD of any arbitrary Serializable type? It looks like I might be getting bitten by the same "java.io.ObjectInputStream uses root class loader only" bugs mentioned in:
* http://apache-spark-user-list.1001560.n3.nabble.com/java-lang-ClassNotFoundException-td3259.html
*
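As background on that class of bug: the stock `java.io.ObjectInputStream` picks the class loader for `resolveClass` from the call stack, not from the thread's context class loader, so classes visible only to a child loader can fail with `ClassNotFoundException`. A common workaround is to override `resolveClass`. The sketch below is a generic JDK-only illustration of that technique, not Spark's actual implementation; the names `LoaderAwareObjectInputStream` and `roundTrip` are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.ObjectStreamClass;

// Hypothetical class: an ObjectInputStream that resolves classes through
// an explicitly supplied ClassLoader instead of the default choice.
class LoaderAwareObjectInputStream extends ObjectInputStream {
    private final ClassLoader loader;

    LoaderAwareObjectInputStream(InputStream in, ClassLoader loader) throws IOException {
        super(in);
        this.loader = loader;
    }

    @Override
    protected Class<?> resolveClass(ObjectStreamClass desc)
            throws IOException, ClassNotFoundException {
        try {
            // Look the class up in the supplied loader without initializing it.
            return Class.forName(desc.getName(), false, loader);
        } catch (ClassNotFoundException e) {
            // Fall back to the default lookup, which also handles primitive types.
            return super.resolveClass(desc);
        }
    }

    /** Serializes a value and reads it back using the given loader. */
    static Object roundTrip(Object value, ClassLoader loader)
            throws IOException, ClassNotFoundException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(value);
        }
        try (ObjectInputStream in = new LoaderAwareObjectInputStream(
                new ByteArrayInputStream(bytes.toByteArray()), loader)) {
            return in.readObject();
        }
    }
}
```

Passing `Thread.currentThread().getContextClassLoader()` as the loader is the usual choice when the application jar is attached to a child loader. Whether this helps with protobuf messages specifically depends on how the generated classes perform their own class lookup during deserialization.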

Re: Unable to find proto buffer class error with RDD<protobuf>

2014-09-18 Thread Paul Wais
Hmm, would using Kryo help me here?
On Thursday, September 18, 2014, Paul Wais pw...@yelp.com wrote:
> Ah, can one NOT create an RDD of any arbitrary Serializable type? It looks like I might be getting bitten by the same "java.io.ObjectInputStream uses root class loader only" bugs mentioned

Re: Unable to find proto buffer class error with RDD<protobuf>

2014-09-18 Thread Paul Wais
It turns out Kryo doesn't play well with protobuf. Out of the box I see:
com.esotericsoftware.kryo.KryoException: java.lang.UnsupportedOperationException
Serialization trace:
extra_ (com.foo.bar.MyMessage)