Re: removing first record from RDD[String]

2014-12-23 Thread Jörg Schad
Hi,

maybe the drop function is helpful for you (this is probably more than you need, but still an interesting read):
http://erikerlandson.github.io/blog/2014/07/27/some-implications-of-supporting-the-scala-drop-method-for-spark-rdds/

Joerg

On Tue, Dec 23, 2014 at 5:45 PM, Hao Ren
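For the narrower case in the subject line (dropping only the first record), a lighter option than a general drop is to skip one element in partition 0. The following is a minimal sketch, not the approach from the linked post: the object name `DropFirstRecord`, the helper `dropFirst`, and the sample data are made up for illustration, and it assumes the first record sits in partition 0, which is typically the case for a header line read with `textFile` but is not guaranteed if partition 0 is empty.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

object DropFirstRecord {
  // Hypothetical helper: drops one element from partition 0 only.
  // All other partitions pass through untouched, so no shuffle is needed.
  def dropFirst(rdd: RDD[String]): RDD[String] =
    rdd.mapPartitionsWithIndex { (index, iter) =>
      if (index == 0) iter.drop(1) else iter
    }

  def main(args: Array[String]): Unit = {
    // Local master and app name are illustrative assumptions.
    val conf = new SparkConf().setAppName("drop-first-record").setMaster("local[2]")
    val sc = new SparkContext(conf)
    try {
      // Sample data standing in for a file with a header line.
      val lines = sc.parallelize(Seq("header", "a", "b"), 2)
      dropFirst(lines).collect().foreach(println)
    } finally {
      sc.stop()
    }
  }
}
```

Because the work stays inside `mapPartitionsWithIndex`, the result is still lazy and nothing runs until an action such as `collect()` is called.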

SchemaRDD compute function

2014-11-26 Thread Jörg Schad
Hi, I have a short question regarding the compute() of a SchemaRDD. For a SchemaRDD, the actual queryExecution seems to be triggered via collect(), while compute() only triggers the compute() of the parent and copies the data (please correct me if I am wrong!). Is this compute() triggered at all

Re: how to force GraphX to execute transformation

2014-11-26 Thread Jörg Schad
Hi,

can't you just use graph.partitionBy(..).collect()?

Cheers,
Joerg

On Wed, Nov 26, 2014 at 2:25 PM, Hlib Mykhailenko hlib.mykhaile...@inria.fr wrote:
Hello, I work with GraphX. When I call graph.partitionBy(..) nothing happens because, as I understand it, all transformations are lazy
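Since the issue in this thread is laziness, one way to force the repartitioning is to run an action on an RDD that belongs to the repartitioned graph. The sketch below is only an illustration under stated assumptions: the object name `ForcePartitionBy`, the helper `forcePartition`, the tiny triangle graph, and the choice of `PartitionStrategy.EdgePartition2D` are all made up here, and it calls `count()` on `graph.edges` rather than calling an action on the graph object itself.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.graphx.{Edge, Graph, PartitionStrategy}

object ForcePartitionBy {
  // Hypothetical helper: builds a small graph, repartitions it, and
  // forces evaluation by counting the edges of the repartitioned graph.
  def forcePartition(sc: SparkContext): Long = {
    // Illustrative triangle graph with Int edge attributes.
    val edges = sc.parallelize(Seq(Edge(1L, 2L, 1), Edge(2L, 3L, 1), Edge(3L, 1L, 1)))
    val graph = Graph.fromEdges(edges, defaultValue = 0)

    // partitionBy is lazy; cache() keeps the result once it is computed.
    val repartitioned = graph.partitionBy(PartitionStrategy.EdgePartition2D).cache()

    // count() is an action, so it triggers the edge repartitioning.
    repartitioned.edges.count()
  }

  def main(args: Array[String]): Unit = {
    // Local master and app name are illustrative assumptions.
    val conf = new SparkConf().setAppName("force-partition-by").setMaster("local[2]")
    val sc = new SparkContext(conf)
    try {
      println(forcePartition(sc))
    } finally {
      sc.stop()
    }
  }
}
```

Caching before the action means later operations on `repartitioned` can reuse the materialized edges instead of repeating the repartitioning.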