Transformation question

Eduardo Wed, 27 Apr 2016 11:50:32 -0700

Is there a way to write a transformation that for each entry of an RDD uses
certain other values of another RDD? As an example, image you have a RDD of
entries to predict a certain label. In a second RDD, you have historical
data. So for each entry in the first RDD, you want to find similar entries
in the second RDD and take, let's say, the average. Does that fit the Spark
model? Is there any alternative?


Thanks in advance

Transformation question

Reply via email to