Is there a way to write a transformation that for each entry of an RDD uses
certain other values of another RDD? As an example, image you have a RDD of
entries to predict a certain label. In a second RDD, you have historical
data. So for each entry in the first RDD, you want to find similar entries
in the second RDD and take, let's say, the average. Does that fit the Spark
model? Is there any alternative?

Thanks in advance

Reply via email to