Rdd union will result in
1 2
3 4
5 6
7 8
9 10
11 12
What you are trying to do is join.
There must be a logic/key to perform join operation.
I think in your case you want the order (index) to be the joining key here.
RDD is a distributed data structure and is not apt for your
Hi,
I am using spark 1.2.1
I am using thrift server to query my data.
while executing query CACHE TABLE tablename
Fails with exception
Error: org.apache.spark.SparkException: Job aborted due to stage failure:
Task 0 in stage 10.0 failed 4 times, most recent failure: Lost task 0.3 in
stage