Hi, I have two datasets that i load from HDFS file system. After loading the files, i cache the datasets.
I have a requirement to do join on these datasets (LEFT/RIGHT INNER/OUTER JOIN) and apply some filter conditions and then run multiple queries on this joined filtered dataset. I have created JavaPairRDDs for the datasets, doing the join, applying filter and caching the dataset. After running the queries, i need to unpersist this dataset. Currently i don't see any unpersist api for JavaPairRDD. Is there any way to unpersist the JavaPairRDD? Thanks, Sasmita