if the datasets hold objects of different classes, then you will have to convert both of them to rdd and then rename the columns befrore you call rdd1.subtract(rdd2)
On Thu, Oct 12, 2017 at 10:16 PM, Shashikant Kulkarni < shashikant.kulka...@gmail.com> wrote: > Hello, > > I have 2 datasets, Dataset<Class1> and other is Dataset<Class2>. I want > the list of records which are in Dataset<Class1> but not in > Dataset<Class2>. How can I do this in Apache Spark using Java Connector? I > am using Apache Spark 2.2.0 > > Thank you > --------------------------------------------------------------------- > To unsubscribe e-mail: user-unsubscr...@spark.apache.org > > -- I.R