You can use spark testing base's rdd comparators. Create 2 different dataframes from these 2 hive tables. Convert them to rdd and use spark-testing-base compareRDD.
Here is an example for rdd comparison: https://github.com/holdenk/spark-testing-base/wiki/RDDComparisons On Mon, Jan 30, 2017 at 9:07 PM, Alex <siri8...@gmail.com> wrote: > Hi Team, > > how to compare two avro format hive tables if there is same data in it > > if i give limit 5 its giving different results > > > > > > -- Thanks Deepak www.bigdatabig.com www.keosha.net