Re: package for data quality in Spark 1.5.2

2016-05-05 Thread Mich Talebzadeh
for something similar to the above solution. > -- Forwarded message -- > From: "Divya Gehlot" <divya.htco...@gmail.com> > Date: May 5, 2016 6:51 PM > Subject: package for data quality in Spark 1.5.2 > To: "user @spark" <user@spark.apache.org>

Fwd: package for data quality in Spark 1.5.2

2016-05-05 Thread Divya Gehlot
package for data quality in Spark 1.5.2 To: "user @spark" <user@spark.apache.org> Cc: Hi, Is there any package or project in Spark/Scala that supports data quality checks? For instance, checking null values or foreign key constraints. I would really appreciate it if somebody has already done

Re: package for data quality in Spark 1.5.2

2016-05-05 Thread Mich Talebzadeh
Hi, Spark is a query tool. It stores data in HDFS, a Hive database, or anything else, but it does not have its own generic database. Null values and foreign key constraints belong to the domain of databases. What exactly is the nature of your requirements? Do you want to use Spark tool to look at the

package for data quality in Spark 1.5.2

2016-05-05 Thread Divya Gehlot
Hi, Is there any package or project in Spark/Scala that supports data quality checks? For instance, checking null values or foreign key constraints. I would really appreciate it if somebody who has already done this is happy to share it, or knows of any open source package. Thanks, Divya
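
For illustration only, here is a minimal sketch of how the two checks named in the question (null values and foreign-key style references) might be written directly against the Spark 1.5 DataFrame API. It is not an existing package. The object name, helper names (nullCount, orphanRows), table contents and column names are all hypothetical, and it assumes a local SparkContext.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.{DataFrame, SQLContext}

// Hypothetical helper object; names are illustrative, not from any library.
object DataQualityChecks {

  // Number of rows in which the given column is null.
  def nullCount(df: DataFrame, column: String): Long =
    df.filter(df(column).isNull).count()

  // Child rows whose non-null foreign key has no matching parent key.
  // Spark does not enforce such constraints, so this emulates the check
  // with a left outer join and keeps the rows where the parent side is missing.
  def orphanRows(child: DataFrame, fk: String,
                 parent: DataFrame, pk: String): DataFrame =
    child
      .join(parent, child(fk) === parent(pk), "left_outer")
      .filter(child(fk).isNotNull && parent(pk).isNull)
      .select(child.columns.map(c => child(c)): _*)

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("dq-sketch").setMaster("local[2]")
    val sc = new SparkContext(conf)
    val sqlContext = new SQLContext(sc)
    import sqlContext.implicits._

    // Made-up sample data: order 11 points at a customer that does not
    // exist, and order 12 has no customer reference at all.
    val customers = Seq((1, "alice"), (2, "bob")).toDF("id", "name")
    val orderRows: Seq[(Int, Option[Int])] =
      Seq((10, Some(1)), (11, Some(3)), (12, None))
    val orders = orderRows.toDF("order_id", "customer_id")

    println("null customer_id rows: " + nullCount(orders, "customer_id"))
    println("orders without a matching customer:")
    orphanRows(orders, "customer_id", customers, "id").show()

    sc.stop()
  }
}
```

The left outer join is used because it is available in Spark 1.5; whether null foreign keys should count as violations is a design choice, and this sketch reports them separately through nullCount.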