Re: Structured Streaming - Can I start using it?

2017-03-14 Thread Adline Dsilva
On 14 Mar 2017 4:19 p.m., Gaurav Pandya wrote: Thanks a lot Michal & Ofir for your insights. To Ofir - I have not yet finalized my spark streaming code. it is still work in progress. Now we have Structured streaming available, so thought to re write it to gain maximum

Resizing Image with Scrimage in Spark

2016-10-17 Thread Adline Dsilva
Hi All, I have a Hive Table which contains around 500 million photos(Profile picture of Users) stored as hex string and total size of the table is 5TB. I'm trying to make a solution where images can be retrieved in real-time. Current Solution, Resize the images, index it along the user

RE: Anyone got a good solid example of integrating Spark and Solr

2016-09-14 Thread Adline Dsilva
Hi Take a look into https://github.com/lucidworks/spark-solr . this support authentication with kerberized solr. Unfortunately this implementation has support from solr 5.x+. and CDH has Solr 4.x. One option is to use Apache Solr 6.X with CDH. Regards, Adline Sent from

RE: Window Functions with SQLContext

2016-08-31 Thread Adline Dsilva
Hi, Use function rowNumber instead of row_number df1.withColumn("row_number", rowNumber.over(w)); Regards, Adline From: saurabh3d [saurabh.s.du...@oracle.com] Sent: 01 September 2016 13:16 To: user@spark.apache.org Subject: Window Functions with