On 14 Mar 2017 4:19 p.m., Gaurav Pandya wrote:
Thanks a lot Michal & Ofir for your insights.
To Ofir - I have not yet finalized my spark streaming code. it is still work in
progress. Now we have Structured streaming available, so thought to re write it
to gain maximum
Hi All,
I have a Hive Table which contains around 500 million photos(Profile picture
of Users) stored as hex string and total size of the table is 5TB. I'm trying
to make a solution where images can be retrieved in real-time.
Current Solution, Resize the images, index it along the user
Hi
Take a look into https://github.com/lucidworks/spark-solr . this support
authentication with kerberized solr. Unfortunately this implementation has
support from solr 5.x+. and CDH has Solr 4.x. One option is to use Apache Solr
6.X with CDH.
Regards,
Adline
Sent from
Hi,
Use function rowNumber instead of row_number
df1.withColumn("row_number", rowNumber.over(w));
Regards,
Adline
From: saurabh3d [saurabh.s.du...@oracle.com]
Sent: 01 September 2016 13:16
To: user@spark.apache.org
Subject: Window Functions with