unsubscribe

2019-06-23 Thread Tushar Marne

Re: RE - Apache Spark compatibility with Hadoop 2.9.2

2019-06-23 Thread Mark Bidewell
Note that we selected Spark 2.2.2 because we were trying to align with DSE Search 6. A new version might have fewer issues. On Sun, Jun 23, 2019 at 10:56 AM Bipul kumar wrote: > Hi Mark, > > Thanks for your wonderful suggestion. > I look forward to try that version. > > Respectfully, >

Re: RE - Apache Spark compatibility with Hadoop 2.9.2

2019-06-23 Thread Bipul kumar
Hi Mark, Thanks for your wonderful suggestion. I look forward to try that version. Respectfully, Bipul PUBLIC KEY 97F0 2E08 7DE7 D538 BDFA B708 86D8 BE27 8196 D466 ** Please excuse brevity and typos. ** On Sun, Jun 23, 2019 at 8:06 PM

Re: RE - Apache Spark compatibility with Hadoop 2.9.2

2019-06-23 Thread Mark Bidewell
I have done a setup with Hadoop 2.9.2 and Spark 2.2.2. Apache Zeppelin is fine but some our internally developed apps need work on dependencies On Sun, Jun 23, 2019, 07:50 Bipul kumar wrote: > Hello People ! > > I am new to Apache Spark , and just started learning it. > Few questions i have in

RE - Apache Spark compatibility with Hadoop 2.9.2

2019-06-23 Thread Bipul kumar
Hello People ! I am new to Apache Spark , and just started learning it. Few questions i have in my mind which i am seeking here for 1 . Is there any compatibility with Apache Spark while using Hadoop.? Let say i am running Hadoop 2.9.2, which Apache Spark should i use? 2. As mentioned , i

Re: Structured Streaming foreach function

2019-06-23 Thread Magnus Nilsson
Row is a generic ordered collection of fields that most likely contain a Schema of StructType. You need to keep track of the datatypes of the fields yourself. If you want compile time safety of datatypes (and intellisense support) you need to use RDD:s or the Dataset[T] api. Dataset[T] might

Structured Streaming foreach function

2019-06-23 Thread RanXin
I use spark 2.4.3, python to build a structured streaming. May I know the data type of the parameter "row" in process_row function? The following codes is how the official programming guide instruct us to deal with foreach function: def process_row(row): # Write row to storage