XML Processing using Spark SQL

2016-05-12 Thread Arunkumar Chandrasekar
Hello,

Greetings.

I'm trying to process a xml file exported from Health Kit application using
Spark SQL for learning purpose. The sample record data is like the below:

 

 .

I want to have the column name of my table as the field value like type,
sourceName, sourceVersion and the row entries as their respective values
like HKQuantityTypeIdentifierStepCount, Vizhi, 9.3.1,..

I took a look at the Spark-XML ,
but didn't get any information in my case (my xml is not well formed with
the tags). Is there any other option to convert the record that I have
mentioned above into a schema format for playing with Spark SQL?

Thanks in Advance.

*Thank You*,
Arun Chandrasekar
chan.arunku...@gmail.com


Re: Spark replacing Hadoop

2016-04-14 Thread Arunkumar Chandrasekar
Hello,

I would stand in side of Spark. Spark provides numerous add-ons like Spark SQL, 
Spark MLIB that are possibly something hard to set it up with Map Reduce. 

Thank You.



> On Apr 15, 2016, at 1:16 AM, Ashok Kumar  wrote:
> 
> Hello,
> 
> Well, Sounds like Andy is implying that Spark can replace Hadoop whereas Mich 
> still believes that HDFS is a keeper?
> 
> thanks
> 
> 
> 
> 
> On Thursday, 14 April 2016, 20:40, David Newberger 
>  wrote:
> 
> 
> Can we assume your question is “Will Spark replace Hadoop MapReduce?” or do 
> you literally mean replacing the whole of Hadoop?
>  
> David
>  
> From: Ashok Kumar [mailto:ashok34...@yahoo.com.INVALID] 
> Sent: Thursday, April 14, 2016 2:13 PM
> To: User
> Subject: Spark replacing Hadoop
>  
> Hi,
>  
> I hear that some saying that Hadoop is getting old and out of date and will 
> be replaced by Spark!
>  
> Does this make sense and if so how accurate is it?
>  
> Best
> 
>