Re: Best way to read XML data from RDD

2016-08-22 Thread Diwakar Dhanuskodi
; >> Hi Darin, >> >> Ate you using this utility to parse single line XML? >> >> >> Sent from Samsung Mobile. >> >> >> Original message >> From: Darin McBeath <ddmcbe...@yahoo.com> >> Date:21/08/2016 17:

Re: Best way to read XML data from RDD

2016-08-22 Thread Darin McBeath
lt;ddmcbe...@yahoo.com>; Hyukjin Kwon <gurwls...@gmail.com>; Jörn Franke <jornfra...@gmail.com> Cc: Felix Cheung <felixcheun...@hotmail.com>; user <user@spark.apache.org> Sent: Monday, August 22, 2016 6:53 AM Subject: Re: Best way to read XML data from RDD Hi Darin,  Ate  

RE: Best way to read XML data from RDD

2016-08-22 Thread Puneet Tripathi
out(50) ssc.stop() Hope this is helpful. Puneet From: Hyukjin Kwon [mailto:gurwls...@gmail.com] Sent: Monday, August 22, 2016 4:34 PM To: Diwakar Dhanuskodi Cc: Darin McBeath; Jörn Franke; Felix Cheung; user Subject: Re: Best way to read XML data from RDD Do you mind share your codes and sample

Re: Best way to read XML data from RDD

2016-08-22 Thread Hyukjin Kwon
kar.dhanusk...@gmail.com>, Felix Cheung < > felixcheun...@hotmail.com>, user <user@spark.apache.org> > Subject: Re: Best way to read XML data from RDD > > Another option would be to look at spark-xml-utils. We use this > extensively in the manipulation of our XML con

Re: Best way to read XML data from RDD

2016-08-22 Thread Diwakar Dhanuskodi
.@gmail.com> Cc: Diwakar Dhanuskodi <diwakar.dhanusk...@gmail.com>, Felix Cheung <felixcheun...@hotmail.com>, user <user@spark.apache.org> Subject: Re: Best way to read XML data from RDD Another option would be to look at spark-xml-utils. We use this extensively in the man

Re: Best way to read XML data from RDD

2016-08-22 Thread Diwakar Dhanuskodi
(GMT+05:30) To: Diwakar Dhanuskodi <diwakar.dhanusk...@gmail.com> Cc: Felix Cheung <felixcheun...@hotmail.com>, user <user@spark.apache.org> Subject: Re: Best way to read XML data from RDD I fear the issue is that this will create and destroy a XML parser object 2

Re: Best way to read XML data from RDD

2016-08-22 Thread Diwakar Dhanuskodi
: Jörn Franke <jornfra...@gmail.com> Cc: Diwakar Dhanuskodi <diwakar.dhanusk...@gmail.com>, Felix Cheung <felixcheun...@hotmail.com>, user <user@spark.apache.org> Subject: Re: Best way to read XML data from RDD Hi Diwakar, Spark XML library can take RDD as sourc

Re: Best way to read XML data from RDD

2016-08-21 Thread Darin McBeath
gmail.com> Cc: Diwakar Dhanuskodi <diwakar.dhanusk...@gmail.com>; Felix Cheung <felixcheun...@hotmail.com>; user <user@spark.apache.org> Sent: Sunday, August 21, 2016 6:10 AM Subject: Re: Best way to read XML data from RDD Hi Diwakar, Spark XML library can take RD

Re: Best way to read XML data from RDD

2016-08-21 Thread Hyukjin Kwon
t; > Date:20/08/2016 09:49 (GMT+05:30) > To: Diwakar Dhanuskodi <diwakar.dhanusk...@gmail.com>, user < > user@spark.apache.org> > Cc: > Subject: Re: Best way to read XML data from RDD > > Have you tried > > https://github.com/databricks/spark-xml > ? >

Re: Best way to read XML data from RDD

2016-08-20 Thread Jörn Franke
iwakar Dhanuskodi <diwakar.dhanusk...@gmail.com>, user > <user@spark.apache.org> > Cc: > Subject: Re: Best way to read XML data from RDD > > Have you tried > > https://github.com/databricks/spark-xml > ? > > > > > On Fri, Aug 19, 2016 at 1:

Re: Best way to read XML data from RDD

2016-08-19 Thread Felix Cheung
way to read XML data from RDD To: Felix Cheung <felixcheun...@hotmail.com<mailto:felixcheun...@hotmail.com>>, user <user@spark.apache.org<mailto:user@spark.apache.org>> Yes . It accepts a xml file as source but not RDD. The XML data embedded inside json is streamed

Re: Best way to read XML data from RDD

2016-08-19 Thread Diwakar Dhanuskodi
user <user@spark.apache.org> Cc: Subject: Re: Best way to read XML data from RDD Have you tried https://github.com/databricks/spark-xml ? On Fri, Aug 19, 2016 at 1:07 PM -0700, "Diwakar Dhanuskodi" <diwakar.dhanusk...@gmail.com> wrote: Hi, There is a RDD with json

Re: Best way to read XML data from RDD

2016-08-19 Thread Felix Cheung
Have you tried https://github.com/databricks/spark-xml ? On Fri, Aug 19, 2016 at 1:07 PM -0700, "Diwakar Dhanuskodi" > wrote: Hi, There is a RDD with json data. I could read json data using rdd.read.json . The json data has

Best way to read XML data from RDD

2016-08-19 Thread Diwakar Dhanuskodi
Hi, There is a RDD with json data. I could read json data using rdd.read.json . The json data has XML data in couple of key-value paris. Which is the best method to read and parse XML from rdd. Is there any specific xml libraries for spark. Could anyone help on this. Thanks.