Databricks Spark XML parsing exception while iterating

2017-07-10 Thread Amol Talap
Hi All, does anyone know a fix for the exception below? The XML parsing function works fine in a unit test, as you can see in the code below, but fails when used in an RDD.
new_xml: org.apache.spark.rdd.RDD[List[(String, String)]] = MapPartitionsRDD[119] at map at :57
17/07/10 08:29:54 ERROR Executor: Exception

Re: SparkSQL to read XML Blob data to create multiple rows

2017-07-08 Thread Amol Talap
Hi Zhang & All, thanks so much for your earlier response. I am trying to get to a final solution. We could parse the data successfully; however, I am getting a NullPointerException while mapping it back. Can you please advise on the findings below?
spark-shell --packages com.databricks:spark-xml_2.11:0.4.1