One approach would be to use saveAsNewAPIHadoopFile and specify a JSON
output format.
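A rough sketch of that first approach, for illustration only. Since no particular JSON output format class is named here, this assumes Hadoop's stock TextOutputFormat and builds the JSON text by hand. The object name, the toJsonLine helper, the local master and the "output-hadoop" path are all made up for the example. On Spark versions before 1.3 you would also need import org.apache.spark.SparkContext._ to get the pair-RDD methods.

```scala
import org.apache.hadoop.io.{NullWritable, Text}
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat
import org.apache.spark.{SparkConf, SparkContext}

object SaveJsonViaHadoopApi {
  // Hypothetical helper: hand-rolled JSON text for a single integer id
  def toJsonLine(id: Int): String = s"""{"id":$id}"""

  def main(args: Array[String]): Unit = {
    // Local master is an assumption so the sketch can run without a cluster
    val conf = new SparkConf().setAppName("save-json").setMaster("local[*]")
    val sc = new SparkContext(conf)

    // saveAsNewAPIHadoopFile works on key/value RDDs; with a NullWritable key,
    // TextOutputFormat writes only the value on each line
    val pairs = sc.parallelize(1 to 100)
      .map(x => (NullWritable.get(), new Text(toJsonLine(x))))

    pairs.saveAsNewAPIHadoopFile(
      "output-hadoop", // assumed path, must not already exist
      classOf[NullWritable],
      classOf[Text],
      classOf[TextOutputFormat[NullWritable, Text]])

    sc.stop()
  }
}
```

A real JSON OutputFormat, if you have one, would be passed in place of TextOutputFormat in the last argument.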
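A simpler one would be something like:

    // Assuming JSONObject here is the one from the Scala standard library
    import scala.util.parsing.json.JSONObject

    val rdd = sc.parallelize(1 to 100)
    val json = rdd.map(x => {
      val m: Map[String, Int] = Map("id" -> x)
      new JSONObject(m)
    })
    // Each JSONObject is written out as one line of text via toString
    json.saveAsTextFile("output")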
Thanks
Best
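We have RDD.saveAsTextFile and RDD.saveAsObjectFile for saving the output
to any location specified. The parameter to provide is the path of the
storage location. The number of output files follows the number of
partitions of the RDD, which can be changed with repartition or coalesce
before saving.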
To give an HDFS path, we use the following format:
/user/user-name/directory-to-store/
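As a rough sketch of the two save calls with a path in that format: the object name, the outputPath helper and the directory names below are invented for the example, and it assumes the job is launched with spark-submit on a cluster whose default filesystem is HDFS. Both methods take only the path, so the partition count is set with coalesce beforehand.

```scala
import org.apache.spark.{SparkConf, SparkContext}

object SaveWithPartitions {
  // Hypothetical helper that builds a path in the /user/<name>/<dir>/ layout
  def outputPath(user: String, dir: String): String = s"/user/$user/$dir/"

  def main(args: Array[String]): Unit = {
    // No master is set here; it is assumed to be supplied by spark-submit
    val sc = new SparkContext(new SparkConf().setAppName("save-demo"))
    val rdd = sc.parallelize(1 to 100)

    // coalesce(4) leaves at most 4 partitions, so at most 4 part-files per directory
    val compact = rdd.coalesce(4)

    // Plain text, one element per line
    compact.saveAsTextFile(outputPath("user-name", "text-out"))
    // Serialized objects stored in a SequenceFile
    compact.saveAsObjectFile(outputPath("user-name", "object-out"))

    sc.stop()
  }
}
```

Both calls fail if the target directory already exists, so pick a fresh directory for each run.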
On Tue, Nov 11, 2014 at 6:28 PM,