Hi Charles, The definition of object from www.json.org: An *object* is an unordered set of name/value pairs. An object begins with { (left brace) and ends with } (right brace). Each name is followed by : (colon) and the name/value pairs are separated by , (comma).
Its a pretty much OOPS paradigm , isn't it? Regards, Umesh On Thu, Mar 31, 2016 at 2:34 PM, charles li <charles.up...@gmail.com> wrote: > hi, UMESH, I think you've misunderstood the json definition. > > there is only one object in a json file: > > > for the file, people.json, as bellow: > > > -------------------------------------------------------------------------------------------- > > {"name":"Yin", "address":{"city":"Columbus","state":"Ohio"}} > {"name":"Michael", "address":{"city":null, "state":"California"}} > > > ----------------------------------------------------------------------------------------------- > > it does have two valid format: > > 1. > > > -------------------------------------------------------------------------------------------- > > [ {"name":"Yin", "address":{"city":"Columbus","state":"Ohio"}}, > {"name":"Michael", "address":{"city":null, "state":"California"}} > ] > > > ----------------------------------------------------------------------------------------------- > > 2. > > > -------------------------------------------------------------------------------------------- > > {"name": ["Yin", "Michael"], > "address":[ {"city":"Columbus","state":"Ohio"}, > {"city":null, "state":"California"} ] > } > > ----------------------------------------------------------------------------------------------- > > > > On Thu, Mar 31, 2016 at 4:53 PM, UMESH CHAUDHARY <umesh9...@gmail.com> > wrote: > >> Hi, >> Look at below image which is from json.org : >> >> [image: Inline image 1] >> >> The above image describes the object formulation of below JSON: >> >> Object 1=> {"name":"Yin", "address":{"city":"Columbus","state":"Ohio"}} >> Object=> {"name":"Michael", "address":{"city":null, "state":"California"}} >> >> >> Note that "address" is also an object. >> >> >> >> On Thu, Mar 31, 2016 at 1:53 PM, charles li <charles.up...@gmail.com> >> wrote: >> >>> as this post says, that in spark, we can load a json file in this way >>> bellow: >>> >>> *post* : >>> https://databricks.com/blog/2015/02/02/an-introduction-to-json-support-in-spark-sql.html >>> >>> >>> >>> ----------------------------------------------------------------------------------------------- >>> sqlContext.jsonFile(file_path) >>> or >>> sqlContext.read.json(file_path) >>> >>> ----------------------------------------------------------------------------------------------- >>> >>> >>> and the *json file format* looks like bellow, say *people.json* >>> >>> >>> --------------------------------------------------------------------------------------------{"name":"Yin", >>> "address":{"city":"Columbus","state":"Ohio"}} >>> {"name":"Michael", "address":{"city":null, "state":"California"}} >>> >>> ----------------------------------------------------------------------------------------------- >>> >>> >>> and here comes my *problems*: >>> >>> Is that the *standard json format*? according to http://www.json.org/ , >>> I don't think so. it's just a *collection of records* [ a dict ], not a >>> valid json format. as the json official doc, the standard json format of >>> people.json should be : >>> >>> >>> --------------------------------------------------------------------------------------------{"name": >>> ["Yin", "Michael"], >>> "address":[ {"city":"Columbus","state":"Ohio"}, >>> {"city":null, "state":"California"} ] >>> } >>> >>> ----------------------------------------------------------------------------------------------- >>> >>> So, why we define the json format as a collection of records in spark, I >>> mean, it will lead to some unconvenient, for if we had a large standard >>> json file, we need to firstly format it to make it correctly readable in >>> spark, which will low-efficiency, time-consuming, un-compatible and >>> space-consuming. >>> >>> >>> great thanks, >>> >>> >>> >>> >>> >>> >>> -- >>> *--------------------------------------* >>> a spark lover, a quant, a developer and a good man. >>> >>> http://github.com/litaotao >>> >> >> > > > -- > *--------------------------------------* > a spark lover, a quant, a developer and a good man. > > http://github.com/litaotao >