Github user srowen commented on the issue:
https://github.com/apache/spark/pull/15511
I think we should close this. I don't believe it's worth a new API method.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your proj
Github user codlife commented on the issue:
https://github.com/apache/spark/pull/15511
@srowen , you are right! I propose this method just to make it more user
friendly, With this method, user can load a standard json file directly.
---
If your project is set up for it, you can repl
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/15511
OK, I think in both cases "standard" JSON is read, and in both cases, each
record is a JSON document. These aren't different cases. If you mean to read
small JSON files as records, you just use whole
Github user codlife commented on the issue:
https://github.com/apache/spark/pull/15511
Compile is ok, but when we call show(), we will get a _corrupt_record,
besides when we call select on this df, we will get an exception.
---
If your project is set up for it, you can reply to this
Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/15511
BTW, I guess per-line JSON also complies a standard -
https://tools.ietf.org/html/rfc7159#section-4. We should add a test, fix the
title to summarise what the PR proposes and fill the PR descrip
Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/15511
I guess it'd be nicer if this PR resembles
https://github.com/apache/spark/pull/14151
The suggested change is to read each JSON object per file which I guess we
can share some codes in the P
Github user codlife commented on the issue:
https://github.com/apache/spark/pull/15511
In standard json file, multi lines json object is legal, but currently, we
can just load single-line json obejct directly.
---
If your project is set up for it, you can reply to this email and hav
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/15511
I don't quite understand this -- what does "standard" mean? This still
doesn't load a 'standard JSON' file.
---
If your project is set up for it, you can reply to this email and have your
reply app
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/15511
Can one of the admins verify this patch?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feat