[GitHub] [arrow-datafusion] alamb commented on issue #103: [Rust] Add support for JSON data sources

GitBox Mon, 26 Apr 2021 06:20:33 -0700


alamb commented on issue #103:
URL: 
https://github.com/apache/arrow-datafusion/issues/103#issuecomment-826829341



   Comment from Neville Dipale(nevi_me) @ 2020-11-27T22:59:54.476+0000:
   <pre>[~andygrove] why is it not practical to parse the JSON files first to 
get the schema?</pre>
   
   Comment from Andy Grove(andygrove) @ 2021-02-24T01:51:01.855+0000:
   <pre>Well, we could add schema inference but it could be slow for large JSON
   files especially where the schema varies between objects and where there
   are nested structs with varying schemas.
   
   Maybe there are two different stories here.
   
   1) Support JSON using schema inference
   
   2) Support JSON in a schemaless way. For example, if I run "SELECT a, b,
   c.d.e.f ..." I would expect to get NULLs for any of these attributes that
   do not exist on any particular row.
   
   On Fri, Nov 27, 2020 at 4:00 PM Neville Dipale (Jira) <j...@apache.org>
   
   </pre>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[GitHub] [arrow-datafusion] alamb commented on issue #103: [Rust] Add support for JSON data sources

Reply via email to