alamb commented on issue #103: URL: https://github.com/apache/arrow-datafusion/issues/103#issuecomment-826829341
Comment from Neville Dipale(nevi_me) @ 2020-11-27T22:59:54.476+0000:
<pre>[~andygrove] why is it not practical to parse the JSON files first to get the schema?</pre>

Comment from Andy Grove(andygrove) @ 2021-02-24T01:51:01.855+0000:
<pre>Well, we could add schema inference but it could be slow for large JSON files especially where the schema varies between objects and where there are nested structs with varying schemas.

Maybe there are two different stories here.

1) Support JSON using schema inference
2) Support JSON in a schemaless way. For example, if I run "SELECT a, b, c.d.e.f ..." I would expect to get NULLs for any of these attributes that do not exist on any particular row.

On Fri, Nov 27, 2020 at 4:00 PM Neville Dipale (Jira) <j...@apache.org> </pre>

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
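To make the two stories above concrete, here is a minimal sketch in plain Python. It is not DataFusion code. It assumes newline-delimited JSON input, and the names `infer_schema`, `get_path`, `select` and the sample `LINES` are hypothetical, chosen only for illustration. Story 1 has to visit every record before a query can run; story 2 resolves each dotted path per row and yields a null when any step is missing.

```python
import json

# Hypothetical sample input: one JSON object per line, with a varying shape.
LINES = [
    '{"a": 1, "b": "x", "c": {"d": {"e": {"f": 10}}}}',
    '{"a": 2, "c": {"d": 5}}',
    '{"b": "y"}',
]


def infer_schema(lines):
    """Story 1: scan every record and collect the type names seen at each leaf path."""
    schema = {}

    def visit(value, prefix):
        if isinstance(value, dict):
            # Recurse into nested objects, extending the dotted path.
            for key, child in value.items():
                visit(child, f"{prefix}.{key}" if prefix else key)
        else:
            schema.setdefault(prefix, set()).add(type(value).__name__)

    for line in lines:
        visit(json.loads(line), "")
    return schema


def get_path(record, path):
    """Story 2: return the value at a dotted path, or None if any step is missing."""
    current = record
    for key in path.split("."):
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def select(lines, columns):
    """Evaluate something like SELECT a, b, c.d.e.f without a schema, one row at a time."""
    return [
        tuple(get_path(json.loads(line), column) for column in columns)
        for line in lines
    ]
```

With the sample above, `select(LINES, ["a", "b", "c.d.e.f"])` returns `[(1, "x", 10), (2, None, None), (None, "y", None)]`. `infer_schema(LINES)` reports both `c.d` (an integer in the second record) and `c.d.e.f` (nested under a struct in the first), which is the kind of per-object variation that makes inference costly and ambiguous.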