[
https://issues.apache.org/jira/browse/DRILL-19?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13550629#comment-13550629
]
Jacques Nadeau commented on DRILL-19:
-------------------------------------
Nulls:
I'm generally inclined to drop nulls. I can imagine a few scenarios where this
might be an issue. However, by and large I think that could be handled
case-by-case in the scanner (e.g. an option for a particular scanner such as
"encode-nulls-as").
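
To make the null handling above concrete, here is a minimal sketch of a drop-by-default policy with an optional replacement value. The class, the method and the Optional parameter standing in for "encode-nulls-as" are all hypothetical names for illustration, not existing Drill code:

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch only; none of these names are Drill APIs.
final class NullHandlingSketch {

    // Copies a parsed record. Null-valued fields are dropped unless a
    // replacement value (the assumed "encode-nulls-as" option) is supplied.
    static Map<String, Object> applyNullPolicy(Map<String, Object> record,
                                               Optional<Object> encodeNullsAs) {
        Map<String, Object> out = new LinkedHashMap<>();
        for (Map.Entry<String, Object> e : record.entrySet()) {
            if (e.getValue() != null) {
                out.put(e.getKey(), e.getValue());
            } else if (encodeNullsAs.isPresent()) {
                // Scanner was configured to keep nulls under a sentinel value.
                out.put(e.getKey(), encodeNullsAs.get());
            }
            // Otherwise: default behaviour, the null field is dropped.
        }
        return out;
    }
}
```

With Optional.empty() a record such as {a=1, b=null} comes back with only "a"; with a replacement value configured, "b" is kept and carries that value instead.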
Index number:
The array should probably carry a subtype value which is the type shared by all
array values. If the array is heterogeneous, then the array subtype should be
latebind and then the individual values should be latebind encoded (schema in
place). In your case that means that the schema should be something like
Field{type=ARRAY, id=2, fullFieldName="test2", subType="INTEGER"}.
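
As a rough illustration of the subtype rule above, the sketch below picks one subtype when every element agrees and falls back to latebind otherwise. Everything here is assumed for the example: the class and method names, the type names other than INTEGER, and the choice to treat an empty array as latebind. It is not the actual scanner code:

```java
import java.util.List;

// Hypothetical sketch only; names and type labels are assumptions.
final class ArraySubtypeSketch {

    static final String LATEBIND = "LATEBIND";

    // Maps a parsed JSON scalar to an assumed type name.
    static String typeOf(Object v) {
        if (v instanceof Integer || v instanceof Long) return "INTEGER";
        if (v instanceof Float || v instanceof Double) return "FLOAT";
        if (v instanceof Boolean) return "BOOLEAN";
        if (v instanceof String) return "STRING";
        return LATEBIND; // nested or unknown values are left to late binding
    }

    // Returns the common element type, or LATEBIND when the array is
    // heterogeneous (or empty, which is an assumption of this sketch).
    static String inferSubType(List<?> values) {
        String subType = null;
        for (Object v : values) {
            String t = typeOf(v);
            if (subType == null) {
                subType = t;
            } else if (!subType.equals(t)) {
                return LATEBIND;
            }
        }
        return subType == null ? LATEBIND : subType;
    }

    // Renders the field in the notation used in the comment above.
    static String describe(int id, String fullFieldName, List<?> values) {
        return "Field{type=ARRAY, id=" + id
                + ", fullFieldName=\"" + fullFieldName
                + "\", subType=\"" + inferSubType(values) + "\"}";
    }
}
```

For an all-integer array under "test2" with id 2, describe(...) produces the Field{...} string given above; an array mixing integers and strings would report subType="LATEBIND", leaving each value to carry its own type.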
> Build a JSON scanner that does schema discovery
> -----------------------------------------------
>
> Key: DRILL-19
> URL: https://issues.apache.org/jira/browse/DRILL-19
> Project: Apache Drill
> Issue Type: New Feature
> Reporter: Jacques Nadeau
> Assignee: Timothy Chen
>