[ https://issues.apache.org/jira/browse/DRILL-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15818118#comment-15818118 ]
Serhii Harnyk commented on DRILL-3562: -------------------------------------- Besides the initialization of empty arrays we have the problem with ordering of columns with arrays. Query {code} select * from example {code} for Json {noformat} { "a": [], "c": [], "c1": 1 } { "a": [1], "c": [1], "c1": 1 } {noformat} returns result {noformat} ------------------------------------------------------------------- | c1<BIGINT(OPTIONAL)>| a<BIGINT(REPEATED)> | c<BIGINT(REPEATED)> | ------------------------------------------------------------------- | 1 | [] | [] | | 1 | [1] | [1] | ------------------------------------------------------------------- {noformat} with wrong columns order. > Query fails when using flatten on JSON data where some documents have an > empty array > ------------------------------------------------------------------------------------ > > Key: DRILL-3562 > URL: https://issues.apache.org/jira/browse/DRILL-3562 > Project: Apache Drill > Issue Type: Bug > Components: Storage - JSON > Affects Versions: 1.1.0 > Reporter: Philip Deegan > Assignee: Serhii Harnyk > Fix For: Future > > > Drill query fails when using flatten when some records contain an empty array > {noformat} > SELECT COUNT(*) FROM (SELECT FLATTEN(t.a.b.c) AS c FROM dfs.`flat.json` t) > flat WHERE flat.c.d.e = 'f' limit 1; > {noformat} > Succeeds on > { "a": { "b": { "c": [ { "d": { "e": "f" } } ] } } } > Fails on > { "a": { "b": { "c": [] } } } > Error > {noformat} > Error: SYSTEM ERROR: ClassCastException: Cannot cast > org.apache.drill.exec.vector.NullableIntVector to > org.apache.drill.exec.vector.complex.RepeatedValueVector > {noformat} > Is it possible to ignore the empty arrays, or do they need to be populated > with dummy data? -- This message was sent by Atlassian JIRA (v6.3.4#6332)