[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
[ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522268#comment-14522268 ]

Daniel Barclay (Drill) commented on DRILL-2141:

This doesn't seem reproducible without further information.

What exactly did ./nfl in the query refer to when the query was run? That is, what exactly is the relationship of attached file FlumeData.1422748800086 to that reference?

(Having a copy of the attached JSON file at /tmp/nfs/FlumeData.1422748800086.json (with nothing else in nfl/) and using from `dfs.tmp`.`nfl` in the query did not yield an error.

Having a copy of that file at /tmp/nfs/FlumeData.1422748800086 (with nothing else in nfl/) and using from `dfs.tmp`.`nfl` in the query expectedly yields a table not found error.

Having a copy of that file at /tmp/nfl.json and using from `dfs.tmp`.`nfl.json` in the query did not yield an error.)

Data type error in group by and order by for JSON

    Key: DRILL-2141
    URL: https://issues.apache.org/jira/browse/DRILL-2141
    Project: Apache Drill
    Issue Type: Bug
    Components: Execution - Data Types
    Affects Versions: 0.7.0
    Reporter: Andries Engelbrecht
    Assignee: Daniel Barclay (Drill)
    Fix For: 1.0.0
    Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log

When doing group by and order by on complex nested JSON getting Data type errors.

Query:

    select t.retweeted_status.`user`.name as name,
           count(t.retweeted_status.id) as rt_count
    from `./nfl` t
    where t.retweeted_status.`user`.name is not null
    group by t.retweeted_status.`user`.name
    order by count(t.retweeted_status.id) desc
    limit 10;

Screen output:

    Query failed: Query failed: Failure while running fragment., Failure while reading vector.
    Expected vector class of org.apache.drill.exec.vector.NullableIntVector but was holding vector class org.apache.drill.exec.vector.NullableVarCharVector.
    [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
    [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
    java.lang.RuntimeException: java.sql.SQLException: Failure while executing query.
        at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
        at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
        at sqlline.SqlLine.print(SqlLine.java:1809)
        at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
        at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
        at sqlline.SqlLine.dispatch(SqlLine.java:889)
        at sqlline.SqlLine.begin(SqlLine.java:763)
        at sqlline.SqlLine.start(SqlLine.java:498)
        at sqlline.SqlLine.main(SqlLine.java:460)

Drill log attached

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
[ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522386#comment-14522386 ]

Andries Engelbrecht commented on DRILL-2141:

Please mark as resolved as of 0.8.

Have not experienced the issue with 0.8 on the same data set.

Data type error in group by and order by for JSON

    Key: DRILL-2141
    URL: https://issues.apache.org/jira/browse/DRILL-2141
    Project: Apache Drill
    Issue Type: Bug
    Components: Execution - Data Types
    Affects Versions: 0.7.0
    Reporter: Andries Engelbrecht
    Assignee: Daniel Barclay (Drill)
    Fix For: 1.0.0
    Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log
[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
[ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343847#comment-14343847 ]

Hanifi Gunes commented on DRILL-2141:

The query seems to be working fine on the dataset. Would you try to see if this bug is still there?

Data type error in group by and order by for JSON

    Key: DRILL-2141
    URL: https://issues.apache.org/jira/browse/DRILL-2141
    Project: Apache Drill
    Issue Type: Bug
    Components: Execution - Data Types
    Affects Versions: 0.7.0
    Reporter: Andries Engelbrecht
    Assignee: Hanifi Gunes
    Fix For: 0.9.0
    Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log
[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
[ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344376#comment-14344376 ]

Andries Engelbrecht commented on DRILL-2141:

Error still present on a much larger sample set of data in a cluster.

    select t.retweeted_status.`user`.name as name,
           count(t.retweeted_status.id) as rt_count
    from `./nfl` t
    where t.`text` like '%RT_@%'
    group by t.retweeted_status.`user`.name
    order by count(t.retweeted_status.id) desc
    limit 10;

    +------+----------+
    | name | rt_count |
    +------+----------+
    Query failed: Query stopped., Undefined failure occurred.
    [ 79f5d0d4-5101-48e6-a6bc-f25c147db6d8 on se-node11.se.lab:31010 ]
    java.lang.RuntimeException: java.sql.SQLException: Failure while executing query.
        at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
        at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
        at sqlline.SqlLine.print(SqlLine.java:1809)
        at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
        at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
        at sqlline.SqlLine.dispatch(SqlLine.java:889)
        at sqlline.SqlLine.begin(SqlLine.java:763)
        at sqlline.SqlLine.start(SqlLine.java:498)
        at sqlline.SqlLine.main(SqlLine.java:460)

Data type error in group by and order by for JSON

    Key: DRILL-2141
    URL: https://issues.apache.org/jira/browse/DRILL-2141
    Project: Apache Drill
    Issue Type: Bug
    Components: Execution - Data Types
    Affects Versions: 0.7.0
    Reporter: Andries Engelbrecht
    Assignee: Hanifi Gunes
    Fix For: 0.9.0
    Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log
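A note on the filter used in the query above: in standard SQL LIKE, % matches any run of characters (including none) and _ matches exactly one character, so '%RT_@%' selects text containing "RT", then any single character, then "@". The Python sketch below only illustrates that matching rule. The helper name like_matches is hypothetical, it is not part of Drill, and it ignores LIKE escape characters.

```python
import re


def like_matches(text: str, pattern: str) -> bool:
    """Return True if text matches a SQL LIKE pattern.

    Illustration only: '%' becomes '.*', '_' becomes '.', and every
    other character is matched literally. ESCAPE clauses are not handled.
    """
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")   # any run of characters, possibly empty
        elif ch == "_":
            parts.append(".")    # exactly one character
        else:
            parts.append(re.escape(ch))
    # DOTALL so that '%' and '_' also match newlines inside tweet text.
    regex = re.compile("".join(parts), re.DOTALL)
    return regex.fullmatch(text) is not None


if __name__ == "__main__":
    for sample in ("RT @NFL: kickoff tonight", "RT@NFL", "Great game today"):
        print(repr(sample), like_matches(sample, "%RT_@%"))
```

Under this rule a retweet-style text such as "RT @NFL: ..." passes the filter, while text with no character between "RT" and "@" does not.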
[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
[ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339862#comment-14339862 ]

Hanifi Gunes commented on DRILL-2141:

[~aengelbrecht] ping.

Data type error in group by and order by for JSON

    Key: DRILL-2141
    URL: https://issues.apache.org/jira/browse/DRILL-2141
    Project: Apache Drill
    Issue Type: Bug
    Components: Execution - Data Types
    Affects Versions: 0.7.0
    Reporter: Andries Engelbrecht
    Assignee: Hanifi Gunes
    Fix For: 0.9.0
    Attachments: drillbit.log, new_drillbit.log
[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
[ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301762#comment-14301762 ]

Andries Engelbrecht commented on DRILL-2141:

When changing the query to use a different field to filter out JSON docs without the top-level map, a different error is received (similar to DRILL-2140).

New Query:

    select t.retweeted_status.`user`.name as name,
           count(t.retweeted_status.id) as rt_count
    from `./nfl` t
    where t.`text` like '%RT_@%'
    group by t.retweeted_status.`user`.name
    order by count(t.retweeted_status.id) desc
    limit 10;

Screen Output:

    Query failed: Query stopped., Undefined failure occurred.
    [ c480ac84-9dfa-4e1d-922e-d2aabe279b10 on drilldemo:31010 ]
    java.lang.RuntimeException: java.sql.SQLException: Failure while executing query.
        at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
        at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
        at sqlline.SqlLine.print(SqlLine.java:1809)
        at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
        at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
        at sqlline.SqlLine.dispatch(SqlLine.java:889)
        at sqlline.SqlLine.begin(SqlLine.java:763)
        at sqlline.SqlLine.start(SqlLine.java:498)
        at sqlline.SqlLine.main(SqlLine.java:460)

New Drill log attached as new_drillbit.log

Also note this is a single node drill system, and also used alter session set `store.format` = 'json';

Data type error in group by and order by for JSON

    Key: DRILL-2141
    URL: https://issues.apache.org/jira/browse/DRILL-2141
    Project: Apache Drill
    Issue Type: Bug
    Components: Execution - Data Types
    Affects Versions: 0.7.0
    Reporter: Andries Engelbrecht
    Assignee: Daniel Barclay (Drill/MapR)
    Attachments: drillbit.log, new_drillbit.log
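The comment above refers to JSON docs that lack the top-level map (retweeted_status). One way to see how often a nested field is missing, or appears with more than one JSON type, is to scan the data outside Drill. The Python sketch below is only an illustration under stated assumptions: it assumes the data is newline-delimited JSON (the thread does not state the attachment's layout), and the names json_type, lookup and type_counts are hypothetical. It does not reproduce Drill's behavior, and the thread does not establish whether missing or mixed-type fields caused the vector class error.

```python
import json
from collections import Counter

# Sentinel for "path not present in this document" (distinct from JSON null).
_MISSING = object()


def json_type(value):
    """Name the JSON type of a decoded Python value."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # check bool before int: bool is an int subclass
        return "boolean"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    return "object"


def lookup(doc, dotted_path):
    """Follow a dotted path such as 'retweeted_status.user.name' through nested maps."""
    current = doc
    for key in dotted_path.split("."):
        if not isinstance(current, dict) or key not in current:
            return _MISSING
        current = current[key]
    return current


def type_counts(lines, dotted_path):
    """Count, per JSON type (or 'missing'), how many documents carry the path."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between documents
        value = lookup(json.loads(line), dotted_path)
        counts["missing" if value is _MISSING else json_type(value)] += 1
    return dict(counts)


if __name__ == "__main__":
    # Made-up sample documents, not taken from the attached file.
    sample = [
        '{"text": "plain tweet"}',
        '{"text": "RT @a: hi", "retweeted_status": {"id": 1, "user": {"name": "a"}}}',
    ]
    print(type_counts(sample, "retweeted_status.user.name"))
```

To run it against a real file, pass an open file object as lines, for example type_counts(open(path), "retweeted_status.id"), where path is wherever the data was copied.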