[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON

2015-04-30 Thread Daniel Barclay (Drill) (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522268#comment-14522268
 ] 

Daniel Barclay (Drill) commented on DRILL-2141:
---

This doesn't seem reproducible without further information.

What exactly did ./nfl in the query refer to when the query was run?  That 
is, what exactly is the relationship of attached file FlumeData.1422748800086 
to that reference? 

(Having a copy of the attached JSON file at 
/tmp/nfl/FlumeData.1422748800086.json (with nothing else in nfl/) and using 
from `dfs.tmp`.`nfl` in the query did not yield an error.

Having a copy of that file at /tmp/nfl/FlumeData.1422748800086, without the 
.json extension (with nothing else in nfl/), and using from `dfs.tmp`.`nfl` 
in the query yields a table-not-found error, as expected.

Having a copy of that file at /tmp/nfl.json and using from 
`dfs.tmp`.`nfl.json` in the query did not yield an error.)


 Data type error in group by and order by for JSON
 -

 Key: DRILL-2141
 URL: https://issues.apache.org/jira/browse/DRILL-2141
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.7.0
Reporter: Andries Engelbrecht
Assignee: Daniel Barclay (Drill)
 Fix For: 1.0.0

 Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log


 When doing group by and order by on complex nested JSON, getting data type 
 errors.
 Query:
 select t.retweeted_status.`user`.name as name, count(t.retweeted_status.id) 
 as rt_count from `./nfl` t where t.retweeted_status.`user`.name is not null 
 group by t.retweeted_status.`user`.name order by count(t.retweeted_status.id) 
 desc limit 10;
 Screen output:
 Query failed: Query failed: Failure while running fragment., Failure while 
 reading vector.  Expected vector class of 
 org.apache.drill.exec.vector.NullableIntVector but was holding vector class 
 org.apache.drill.exec.vector.NullableVarCharVector. [ 
 c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
 [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 Drill log attached
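
 One way to check the attached data for the condition the error message hints 
 at is sketched below in Python. This is only a sketch under assumptions that 
 this issue does not confirm: that the file holds one JSON document per line, 
 and that the NullableIntVector / NullableVarCharVector mismatch is related to 
 records in which retweeted_status (or a field under it) is missing or has a 
 different type. The helper names json_type, get_path and profile_paths are 
 made up for illustration and are not part of Drill.

```python
import json
from collections import defaultdict


def json_type(value):
    """Name the JSON type of a decoded Python value."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # check bool before int: bool is an int subclass
        return "boolean"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    return "object"


def get_path(record, path):
    """Follow a dotted path through nested dicts; return (found, value)."""
    current = record
    for key in path.split("."):
        if not isinstance(current, dict) or key not in current:
            return False, None
        current = current[key]
    return True, current


def profile_paths(lines, paths):
    """Count, per dotted path, how often each JSON type (or 'missing') occurs.

    Assumes one JSON document per line; blank lines are skipped.
    """
    counts = {path: defaultdict(int) for path in paths}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        for path in paths:
            found, value = get_path(record, path)
            counts[path][json_type(value) if found else "missing"] += 1
    return {path: dict(per_type) for path, per_type in counts.items()}


if __name__ == "__main__":
    # Two invented records: one retweet, one plain tweet without the nested map.
    sample = [
        '{"text": "RT @a: hi", "retweeted_status": {"id": 1, "user": {"name": "a"}}}',
        '{"text": "hello"}',
    ]
    print(profile_paths(sample, ["retweeted_status.user.name", "retweeted_status.id"]))
```

 To run it against the real data, pass an open file handle for a local copy of 
 the attachment as `lines`. A path that shows up both as "string" and as 
 "missing" (or with two different types) would be a candidate for the column 
 the query trips over, but that remains to be verified against the Drill log.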



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON

2015-04-30 Thread Andries Engelbrecht (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522386#comment-14522386
 ] 

Andries Engelbrecht commented on DRILL-2141:


Please mark as resolved as of 0.8.

I have not experienced the issue with 0.8 on the same data set.



[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON

2015-03-02 Thread Hanifi Gunes (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343847#comment-14343847
 ] 

Hanifi Gunes commented on DRILL-2141:
-

The query seems to be working fine on the dataset. Would you try to see if this 
bug is still there?

 Data type error in group by and order by for JSON
 -

 Key: DRILL-2141
 URL: https://issues.apache.org/jira/browse/DRILL-2141
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.7.0
Reporter: Andries Engelbrecht
Assignee: Hanifi Gunes
 Fix For: 0.9.0

 Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log




[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON

2015-03-02 Thread Andries Engelbrecht (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344376#comment-14344376
 ] 

Andries Engelbrecht commented on DRILL-2141:


Error still present on a much larger sample set of data in a cluster.

select t.retweeted_status.`user`.name as name, count(t.retweeted_status.id) as 
rt_count from `./nfl` t where t.`text` like '%RT_@%' group by 
t.retweeted_status.`user`.name order by count(t.retweeted_status.id) desc limit 
10;
+------------+------------+
|    name    |  rt_count  |
+------------+------------+
Query failed: Query stopped., Undefined failure occurred. [ 
79f5d0d4-5101-48e6-a6bc-f25c147db6d8 on se-node11.se.lab:31010 ]


java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
query.
at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
at sqlline.SqlLine.print(SqlLine.java:1809)
at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
at sqlline.SqlLine.dispatch(SqlLine.java:889)
at sqlline.SqlLine.begin(SqlLine.java:763)
at sqlline.SqlLine.start(SqlLine.java:498)
at sqlline.SqlLine.main(SqlLine.java:460)




[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON

2015-02-27 Thread Hanifi Gunes (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339862#comment-14339862
 ] 

Hanifi Gunes commented on DRILL-2141:
-

[~aengelbrecht] ping.

 Data type error in group by and order by for JSON
 -

 Key: DRILL-2141
 URL: https://issues.apache.org/jira/browse/DRILL-2141
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.7.0
Reporter: Andries Engelbrecht
Assignee: Hanifi Gunes
 Fix For: 0.9.0

 Attachments: drillbit.log, new_drillbit.log




[jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON

2015-02-02 Thread Andries Engelbrecht (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301762#comment-14301762
 ] 

Andries Engelbrecht commented on DRILL-2141:


When changing the query to use a different field to filter out JSON docs 
without the top-level map, a different error is received (similar to 
DRILL-2140).

New Query:
select t.retweeted_status.`user`.name as name, count(t.retweeted_status.id) as 
rt_count from `./nfl` t where t.`text` like '%RT_@%' group by 
t.retweeted_status.`user`.name order by count(t.retweeted_status.id) desc limit 
10;

Screen Output:
Query failed: Query stopped., Undefined failure occurred. [ 
c480ac84-9dfa-4e1d-922e-d2aabe279b10 on drilldemo:31010 ]


java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
query.
at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
at sqlline.SqlLine.print(SqlLine.java:1809)
at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
at sqlline.SqlLine.dispatch(SqlLine.java:889)
at sqlline.SqlLine.begin(SqlLine.java:763)
at sqlline.SqlLine.start(SqlLine.java:498)
at sqlline.SqlLine.main(SqlLine.java:460)

New Drill log attached as new_drillbit.log

Also note that this is a single-node Drill system, and that I also ran alter 
session set `store.format` = 'json';
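
For readers unsure what the new filter selects: in standard SQL LIKE, % matches 
any sequence of characters (including none) and _ matches exactly one 
character, so '%RT_@%' matches any text that contains "RT", then one arbitrary 
character, then "@". The Python sketch below mimics that matching so the 
filter can be tried on sample tweet text. It is an illustration only: the 
function names are hypothetical, it ignores the ESCAPE clause, and it says 
nothing about how Drill evaluates LIKE internally.

```python
import re


def like_to_regex(pattern):
    """Translate a SQL LIKE pattern (without ESCAPE) into a compiled regex."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")  # % matches any run of characters, possibly empty
        elif ch == "_":
            parts.append(".")   # _ matches exactly one character
        else:
            parts.append(re.escape(ch))  # everything else is literal
    # DOTALL so that wildcards also match newlines inside the text.
    return re.compile("".join(parts), re.DOTALL)


def like(text, pattern):
    """Return True if the whole text matches the LIKE pattern."""
    return like_to_regex(pattern).fullmatch(text) is not None


if __name__ == "__main__":
    for text in ["RT @nfl: kickoff", "RT@nfl", "plain tweet"]:
        print(repr(text), like(text, "%RT_@%"))
```

Because _ stands for any single character, the pattern does not require a 
space between "RT" and "@"; it also accepts, for example, "RTx@".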



 Data type error in group by and order by for JSON
 -

 Key: DRILL-2141
 URL: https://issues.apache.org/jira/browse/DRILL-2141
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.7.0
Reporter: Andries Engelbrecht
Assignee: Daniel Barclay (Drill/MapR)
 Attachments: drillbit.log, new_drillbit.log

