[ 
https://issues.apache.org/jira/browse/DRILL-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Abhishek Girish updated DRILL-1556:
-----------------------------------
    Description: 
Querying JSON data works at higher values for limit:
> select * from `yelp_academic_dataset_review.json` limit 1125458;

Querying Parquet data (converted from JSON) fails at higher values for limit:
> create table yelp_academic_dataset_review as select * from 
> `yelp_academic_dataset_review.json`;
[success]

>select * from yelp_academic_dataset_review limit 40000;
[data]
java.lang.RuntimeException: java.sql.SQLException: Failure while trying to get 
next result batch.

Logs indicate an error in decoding the Parquet file. Drillbit.log is attached. 

2014-10-20 15:21:22,739 [bf4a3f58-781b-4c89-b718-e1ef6eab6da4:frag:1:0] ERROR 
o.a.drill.exec.ops.FragmentContext - Fragment Context received 
failure.
parquet.io.ParquetDecodingException: Can't read value in column [votes, funny] 
INT64 at value 61063 out of 61063, 61063 out of 61063 in currentPage. 
repetition level: 0, definition level: 2

This is at times consistent and some other times intermittent, for varied 
values provided to the limit clause. 

  was:
Querying JSON data works at higher values for limit:
> select * from `yelp_academic_dataset_review.json` limit 1125458;

Querying Parquet data (converted from JSON) fails at higher values for limit:
> create table yelp_academic_dataset_review as select * from 
> `yelp_academic_dataset_review.json`;
[success]

>select * from yelp_academic_dataset_review limit 40000;
[data]
java.lang.RuntimeException: java.sql.SQLException: Failure while trying to get 
next result batch.
        at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
        at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
        at sqlline.SqlLine.print(SqlLine.java:1809)
        at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
        at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
        at sqlline.SqlLine.dispatch(SqlLine.java:889)
        at sqlline.SqlLine.begin(SqlLine.java:763)
        at sqlline.SqlLine.start(SqlLine.java:498)
        at sqlline.SqlLine.main(SqlLine.java:460)

This is at times thrown consistently and some other times thrown intermittently 
for varied values for the limit clause. 


> Querying JSON-converted-Parquet file throws 
> parquet.io.ParquetDecodingException
> -------------------------------------------------------------------------------
>
>                 Key: DRILL-1556
>                 URL: https://issues.apache.org/jira/browse/DRILL-1556
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>            Reporter: Abhishek Girish
>            Assignee: Jason Altekruse
>
> Querying JSON data works at higher values for limit:
> > select * from `yelp_academic_dataset_review.json` limit 1125458;
> Querying Parquet data (converted from JSON) fails at higher values for limit:
> > create table yelp_academic_dataset_review as select * from 
> > `yelp_academic_dataset_review.json`;
> [success]
> >select * from yelp_academic_dataset_review limit 40000;
> [data]
> java.lang.RuntimeException: java.sql.SQLException: Failure while trying to 
> get next result batch.
> Logs indicate an error in decoding the Parquet file. Drillbit.log is 
> attached. 
> 2014-10-20 15:21:22,739 [bf4a3f58-781b-4c89-b718-e1ef6eab6da4:frag:1:0] ERROR 
> o.a.drill.exec.ops.FragmentContext - Fragment Context received 
> failure.
> parquet.io.ParquetDecodingException: Can't read value in column [votes, 
> funny] INT64 at value 61063 out of 61063, 61063 out of 61063 in currentPage. 
> repetition level: 0, definition level: 2
> This is at times consistent and some other times intermittent, for varied 
> values provided to the limit clause. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to