[ https://issues.apache.org/jira/browse/DRILL-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jason Altekruse updated DRILL-1556:
-----------------------------------
    Fix Version/s:     (was: 1.2.0)
                   1.3.0

> Querying JSON-converted-Parquet file throws parquet.io.ParquetDecodingException (Intermittent)
> ----------------------------------------------------------------------------------------------
>
>                 Key: DRILL-1556
>                 URL: https://issues.apache.org/jira/browse/DRILL-1556
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>            Reporter: Abhishek Girish
>            Assignee: Jason Altekruse
>            Priority: Critical
>             Fix For: 1.3.0
>
>         Attachments: drillbit.log
>
>
> Querying JSON data works at higher values for limit:
> > select * from `yelp_academic_dataset_review.json` limit 1125458;
> Querying Parquet data (converted from JSON) fails at higher values for limit:
> > create table yelp_academic_dataset_review as select * from `yelp_academic_dataset_review.json`;
> [success]
> >select * from yelp_academic_dataset_review limit 40000;
> [data]
> java.lang.RuntimeException: java.sql.SQLException: Failure while trying to get next result batch.
> Logs indicate an error in decoding the Parquet file. Drillbit.log is attached.
> 2014-10-20 15:21:22,739 [bf4a3f58-781b-4c89-b718-e1ef6eab6da4:frag:1:0] ERROR o.a.drill.exec.ops.FragmentContext - Fragment Context received failure.
> parquet.io.ParquetDecodingException: Can't read value in column [votes, funny] INT64 at value 61063 out of 61063, 61063 out of 61063 in currentPage. repetition level: 0, definition level: 2
> This is at times consistent and some other times intermittent, for varied values provided to the limit clause.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)