Parth Chandra created DRILL-1359:
------------------------------------

             Summary: Parquet Complex read/write does not handle repeated 
scalar values correctly
                 Key: DRILL-1359
                 URL: https://issues.apache.org/jira/browse/DRILL-1359
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Operators
            Reporter: Parth Chandra


Reading from a JSON containing a repeated scalar type and then writing to a 
parquet file causes the server to assert -

java.lang.ClassCastException

{panel}
parquet.io.PrimitiveColumnIO cannot be cast to parquet.io.GroupColumnIO

at 
parquet.io.MessageColumnIO$MessageColumnIORecordConsumer.startField(MessageColumnIO.java:171)
 ~[parquet-column-1.5.1-drill-r3.jar:na]

at 
org.apache.drill.exec.store.ParquetOutputRecordWriter$RepeatedVarCharParquetConverter.writeField(ParquetOutputRecordWriter.java:1879)
 ~[classes/:na]

at 
org.apache.drill.exec.store.EventBasedRecordWriter.write(EventBasedRecordWriter.java:58)
 ~[classes/:na]

at 
org.apache.drill.exec.physical.impl.WriterRecordBatch.innerNext(WriterRecordBatch.java:111)
 ~[classes/:na]

at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:95)
 ~[classes/:na]

at 
org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:116)
 ~[classes/:na]

at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:75)
 ~[classes/:na]

at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:65)
 ~[classes/:na]

at 
org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:45)
 ~[classes/:na]

at 
org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:128)
 ~[classes/:na]

at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:95)
 ~[classes/:na]

at 
org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:116)
 ~[classes/:na]

at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:59) 
~[classes/:na]

at 
org.apache.drill.exec.physical.impl.ScreenCreator$ScreenRoot.innerNext(ScreenCreator.java:98)
 ~[classes/:na]

at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:49) 
~[classes/:na]

at 
org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:104)
 ~[classes/:na]

at 
org.apache.drill.exec.work.WorkManager$RunnableWrapper.run(WorkManager.java:250)
 [classes/:na]

at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_51]

at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_51]

at java.lang.Thread.run(Thread.java:744) [na:1.7.0_51]

{panel}


example:
create table test_repeated as select * from cp.`testRepeatedWrite.json`





--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to