[
https://issues.apache.org/jira/browse/DRILL-238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13862057#comment-13862057
]
Jason Altekruse commented on DRILL-238:
---------------------------------------
I combined this with work to clean up the parquet tests, so I created a new
review board with combined patch.
https://reviews.apache.org/r/16628/
> parquet reader hangs when reading nullable columns
> --------------------------------------------------
>
> Key: DRILL-238
> URL: https://issues.apache.org/jira/browse/DRILL-238
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Steven Phillips
> Assignee: Jason Altekruse
> Attachments:
> 0001-Drill-238.-Fixed-bug-with-repeated-varlength-columns.patch
>
>
> Running this physical plan:
> {
> head : {
> type : "APACHE_DRILL_PHYSICAL",
> version : 1,
> generator : {
> type : "optiq",
> info : "na"
> }
> },
> graph : [ {
> pop : "parquet-scan",
> @id : 1,
> entries : [ {
> path : "/tmp/tpc-h/customer"
> } ],
> storageengine : {
> type : "parquet",
> dfsName : "file:///"
> },
> ref : "_MAP",
> fragmentPointer : 0
> }, {
> pop : "project",
> @id : 2,
> exprs : [ {
> ref : "output.$f0",
> expr : "1"
> } ],
> child : 1
> }, {
> pop : "streaming-aggregate",
> @id : 4,
> child : 2,
> keys : [ ],
> exprs : [ {
> ref : "EXPR$0",
> expr : "count($f0) "
> } ]
> }, {
> pop : "screen",
> @id : 6,
> child : 4
> } ]
> }
> against parquet data generated with Pig. The reader hangs. the last message
> in the log is this:
> records to read in this pass: 4000
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)