[
https://issues.apache.org/jira/browse/DRILL-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126075#comment-14126075
]
Jason Altekruse edited comment on DRILL-1388 at 9/8/14 10:19 PM:
-----------------------------------------------------------------
I generated the select list by looking at all of the schema elements listed
by parquet. pig_schema is not actually a column in the file; it is the name
given to the schema root, so the parquet reader currently produces a
null-filled column with that name. It appears that the project operator might
not be handling this correctly, so it should be reviewed. I downgraded the
priority because there is no issue reading the real data.
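To illustrate the behavior described above, here is a minimal sketch (not
Drill's actual reader code) of a scan that returns a null-filled column when a
requested name is not a real column in the file. The function name
read_columns and the sample values are hypothetical and exist only for this
illustration.

```python
from typing import Dict, List, Optional


def read_columns(
    file_columns: Dict[str, List[int]], requested: List[str]
) -> Dict[str, List[Optional[int]]]:
    """Return the requested columns, null-filling any name not in the file.

    Hypothetical stand-in for a reader with project pushdown; it only
    models the null-fill behavior described in the comment.
    """
    # Every column in the file has the same number of rows.
    row_count = len(next(iter(file_columns.values()), []))
    result: Dict[str, List[Optional[int]]] = {}
    for name in requested:
        if name in file_columns:
            result[name] = list(file_columns[name])
        else:
            # Name is not a real column (e.g. the schema root): all nulls.
            result[name] = [None] * row_count
    return result


# Made-up sample data; pig_schema is intentionally absent from the file.
store_sales = {"ss_item_sk": [11, 12, 13], "ss_cdemo_sk": [7, 8, 9]}
batch = read_columns(store_sales, ["pig_schema", "ss_item_sk"])
```

In this sketch, batch holds an all-null pig_schema column next to the real
ss_item_sk values, which is the input the downstream project operator would
then need to handle.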
was (Author: jaltekruse):
I had generated the select list from looking at all of the schema elements
listed by parquet. pig_schema is not actually a column in the file, so the
parquet reader currently will be producing a column with the name that is null
filled. It appears that the project operator might not be handling this
correctly, so it should be reviewed. I downgraded the priority as there is not
an issue reading the real data.
> Incorrect results when projecting nulls
> ---------------------------------------
>
> Key: DRILL-1388
> URL: https://issues.apache.org/jira/browse/DRILL-1388
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Jason Altekruse
>
> While testing fixes for the parquet nullable support I ran into an issue with
> unexpected results. I was selecting several columns out of a parquet file,
> which supports project pushdown. Currently the planner still includes a
> project operation after the scan in this case (to properly modify the schema
> in the case of array indexing, since project pushdown into scans is currently
> not supposed to change structure). I pulled the physical plan from the query
> and ran it without the extra project (as I was not selecting any array
> values) and got the expected results.
> Here is the query I ran. The file is too large to attach, so e-mail me to
> get a copy of it.
> select pig_schema,ss_sold_date_sk,ss_item_sk,ss_cdemo_sk,ss_addr_sk,
> ss_hdemo_sk from store_sales