[ 
https://issues.apache.org/jira/browse/IMPALA-3316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16489443#comment-16489443
 ] 

Boris Tyukin commented on IMPALA-3316:
--------------------------------------

[~attilaj] would you be so kind as to share an update on the fix? IMHO this 
should be classified as a major issue, not minor, since Parquet is a 
recommended format for Impala and most companies I know use Hive to process 
data for Impala to consume. I wonder why it does not get more attention. 

> convert_legacy_hive_parquet_utc_timestamps=true makes reading parquet tables 
> 30x slower
> ---------------------------------------------------------------------------------------
>
>                 Key: IMPALA-3316
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3316
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: impala 2.3
>         Environment: CDH 5.5.2/ Impala 2.3
> Parquet table with a timestamp column
> Secure cluster
> convert_legacy_hive_parquet_utc_timestamps=true
> Timestamp column is not being filtered on
>            Reporter: Ruslan Dautkhanov
>            Assignee: Attila Jeges
>            Priority: Minor
>         Attachments: screenshot-1.png, screenshot-2.png
>
>
> Enabling convert_legacy_hive_parquet_utc_timestamps=true makes simple queries 
> perform really poorly, even when they do not filter on a timestamp attribute.
> Parquet table.
> Impala 2.3 / CDH 5.5.2.
> convert_legacy_hive_parquet_utc_timestamps=true makes the following simple 
> query 30x slower (1.1 minutes -> over 30 minutes).
> {quote} select * from parquet_table_with_a_timestamp_attribute where 
> bigint_attribute=1000771658169 {quote}
> Notice I did not even filter on a timestamp attribute.
> I ran multiple tests with and without 
> convert_legacy_hive_parquet_utc_timestamps=true set on impalad.
> Also, from https://issues.cloudera.org/browse/IMPALA-1658
> {quote} Casey Ching added a comment - 15/Jun/15 5:12 PM
> Btw, a perf test showed enabling this flag was 10x slower. {quote}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

