[ 
https://issues.apache.org/jira/browse/HIVE-14574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15427233#comment-15427233
 ] 

Prasanth Jayachandran edited comment on HIVE-14574 at 8/18/16 9:55 PM:
-----------------------------------------------------------------------

The 3 byte difference will still be there based on what split strategy is 
chosen. If a big file is chosen by ETL split strategy the first split will 
start from 3 offset. If chosen by BI split strategy the first split will start 
from 0. My fix was related to inconsistently choosing strategies based on AM 
cache being on or off. 


was (Author: prasanth_j):
The 3 byte difference will still be there based on what split strategy is 
chose. If a big file is chosen by ETL split strategy the first split will start 
from 3 offset. If chosen by BI split strategy the first split will start from 
0. My fix was related to inconsistently choosing strategies based on AM cache 
being on or off. 

> use consistent hashing for LLAP consistent splits to alleviate impact from 
> cluster changes
> ------------------------------------------------------------------------------------------
>
>                 Key: HIVE-14574
>                 URL: https://issues.apache.org/jira/browse/HIVE-14574
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>         Attachments: HIVE-14574.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to