[ 
https://issues.apache.org/jira/browse/HIVE-6382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth J updated HIVE-6382:
-----------------------------

    Description: In PATCHED_BLOB encoding (added in HIVE-4123), gapVsPatchList 
is an array of longs in which each entry stores the gap between patched values 
(g) together with the patch value (p). The maximum gap distance can be 511, 
which requires 8 bits to encode, and patch values can take more than 56 bits. 
When a patch value takes more than 56 bits, the combined width of p and g 
exceeds 64 bits and the pair cannot be packed into a single long. This results 
in data corruption whenever patch values are wider than 56 bits.   (was: In 
PATCHED_BLOB encoding, gapVsPatchList is an array of long 
that stores gap between the values that are patched (g) and the patch value 
(p). The maximum distance of gap can be 511 that require 8 bits to encode. And 
patch values can take more than 56 bits. When patch values take more than 56 
bits, p + g will become > 64 bits which cannot be packed to a long. This will 
result in data corruption under the case where patch values are > 56 bits. )

> PATCHED_BLOB encoding in ORC will corrupt data in some cases
> ------------------------------------------------------------
>
>                 Key: HIVE-6382
>                 URL: https://issues.apache.org/jira/browse/HIVE-6382
>             Project: Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>    Affects Versions: 0.13.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>              Labels: orcfile
>
> In PATCHED_BLOB encoding (added in HIVE-4123), gapVsPatchList is an array of 
> longs in which each entry stores the gap between patched values (g) together 
> with the patch value (p). The maximum gap distance can be 511, which requires 
> 8 bits to encode, and patch values can take more than 56 bits. When a patch 
> value takes more than 56 bits, the combined width of p and g exceeds 64 bits 
> and the pair cannot be packed into a single long. This results in data 
> corruption whenever patch values are wider than 56 bits. 
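
The overflow described above can be illustrated with a minimal sketch. This is 
not the actual Hive/ORC writer code: the class name PatchPackingSketch, its 
methods, and the layout (gap stored in the bits above a patch of patchWidth 
bits) are assumptions made for illustration, using the 8-bit gap width stated 
in the description.

```java
// Hypothetical sketch, NOT the Hive/ORC implementation.
// Assumed layout: [ gap (8 bits) | patch (patchWidth bits) ] in one long.
class PatchPackingSketch {
    // Gap field width taken from the issue description (assumption).
    static final int GAP_WIDTH = 8;

    // Pack the gap above the patch. patchWidth must be in 1..63.
    static long pack(long gap, long patch, int patchWidth) {
        return (gap << patchWidth) | patch;
    }

    // Recover the gap from the high bits (unsigned shift).
    static long unpackGap(long packed, int patchWidth) {
        return packed >>> patchWidth;
    }

    // Recover the patch from the low patchWidth bits.
    static long unpackPatch(long packed, int patchWidth) {
        long mask = (1L << patchWidth) - 1;
        return packed & mask;
    }

    // True when gap and patch together fit in a 64-bit long.
    static boolean fits(int patchWidth) {
        return GAP_WIDTH + patchWidth <= 64;
    }

    public static void main(String[] args) {
        long gap = 255L;  // largest value an 8-bit gap field can hold
        long patch = 1L;

        // 8 + 56 = 64 bits: the round trip preserves the gap.
        long ok = pack(gap, patch, 56);
        System.out.println(fits(56) + " " + unpackGap(ok, 56));

        // 8 + 60 = 68 bits: the top 4 bits of the gap are shifted out.
        long bad = pack(gap, patch, 60);
        System.out.println(fits(60) + " " + unpackGap(bad, 60));
    }
}
```

With a 56-bit patch the gap survives the round trip. With a 60-bit patch the 
upper bits of the gap are shifted out of the long, so the reader recovers a 
different gap than the writer stored. A check such as fits() before packing is 
one way to detect the case; it is not necessarily how the issue was resolved.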



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
