[
https://issues.apache.org/jira/browse/HIVE-6382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prasanth J updated HIVE-6382:
-----------------------------
Summary: PATCHED_BLOB encoding in ORC will corrupt data in some cases
(was: PATCHED_BLOB encoding in ORC will corrupt the data in some cases)
> PATCHED_BLOB encoding in ORC will corrupt data in some cases
> ------------------------------------------------------------
>
> Key: HIVE-6382
> URL: https://issues.apache.org/jira/browse/HIVE-6382
> Project: Hive
> Issue Type: Bug
> Components: Serializers/Deserializers
> Affects Versions: 0.13.0
> Reporter: Prasanth J
> Assignee: Prasanth J
> Labels: orcfile
>
> In PATCHED_BLOB encoding, gapVsPatchList is an array of longs in which each
> entry packs the gap between patched values (g) together with the patch value
> (p). The maximum gap is 255, which requires 8 bits to encode, so a patch
> value fits alongside the gap only if it takes at most 56 bits. Patch values
> can take more than 56 bits, however. In that case g and p together need more
> than 64 bits and cannot be packed into a single long, which corrupts the data.
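
The overflow described above can be reproduced with a minimal sketch. It assumes a layout in which the gap occupies the high bits of the long and the patch occupies the low patchWidth bits. The class and method names are hypothetical and are not taken from the Hive source; the sketch only illustrates the bit arithmetic, with an 8-bit gap.

```java
public class PatchPackingSketch {
    // Hypothetical helper: gap in the high bits, patch in the low patchWidth bits.
    // Assumes 1 <= patchWidth <= 63.
    public static long pack(long gap, long patch, int patchWidth) {
        return (gap << patchWidth) | patch;
    }

    // Recover the gap by discarding the low patchWidth bits.
    public static long unpackGap(long packed, int patchWidth) {
        return packed >>> patchWidth;
    }

    // Recover the patch by masking off everything above patchWidth bits.
    public static long unpackPatch(long packed, int patchWidth) {
        long mask = (1L << patchWidth) - 1;
        return packed & mask;
    }

    // True when both fields survive a pack/unpack round trip.
    public static boolean roundTrips(long gap, long patch, int patchWidth) {
        long packed = pack(gap, patch, patchWidth);
        return unpackGap(packed, patchWidth) == gap
            && unpackPatch(packed, patchWidth) == patch;
    }

    public static void main(String[] args) {
        long gap = 255; // needs 8 bits

        // 8-bit gap + 56-bit patch = 64 bits: fits exactly.
        System.out.println(roundTrips(gap, (1L << 56) - 1, 56)); // true

        // 8-bit gap + 57-bit patch = 65 bits: the top gap bit is shifted out.
        System.out.println(roundTrips(gap, (1L << 57) - 1, 57)); // false
    }
}
```

In the second case the left shift silently drops the highest bit of the gap, so the value read back is 127 rather than 255. No exception is raised, which is why the failure shows up as corrupted data instead of an error.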
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)