[ https://issues.apache.org/jira/browse/HIVE-6382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Prasanth J updated HIVE-6382:
-----------------------------
    Summary: PATCHED_BLOB encoding in ORC will corrupt data in some cases  (was: PATCHED_BLOB encoding in ORC will corrupt the data in some cases)

> PATCHED_BLOB encoding in ORC will corrupt data in some cases
> ------------------------------------------------------------
>
>                 Key: HIVE-6382
>                 URL: https://issues.apache.org/jira/browse/HIVE-6382
>             Project: Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>    Affects Versions: 0.13.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>              Labels: orcfile
>
> In PATCHED_BLOB encoding, gapVsPatchList is an array of longs. Each entry
> stores the gap between patched values (g) together with the patch value (p).
> The gap can be as large as 511 and takes 8 bits to encode, and patch values
> can take more than 56 bits. When a patch value takes more than 56 bits, p + g
> needs more than 64 bits, which cannot be packed into a long. This results in
> data corruption whenever patch values are wider than 56 bits.

--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
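
The overflow described in the issue can be reproduced with a few lines of Java. This is only a minimal sketch, not the Hive writer code: the class `PatchPackingSketch` and its helpers are hypothetical, and the layout (gap in the high bits, patch in the low `patchWidth` bits of one long) is an assumption made for illustration.

```java
public class PatchPackingSketch {
    // Hypothetical packing: gap goes in the bits above the patch value.
    // Assumed layout for illustration; not taken from the Hive source.
    static long pack(long gap, long patch, int patchWidth) {
        return (gap << patchWidth) | patch;
    }

    // Recover the gap by discarding the low patchWidth bits.
    static long unpackGap(long packed, int patchWidth) {
        return packed >>> patchWidth;
    }

    // Recover the patch by masking off everything above patchWidth bits.
    static long unpackPatch(long packed, int patchWidth) {
        long mask = (patchWidth == 64) ? -1L : (1L << patchWidth) - 1;
        return packed & mask;
    }

    // A gap/patch pair only fits in a long if the two widths total <= 64 bits.
    static boolean fitsInLong(int gapWidth, int patchWidth) {
        return gapWidth + patchWidth <= 64;
    }

    public static void main(String[] args) {
        long gap = 200; // needs 8 bits

        // 8-bit gap + 56-bit patch = 64 bits: the pair round-trips.
        long p56 = (1L << 56) - 1;
        long ok = pack(gap, p56, 56);
        System.out.println("56-bit patch: gap=" + unpackGap(ok, 56)
                + " patchIntact=" + (unpackPatch(ok, 56) == p56));

        // 8-bit gap + 60-bit patch = 68 bits: the top 4 bits of the gap
        // are shifted out of the long, so the gap comes back as 8, not 200.
        long p60 = (1L << 60) - 1;
        long bad = pack(gap, p60, 60);
        System.out.println("60-bit patch: gap=" + unpackGap(bad, 60)
                + " patchIntact=" + (unpackPatch(bad, 60) == p60));
    }
}
```

Under these assumptions, a check such as `fitsInLong(gapWidth, patchWidth)` before packing is one way a writer could detect the case and fall back to a different encoding. Whether the actual fix takes that approach is not stated in the issue.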