[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14053398#comment-14053398 ]
Lefty Leverenz commented on HIVE-7231: -------------------------------------- Facepalm! Now that the patch is committed I've finally noticed that hive.exec.orc.block.padding.tolerance is not a percentage but a decimal fraction. For example, with a 64 MB stripe size the default 0.05 gives 3.2 MB tolerance (0.05 * 64, not 0.05% of 64). This is only a tech-writer's quibble which isn't likely to confuse anyone. I'll explain it in the wiki and put a request in HIVE-6586 to fix it with HIVE-6037. > Improve ORC padding > ------------------- > > Key: HIVE-7231 > URL: https://issues.apache.org/jira/browse/HIVE-7231 > Project: Hive > Issue Type: Improvement > Components: File Formats > Affects Versions: 0.14.0 > Reporter: Prasanth J > Assignee: Prasanth J > Labels: TODOC14, orcfile > Fix For: 0.14.0 > > Attachments: HIVE-7231.1.patch, HIVE-7231.2.patch, HIVE-7231.3.patch, > HIVE-7231.4.patch, HIVE-7231.5.patch, HIVE-7231.6.patch, HIVE-7231.7.patch, > HIVE-7231.8.patch > > > Current ORC padding is not optimal because of fixed stripe sizes within > block. The padding overhead will be significant in some cases. Also padding > percentage relative to stripe size is not configurable. -- This message was sent by Atlassian JIRA (v6.2#6252)