[
https://issues.apache.org/jira/browse/HIVE-8584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14182089#comment-14182089
]
Xiaobing Zhou commented on HIVE-8584:
-------------------------------------
In the dumping files attached, *-1-* and *-2-* means PST and non-PST timezone,
respectively.
> Setting hive.exec.orc.default.compress to ZLIB will lead to orc file size
> delta byte(s) shorter on Windows than Linux
> ---------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-8584
> URL: https://issues.apache.org/jira/browse/HIVE-8584
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.14.0
> Environment: Windows
> Reporter: Xiaobing Zhou
> Assignee: Xiaobing Zhou
> Attachments: orc-win-none-1.dump, orc-win-none-2.dump,
> orc-win-snappy-1.dump, orc-win-snappy-2.dump, orc-win-zlib-1.dump,
> orc-win-zlib-2.dump, orc_analyze.q
>
>
> repo steps:
> 1. run query orc_analyze.q
> 2. hive --orcfiledump <target_orc_file_generated>
> run 1 and 2 on PST timezone on Linux, and one more time on other timezone
> e.g. CST on Windows.
> Compare two target orc file dumping. Windows orc file is 1 byte shorter than
> Linux one.
> That's the case even if running 1 and 2 on Windows for different timezones,
> however, no problem on Linux.
> The issue only exists by using ZLIB mode, eventually OS native compression
> lib is used.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)