[ 
https://issues.apache.org/jira/browse/HIVE-8584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14182333#comment-14182333
 ] 

Lefty Leverenz commented on HIVE-8584:
--------------------------------------

The wiki has a few places where this could be mentioned:

* [ORC -- Compression | 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC#LanguageManualORC-Compression]
* orc.compress table property in [ORC -- HiveQLSyntax | 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC#LanguageManualORC-HiveQLSyntax]
* [hive.exec.orc.default.compress in Configuration Properties | 
https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.exec.orc.default.compress]
* no ORC discussion, only Gzip and Bzip2 for TextFile (the doc needs updating):  
[Compressed Data Storage | 
https://cwiki.apache.org/confluence/display/Hive/CompressedStorage]

> Setting hive.exec.orc.default.compress to ZLIB will lead to orc file size 
> delta byte(s) shorter on Windows than Linux
> ---------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-8584
>                 URL: https://issues.apache.org/jira/browse/HIVE-8584
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.14.0
>         Environment: Windows
>            Reporter: Xiaobing Zhou
>            Assignee: Xiaobing Zhou
>            Priority: Critical
>         Attachments: HIVE-8584.1.patch, orc-win-none-1.dump, 
> orc-win-none-2.dump, orc-win-snappy-1.dump, orc-win-snappy-2.dump, 
> orc-win-zlib-1.dump, orc-win-zlib-2.dump, orc_analyze.q
>
>
> Repro steps:
> 1. Run the query orc_analyze.q.
> 2. Run hive --orcfiledump <target_orc_file_generated>.
> Run steps 1 and 2 in the PST timezone on Linux, then once more in another 
> timezone (e.g. CST) on Windows.
> Compare the two ORC file dumps. The Windows ORC file is 1 byte shorter than 
> the Linux one.
> The sizes also differ when steps 1 and 2 are run on Windows in different 
> timezones; Linux shows no such problem.
> The issue occurs only in ZLIB mode, where the OS native compression 
> library is ultimately used.
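
The size comparison in the repro steps could be scripted roughly as follows. This is only a sketch: the helper names `size_delta` and `describe_delta` and the file names in the usage note are hypothetical, and it assumes both ORC files have already been copied to one machine.

```python
import os


def size_delta(path_a: str, path_b: str) -> int:
    """Return the size of path_a minus the size of path_b, in bytes."""
    return os.path.getsize(path_a) - os.path.getsize(path_b)


def describe_delta(path_a: str, path_b: str) -> str:
    """Summarize which of the two files is shorter, and by how many bytes."""
    delta = size_delta(path_a, path_b)
    if delta == 0:
        return "same size"
    # A negative delta means path_a is the shorter file.
    shorter = path_a if delta < 0 else path_b
    return f"{shorter} is {abs(delta)} byte(s) shorter"
```

For example, `describe_delta("linux.orc", "windows.orc")` (hypothetical file names) would report which file is shorter. A size check like this only detects the difference; the `hive --orcfiledump` output is still needed to see where the files diverge.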



