[
https://issues.apache.org/jira/browse/HIVE-8720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14196623#comment-14196623
]
Prasanth J commented on HIVE-8720:
----------------------------------
[~hagleitn] Can we have this for 0.14? These are just test file diffs to make
the qfile results consistent across platforms.
> Update orc_merge tests to make it consistent across OS'es
> ---------------------------------------------------------
>
> Key: HIVE-8720
> URL: https://issues.apache.org/jira/browse/HIVE-8720
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.14.0
> Reporter: Prasanth J
> Assignee: Prasanth J
> Attachments: HIVE-8720.1.patch, orc_merge5_filedump_macosx.txt,
> orc_merge5_filedump_opensuse.txt
>
>
> orc_merge*.q test cases fails with qfile diffs related to file size on
> different OSes. I have seen failures with Open SUSE and CentOS. The order of
> insertion of rows into ORC table impacts the file size because of run length
> encoding. Since the order of rows is not guaranteed during insertion into
> table we may get different file sizes. We cannot add ORDER BY to insert
> queries as it will force insertion through single reducer which will disable
> orc merge file optimization. Since these test cases test if the files are
> merged or not it is sufficient to know the number of files after merging.
> Instead of DESCRIBE FORMATTED (which shows the numFiles and fileSize) we can
> use "dfs -ls" to know the number of files.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)