Prasanth J created HIVE-8720:
--------------------------------
Summary: Update orc_merge tests to make it consistent across OSes
Key: HIVE-8720
URL: https://issues.apache.org/jira/browse/HIVE-8720
Project: Hive
Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Prasanth J
Assignee: Prasanth J
orc_merge*.q test cases fails with qfile diffs related to file size on
different OSes. I have seen failures with Open SUSE and CentOS. The order of
insertion of rows into ORC table impacts the file size because of run length
encoding. Since the order of rows is not guaranteed during insertion into table
we may get different file sizes. We cannot add ORDER BY to insert queries as it
will force insertion through single reducer which will disable orc merge file
optimization. Since these test cases test if the files are merged or not it is
sufficient to know the number of files after merging. Instead of DESCRIBE
FORMATTED (which shows the numFiles and fileSize) we can use "dfs -ls" to know
the number of files.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)