[ https://issues.apache.org/jira/browse/HIVE-5546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796999#comment-13796999 ]
Hudson commented on HIVE-5546:
------------------------------

FAILURE: Integrated in Hive-trunk-hadoop1-ptest #206 (See [https://builds.apache.org/job/Hive-trunk-hadoop1-ptest/206/])
HIVE-5546 : A change in ORCInputFormat made by HIVE4113 was reverted by HIVE5391 (Yin Huai via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1532800)
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcInputFormat.java


> A change in ORCInputFormat made by HIVE-4113 was reverted by HIVE-5391
> ----------------------------------------------------------------------
>
>                 Key: HIVE-5546
>                 URL: https://issues.apache.org/jira/browse/HIVE-5546
>             Project: Hive
>          Issue Type: Bug
>   Affects Versions: 0.13.0
>            Reporter: Yin Huai
>            Assignee: Yin Huai
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5546.1.patch, HIVE-5546.2.patch
>
>
> {code}
> 2013-10-15 10:49:49,386 INFO org.apache.hadoop.hive.ql.io.orc.OrcInputFormat: included column ids =
> 2013-10-15 10:49:49,386 INFO org.apache.hadoop.hive.ql.io.orc.OrcInputFormat: included columns names =
> 2013-10-15 10:49:49,386 INFO org.apache.hadoop.hive.ql.io.orc.OrcInputFormat: No ORC pushdown predicate
> 2013-10-15 10:49:49,834 INFO org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader: Processing file hdfs://localhost:54310/user/hive/warehouse/web_sales_orc/000000_0
> 2013-10-15 10:49:49,834 INFO org.apache.hadoop.mapred.MapTask: numReduceTasks: 1
> 2013-10-15 10:49:49,840 INFO org.apache.hadoop.mapred.MapTask: io.sort.mb = 100
> 2013-10-15 10:49:49,968 INFO org.apache.hadoop.mapred.TaskLogsTruncater: Initializing logs' truncater with mapRetainSize=-1 and reduceRetainSize=-1
> 2013-10-15 10:49:49,994 INFO org.apache.hadoop.io.nativeio.NativeIO: Initialized cache for UID to User mapping with a cache timeout of 14400 seconds.
> 2013-10-15 10:49:49,994 INFO org.apache.hadoop.io.nativeio.NativeIO: Got UserName yhuai for UID 1000 from the native implementation
> 2013-10-15 10:49:49,996 FATAL org.apache.hadoop.mapred.Child: Error running child : java.lang.OutOfMemoryError: Java heap space
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(MapTask.java:949)
>     at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:428)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
>     at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)
>     at org.apache.hadoop.mapred.Child.main(Child.java:249)
> {code}
> If includedColumnIds is an empty list, we do not need to read any column.
> But, right now, in OrcInputFormat.findIncludedColumns, we have ...
> {code}
> if (ColumnProjectionUtils.isReadAllColumns(conf) ||
>     includedStr == null || includedStr.trim().length() == 0) {
>   return null;
> }
> {code}
> If includedStr is an empty string, the code assumes that we need all columns,
> which is not correct.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
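
The distinction described in the issue (an empty column list means "read no columns", while only the explicit read-all flag means "read every column") can be sketched in isolation as follows. This is a minimal, standalone illustration and not the attached patch: the class `IncludedColumnsSketch` and the method `parseIncludedColumns` are hypothetical names, the `readAllColumns` parameter stands in for `ColumnProjectionUtils.isReadAllColumns(conf)`, and treating a null `includedStr` as "read all" is an assumption made here for the example.

```java
import java.util.ArrayList;
import java.util.List;

public class IncludedColumnsSketch {

    /**
     * Hypothetical stand-in for the decision made in findIncludedColumns.
     * Returns null to mean "read every column"; otherwise returns the
     * (possibly empty) list of column ids to read.
     */
    static List<Integer> parseIncludedColumns(boolean readAllColumns, String includedStr) {
        // Assumption: only the explicit flag, or an unset property, means "all columns".
        if (readAllColumns || includedStr == null) {
            return null;
        }
        List<Integer> ids = new ArrayList<>();
        // An empty or blank string yields an empty list, i.e. no columns are needed.
        for (String part : includedStr.split(",")) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) {
                ids.add(Integer.parseInt(trimmed));
            }
        }
        return ids;
    }

    public static void main(String[] args) {
        System.out.println(parseIncludedColumns(true, ""));     // prints: null
        System.out.println(parseIncludedColumns(false, ""));    // prints: []
        System.out.println(parseIncludedColumns(false, "0,2")); // prints: [0, 2]
    }
}
```

With the condition quoted in the issue, the second call would also return null, which is the behavior the report identifies as incorrect.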