[ https://issues.apache.org/jira/browse/HIVE-4421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Phabricator updated HIVE-4421: ------------------------------ Attachment: HIVE-4421.D10545.2.patch omalley updated the revision "HIVE-4421 [jira] Improve memory usage by ORC dictionaries". Changed the memory manager to check on each 5000 total rows added. This seems to give the best trade off between handling too many writers in a small heap and still managing memory pretty accurately. Reviewers: JIRA REVISION DETAIL https://reviews.facebook.net/D10545 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D10545?vs=32889&id=33201#toc AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicByteArray.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicIntArray.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/MemoryManager.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OutStream.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionedOutputStream.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RedBlackTree.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/StringRedBlackTree.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/WriterImpl.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestMemoryManager.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestStringRedBlackTree.java ql/src/test/resources/orc-file-dump.out To: JIRA, omalley > Improve memory usage by ORC dictionaries > ---------------------------------------- > > Key: HIVE-4421 > URL: https://issues.apache.org/jira/browse/HIVE-4421 > Project: Hive > Issue Type: Bug > Components: Serializers/Deserializers > Reporter: Owen O'Malley > Assignee: Owen O'Malley > Fix For: 0.11.0 > > Attachments: HIVE-4421.D10545.1.patch, HIVE-4421.D10545.2.patch > > > Currently, for tables with many string columns, it is possible to > significantly underestimate the memory used by the ORC dictionaries and cause > the query to run out of memory in the task. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira