[ https://issues.apache.org/jira/browse/HIVE-7155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14486727#comment-14486727 ]
Lefty Leverenz commented on HIVE-7155:
--------------------------------------

Doc note: *templeton.mapper.memory.mb* is documented in the WebHCat Configuration wiki, at the end of the table of configuration variables. (Better late than never.)

I took the liberty of changing "Templeton" to "WebHCat" in the description -- should have thought of that before the commit.

* [WebHCat Configuration -- Configuration Variables | https://cwiki.apache.org/confluence/display/Hive/WebHCat+Configure#WebHCatConfigure-ConfigurationVariables]

> WebHCat controller job exceeds container memory limit
> -----------------------------------------------------
>
>                 Key: HIVE-7155
>                 URL: https://issues.apache.org/jira/browse/HIVE-7155
>             Project: Hive
>          Issue Type: Bug
>          Components: WebHCat
>    Affects Versions: 0.13.0
>            Reporter: shanyu zhao
>            Assignee: shanyu zhao
>              Labels: TODOC14
>             Fix For: 0.14.0
>
>         Attachments: HIVE-7155.1.patch, HIVE-7155.2.patch, HIVE-7155.patch
>
>
> Submitting a Hive query on a large table via WebHCat fails because
> the WebHCat controller job is killed by YARN when it exceeds the memory
> limit (set by mapreduce.map.memory.mb, which defaults to 1GB):
> {code}
> INSERT OVERWRITE TABLE Temp_InjusticeEvents_2014_03_01_00_00 SELECT * from
> Stage_InjusticeEvents where LogTimestamp > '2014-03-01 00:00:00' and
> LogTimestamp <= '2014-03-01 01:00:00';
> {code}
> We could increase mapreduce.map.memory.mb to solve this problem, but that
> would change the setting system-wide.
> We need to provide a WebHCat configuration to override
> mapreduce.map.memory.mb when submitting the controller job.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
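
For illustration only, a webhcat-site.xml entry for the *templeton.mapper.memory.mb* setting discussed above might look like the sketch below. The value 2048 is an assumed example and does not come from the issue or the patch; a suitable value depends on the cluster's YARN container limits.

{code:xml}
<!-- Sketch of a webhcat-site.xml entry; the value is an assumed example. -->
<property>
  <name>templeton.mapper.memory.mb</name>
  <!-- Memory limit, in MB, applied to the WebHCat controller job's mapper
       in place of mapreduce.map.memory.mb for that job only. -->
  <value>2048</value>
</property>
{code}

Because the setting lives in the WebHCat configuration, the cluster-wide mapreduce.map.memory.mb value in mapred-site.xml is left unchanged for other jobs.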