[ https://issues.apache.org/jira/browse/HIVE-131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joydeep Sen Sarma updated HIVE-131: ----------------------------------- Attachment: hive-131.patch.2 Dhruba said: > 1. I see that execute returns values 1, 2, and 3. It will be good to document > what these values mean. > 2. Staring hadoop 0.19, it might make sense to set FileSystem.deleteOnExit() > for files that are temporary. > 3. It is interesting to note that now there is an extra step jobClose() that > gets triggered on the client-side after the job is complete. Prior to this > patch, a job would be successful even if the client-side has disappeared > before the job is completed. This patch requires that the client remains > active and healthy till the entire job is complete. This probably is ok for > Hive, especially because Hive anyway requires job-chaining and I do not see > any other way to do it - incorporated suggestion to use deleteOnExit where available. - return codes are always accompanied by a corresponding message on the console/log. So don't see much point creating additional documentation around them. - hive has always depended on client side code-patch for query completion. > insert overwrite directory leaves behind uncommitted/tmp files from failed > tasks > -------------------------------------------------------------------------------- > > Key: HIVE-131 > URL: https://issues.apache.org/jira/browse/HIVE-131 > Project: Hadoop Hive > Issue Type: Bug > Components: Query Processor > Reporter: Joydeep Sen Sarma > Assignee: Joydeep Sen Sarma > Priority: Critical > Attachments: HIVE-131.patch.1, hive-131.patch.2 > > > _tmp files are getting left behind on insert overwrite directory: > /user/jssarma/ctst1/40422_m_000195_0.deflate <r 3> 13285 2008-12-07 01:47 > rw-r--r-- jssarma supergroup > /user/jssarma/ctst1/40422_m_000196_0.deflate <r 3> 3055 2008-12-07 01:46 > rw-r--r-- jssarma supergroup > /user/jssarma/ctst1/_tmp.40422_m_000033_0 <r 3> 0 2008-12-07 01:53 rw-r--r-- > jssarma supergroup > /user/jssarma/ctst1/_tmp.40422_m_000037_1 <r 3> 0 2008-12-07 01:53 rw-r--r-- > jssarma supergroup > this happened with speculative execution. the code looks good (in fact in > this case many speculative tasks were launched - and only a couple caused > problems). Almost seems like these files did not appear in the namespace > until after the map-reduce job finished and the movetask did a listing of the > output dir .. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.