[ 
https://issues.apache.org/jira/browse/HIVE-131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joydeep Sen Sarma updated HIVE-131:
-----------------------------------

    Attachment: hive-131.patch.2

Dhruba said:

> 1. I see that execute returns values 1, 2, and 3. It will be good to document 
> what these values mean.
> 2. Staring hadoop 0.19, it might make sense to set FileSystem.deleteOnExit() 
> for files that are temporary.
> 3. It is interesting to note that now there is an extra step jobClose() that 
> gets triggered on the client-side after the job is complete. Prior to this 
> patch, a job would be successful even if the client-side has disappeared 
> before the job is completed. This patch requires that the client remains 
> active and healthy till the entire job is complete. This probably is ok for 
> Hive, especially because Hive anyway requires job-chaining and I do not see 
> any other way to do it

- incorporated  suggestion to use deleteOnExit where available.
- return codes are always accompanied by a corresponding message on the 
console/log. So don't see much point creating additional documentation around 
them.
- hive has always depended on client side code-patch for query completion.

> insert overwrite directory leaves behind uncommitted/tmp files from failed 
> tasks
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-131
>                 URL: https://issues.apache.org/jira/browse/HIVE-131
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Joydeep Sen Sarma
>            Assignee: Joydeep Sen Sarma
>            Priority: Critical
>         Attachments: HIVE-131.patch.1, hive-131.patch.2
>
>
> _tmp files are getting left behind on insert overwrite directory:
> /user/jssarma/ctst1/40422_m_000195_0.deflate  <r 3> 13285 2008-12-07 01:47  
> rw-r--r-- jssarma supergroup
> /user/jssarma/ctst1/40422_m_000196_0.deflate  <r 3> 3055  2008-12-07 01:46  
> rw-r--r-- jssarma supergroup
> /user/jssarma/ctst1/_tmp.40422_m_000033_0 <r 3> 0 2008-12-07 01:53  rw-r--r-- 
> jssarma supergroup
> /user/jssarma/ctst1/_tmp.40422_m_000037_1 <r 3> 0 2008-12-07 01:53  rw-r--r-- 
> jssarma supergroup
> this happened with speculative execution. the code looks good (in fact in 
> this case many speculative tasks were launched - and only a couple caused 
> problems). Almost seems like these files did not appear in the namespace 
> until after the map-reduce job finished and the movetask did a listing of the 
> output dir ..

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to