[jira] Commented: (HADOOP-2116) Job.local.dir to be exposed to tasks

Milind Bhandarkar (JIRA) Thu, 27 Dec 2007 11:40:15 -0800

    [ 
https://issues.apache.org/jira/browse/HADOOP-2116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554603
 ]


Milind Bhandarkar commented on HADOOP-2116:
-------------------------------------------

I would prefer separating the two. I.e. where job.jar goes, versus where the 
job.local.dir goes. Especially for streaming, where side-effect tasks are 
common, the mapper and reducer commands would need to have a clean directory 
(empty) where they can cache job-specific data (dictionaries downloaded off the 
network etc, that cannot be packaged as distributed archives). If job.jar also 
lives there, it might someday clash with the files downloaded, and cause issues.

So, mapred.jar, jobCacheDir, and job.local.dir all need to be different 
locations.

Is jobCacheDir available via a config variable ?

> Job.local.dir to be exposed to tasks
> ------------------------------------
>
>                 Key: HADOOP-2116
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2116
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.14.3
>         Environment: All
>            Reporter: Milind Bhandarkar
>             Fix For: 0.16.0
>
>
> Currently, since all task cwds are created under a jobcache directory, users 
> that need a job-specific shared directory for use as scratch space, create 
> ../work. This is hacky, and will break when HADOOP-2115 is addressed. For 
> such jobs, hadoop mapred should expose job.local.dir via localized 
> configuration.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-2116) Job.local.dir to be exposed to tasks

Reply via email to