[
https://issues.apache.org/jira/browse/HADOOP-2116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arun C Murthy updated HADOOP-2116:
----------------------------------
Status: Open (was: Patch Available)
I light of HADOOP-2570, I'm cancelling this patch.
Reasoning:
The *-file* option works by putting the script into the job's jar file by
unjar-ing, copying and then jar-ing it again. (yuck!)
This means that on the TaskTracker the script has moved from jobCache/work to
jobCache/job_jar_xml (I propose we rename that to *private*, heh). Clearly
user-scripts which rely on "../work/<script_name>" will break again...
Having said that we need to debate whether this feature is an
incompatible-change, what do folks think?
If people say otherwise we need to ensure all files in jobCache/private are
smylinked into jobCache/work... ugh!
----
I'd like to take this opportunity to take a hard look at streaming's *-file*
option too. The unjar/jar way is completely backwards! We _should_ rework the
-file option to use the DistributedCache and the symlink option it provides.
So, user-scripts can simply be "./<script>" rather than "../work/<script>".
Yes, the way to maintain compatibility (if we want) is to use the previous
option of symlinking files into jobCache/work also. I'd strongly vote for this
option.
Thoughts?
> Job.local.dir to be exposed to tasks
> ------------------------------------
>
> Key: HADOOP-2116
> URL: https://issues.apache.org/jira/browse/HADOOP-2116
> Project: Hadoop
> Issue Type: Improvement
> Components: mapred
> Affects Versions: 0.14.3
> Environment: All
> Reporter: Milind Bhandarkar
> Assignee: Amareshwari Sri Ramadasu
> Fix For: 0.16.0
>
> Attachments: patch-2116.txt, patch-2116.txt
>
>
> Currently, since all task cwds are created under a jobcache directory, users
> that need a job-specific shared directory for use as scratch space, create
> ../work. This is hacky, and will break when HADOOP-2115 is addressed. For
> such jobs, hadoop mapred should expose job.local.dir via localized
> configuration.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.