[jira] Commented: (HADOOP-2178) Job history on HDFS

eric baldeschwieler (JIRA) Wed, 02 Jan 2008 12:57:54 -0800

    [ 
https://issues.apache.org/jira/browse/HADOOP-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12555417#action_12555417
 ]


eric baldeschwieler commented on HADOOP-2178:
---------------------------------------------

Why copy files once a job is complete?  Why not just always write them directly 
to the HDFS (or any other URL the user configures)?

I'm not a real fan of hidden directories like these.  The user will not know of 
them and potentially will fill a lot of disk/name space with never viewed 
material.  I'd be much happier if job history were considered part of the 
output of a job, unless configured otherwise.  IE put it in the map-reduce 
output directory in a file or directories prefixed with an underscore.  So 
<output>/_jobHistory or perhaps <output>/_logs/history.

We added the convention that map-reduce ignores underscore prefixed files 
specifically to allow this use case...

This also reduces jobid/name confusion, since the history is directly 
associated with the job's output.

We could then provide an option to put it in another location if the user 
desires.

thoughts?



> Job history on HDFS
> -------------------
>
>                 Key: HADOOP-2178
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2178
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Amareshwari Sri Ramadasu
>            Assignee: Amareshwari Sri Ramadasu
>             Fix For: 0.16.0
>
>
> This issue addresses the following items :
> 1.  Check for accuracy of job tracker history logs.
> 2.  After completion of the job, copy the JobHistory.log(Master index file) 
> and the job history files to the DFS.
> 3. User can load the history with commands
> bin/hadoop job -history <directory> 
> or
> bin/hadoop job -history <jobid>
> This will start a stand-alone jetty and load jsps

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-2178) Job history on HDFS

Reply via email to