[ 
https://issues.apache.org/jira/browse/HADOOP-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556625#action_12556625
 ] 

Devaraj Das commented on HADOOP-2178:
-------------------------------------

+1 on Eric's suggestion. The job history viewer (a web server with the 
job-history-related JSPs) can take the job's output directory as its input and 
populate the history data structures from it. For now this can run on a 
per-user basis (e.g., bin/hadoop jobhistoryview -output <dir> .. ), and in the 
future we could make the viewer a centralized web-enabled server that anyone 
can use. Thoughts?
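
To make the per-user idea a bit more concrete, here is a rough sketch of the 
first step only: given a directory, collect the files a viewer would then 
parse. Everything in it is an assumption for illustration. HistoryDirScanner 
and listHistoryFiles are hypothetical names, not existing Hadoop classes; it 
reads the local filesystem through java.nio rather than the DFS; and it assumes 
the history files sit directly under the given directory. Parsing the files and 
serving the JSPs are left out.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.stream.Stream;

// Hypothetical helper, not part of Hadoop: gathers candidate history files
// from a job output directory so a viewer could load them afterwards.
public class HistoryDirScanner {

    // Returns the regular files directly under outputDir, sorted by path.
    // Assumption: history files are not nested in subdirectories.
    public static List<Path> listHistoryFiles(Path outputDir) throws IOException {
        List<Path> files = new ArrayList<>();
        // Files.list holds an open directory handle, so close it when done.
        try (Stream<Path> entries = Files.list(outputDir)) {
            entries.filter(Files::isRegularFile).forEach(files::add);
        }
        Collections.sort(files);
        return files;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: HistoryDirScanner <dir>");
            System.exit(1);
        }
        // Print each candidate file; a real viewer would parse them instead.
        for (Path p : listHistoryFiles(Paths.get(args[0]))) {
            System.out.println(p);
        }
    }
}
```

A real viewer would go through the DFS client rather than java.nio, but the 
shape (directory in, list of history files out) would presumably be similar.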

bq. The output data may be deleted anytime when it is no longer needed. The log 
data may be needed long after the output data is deleted.

As Eric suggested, this can be solved by allowing the user to move the files to 
some persistent location... 



> Job history on HDFS
> -------------------
>
>                 Key: HADOOP-2178
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2178
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Amareshwari Sri Ramadasu
>            Assignee: Amareshwari Sri Ramadasu
>             Fix For: 0.16.0
>
>
> This issue addresses the following items :
> 1.  Check for accuracy of job tracker history logs.
> 2.  After completion of the job, copy the JobHistory.log (master index file) 
> and the job history files to the DFS.
> 3. User can load the history with commands
> bin/hadoop job -history <directory> 
> or
> bin/hadoop job -history <jobid>
> This will start a stand-alone Jetty server and load the JSPs.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
