[ https://issues.apache.org/jira/browse/HADOOP-5083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12668746#action_12668746 ]
Amar Kamat commented on HADOOP-5083:
------------------------------------
bq. 1. If a job is completed and retired, and then the JT as well as the
History Server restart, can a user get to the logs of a job that was completed
earlier?
As of now, the History Server simply provides a web interface for the job
history files on the history-fs: it reads a history file, parses it, and
lets users analyze it. A JobTracker restart will make sure that
- the jobs that were marked completed remain untouched
- the jobs that were running/pending are completed. This also includes
maintaining the history files and making sure that in the end there is only one
history file for a completed job
bq. Does the History Server keep some sort of a persistent index into the
completed/failed jobs?
Nope. It doesn't need to keep one. All the files are maintained in a
job-history folder.
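To illustrate why no index is needed, here is a minimal sketch (not the code in the patch) of looking up a job's history by listing the folder on each request. The function names, the sample file names, and the assumption that a history file name contains the job ID are all hypothetical, and a local temporary directory stands in for the history-fs.

```python
import tempfile
from pathlib import Path


def find_history_files(history_dir, job_id):
    """Return the sorted names of files in history_dir whose name contains job_id.

    Assumption for this sketch: a history file name embeds the job ID.
    The folder listing itself plays the role of the index.
    """
    return sorted(
        p.name
        for p in Path(history_dir).iterdir()
        if p.is_file() and job_id in p.name
    )


def demo():
    """Create a throwaway folder with two made-up history files and look one up."""
    with tempfile.TemporaryDirectory() as d:
        for name in ("job_200901010000_0001_alice", "job_200901010000_0002_bob"):
            (Path(d) / name).write_text("placeholder history content\n")
        return find_history_files(d, "job_200901010000_0001")


if __name__ == "__main__":
    print(demo())
```

Because every lookup re-reads the folder, a restarted server in this sketch sees the same files as before without having to rebuild any state.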
> Optionally a separate daemon should serve JobHistory
> ----------------------------------------------------
>
> Key: HADOOP-5083
> URL: https://issues.apache.org/jira/browse/HADOOP-5083
> Project: Hadoop Core
> Issue Type: Improvement
> Components: mapred
> Reporter: Arun C Murthy
> Assignee: Amar Kamat
> Attachments: HADOOP-5083-v1.2.patch, HADOOP-5083-v1.9.patch
>
>
> Currently the JobTracker serves the JobHistory to end-users off files on
> local-disk/hdfs. Running very large clusters with a large user-base
> might result in lots of traffic for job-history, which needlessly taxes the
> JobTracker. The proposal is to have an optional daemon which handles serving
> of job-history requests.