want InputFormat for task logs
------------------------------
Key: HADOOP-1199
URL: https://issues.apache.org/jira/browse/HADOOP-1199
Project: Hadoop
Issue Type: New Feature
Components: mapred
Reporter: Doug Cutting
We should provide an InputFormat implementation that includes all the task logs
from a job. Folks should be able to do something like:
job = new JobConf();
job.setInputFormatClass(TaskLogInputFormat.class);
TaskLogInputFormat.setJobId(jobId);
...
Tasks should ideally be localized to the node that each log is on.
Examining logs should be as lightweight as possible, to facilitate debugging.
It should not require a copy to HDFS. A faster debug loop is like a faster
search engine: it makes people more productive. The sooner one can find that,
e.g., most tasks failed with a NullPointerException on line 723, the better.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.