[ https://issues.apache.org/jira/browse/HADOOP-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Owen O'Malley updated HADOOP-1533:
----------------------------------
Status: Open (was: Patch Available)
This patch seems confused about whether the "log path" is a Path, a URI, or a String.
Since you are implementing it as the output of the Map (with reduces = 0), it
pretty clearly must be a Path. It should be coded as such.
Instead of requiring a parameter for the log path, I think it should be an
option like:
... -log hdfs://server:1234/my/log
and I'd propose that the default be: "$output_dir/_distcp_logs"
That will work well in most cases and keep the output/logs together.
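
A minimal sketch of that option handling, assuming the flag is spelled "-log" as above. The class and method names are hypothetical and are not taken from the patch, and plain strings stand in for org.apache.hadoop.fs.Path so the example runs without Hadoop on the classpath. In DistCp itself the default would more naturally be built as new Path(outputDir, "_distcp_logs").

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: picks the log directory from the command line,
// falling back to "<output_dir>/_distcp_logs" when -log is absent.
final class DistcpLogOption {
    static final String DEFAULT_LOG_DIR_NAME = "_distcp_logs";

    static String resolveLogDir(List<String> args, String outputDir) {
        int i = args.indexOf("-log");
        if (i >= 0) {
            if (i + 1 >= args.size()) {
                throw new IllegalArgumentException("-log requires a path argument");
            }
            // An explicit -log value always wins over the default.
            return args.get(i + 1);
        }
        // Drop one trailing slash so the default does not contain "//".
        String base = outputDir.endsWith("/")
                ? outputDir.substring(0, outputDir.length() - 1)
                : outputDir;
        return base + "/" + DEFAULT_LOG_DIR_NAME;
    }

    public static void main(String[] argv) {
        System.out.println(resolveLogDir(
                Arrays.asList("-log", "hdfs://server:1234/my/log"), "/user/out"));
        System.out.println(resolveLogDir(Arrays.<String>asList(), "/user/out"));
    }
}
```

With no -log argument the logs land next to the job output, so cleaning up the output directory also cleans up the logs.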
The chunk of code that looks like:
+ String logURL = "hdfs://" + fsConfig.get("fs.default.name", "local") +
+     archiveDirectory + "distcpLogDir." +
+     System.currentTimeMillis();
is problematic, since it won't construct useful URLs for a wide variety of
values of fs.default.name.
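
To make the problem concrete, here is a small self-contained sketch that repeats the same concatenation for a few plausible fs.default.name values. The class name, the helper, and the sample values ("namenode:8020", "/archive/") are assumptions chosen for illustration; only the concatenation itself mirrors the patch.

```java
// Hypothetical demo: reproduces the patch's string concatenation so the
// resulting "URLs" can be inspected for different fs.default.name values.
final class PatchLogUrlDemo {
    static String buildLogUrl(String fsDefaultName, String archiveDirectory, long timestamp) {
        return "hdfs://" + fsDefaultName + archiveDirectory + "distcpLogDir." + timestamp;
    }

    public static void main(String[] argv) {
        String[] samples = {
            "namenode:8020",        // bare host:port, the only case that works
            "local",                // the patch's own fallback value
            "hdfs://namenode:8020", // already a full URI, so the scheme is doubled
            "file:///"              // a non-HDFS file system
        };
        for (String name : samples) {
            System.out.println(name + " -> " + buildLogUrl(name, "/archive/", 42L));
        }
    }
}
```

Only the bare host:port value yields a usable HDFS URL. "local" turns into a host name, and any value that already carries a scheme ends up with two of them.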
> Distcp should log to specified location
> ---------------------------------------
>
> Key: HADOOP-1533
> URL: https://issues.apache.org/jira/browse/HADOOP-1533
> Project: Hadoop
> Issue Type: Improvement
> Components: util
> Reporter: Senthil Subramanian
> Fix For: 0.14.0
>
> Attachments: HADOOP-1533_1.patch
>
>
> Distcp errors are currently not logged anywhere (other than reporter.setStatus,
> which gets overwritten). Add support for specifying a log URI where the log files
> should go.