ClusterDumper writes to System.out or local filesystem only (I would like to write to s3 when running on Elastic MapReduce)
---------------------------------------------------------------------------------------------------------------------------

                 Key: MAHOUT-701
                 URL: https://issues.apache.org/jira/browse/MAHOUT-701
             Project: Mahout
          Issue Type: Bug
          Components: Utils
         Environment: Amazon Elastic MapReduce
            Reporter: Dave Lewis
            Priority: Minor

At the end of a kmeans job on EMR, I like to look through the clusters. Unfortunately, ClusterDumper writes only to System.out or to a local file, and a local file is lost once the cluster is shut down.

I added a small conditional that checks whether the output filename starts with s3n://. If it does, ClusterDumper opens a FileSystem and writes to S3 instead of the local filesystem, so the output file is still available after the cluster is shut down.

I am creating the patch now (tests are still running from the change in MAHOUT-700), so I will add the patch to this issue shortly.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
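
The conditional described in this report could look roughly like the sketch below. It is not the attached patch: the class DumpOutput, the RemoteOpener hook, and the method names are hypothetical and were chosen only so the example compiles without Hadoop on the classpath. In a real ClusterDumper change, the hook would presumably be backed by Hadoop's FileSystem, as noted in the comment.

```java
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.net.URI;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper for illustration; not part of Mahout. */
final class DumpOutput {

  /** Hypothetical hook standing in for a remote (non-local) filesystem. */
  interface RemoteOpener {
    OutputStream create(URI uri) throws IOException;
  }

  private DumpOutput() {}

  /** True when the output name uses the s3n:// scheme mentioned in the report. */
  static boolean isS3(String outputFile) {
    return outputFile != null && outputFile.startsWith("s3n://");
  }

  /** Picks System.out, the remote hook, or a local file, in that order. */
  static Writer open(String outputFile, RemoteOpener remote) throws IOException {
    OutputStream out;
    if (outputFile == null) {
      // No output file given: fall back to standard output.
      out = System.out;
    } else if (isS3(outputFile)) {
      // Assumption: with Hadoop available, the hook could be implemented as
      //   uri -> FileSystem.get(uri, conf).create(new Path(uri))
      out = remote.create(URI.create(outputFile));
    } else {
      // Anything else is treated as a local path.
      out = new FileOutputStream(outputFile);
    }
    return new OutputStreamWriter(out, StandardCharsets.UTF_8);
  }
}
```

Keeping the scheme check in one small method means the existing System.out and local-file behaviour is untouched, and only names beginning with s3n:// take the new branch. A more general variant might hand every name to Hadoop's FileSystem and let the URI scheme select the implementation, but that goes beyond what this report describes.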