[ https://issues.apache.org/jira/browse/HAMA-750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Edward J. Yoon updated HAMA-750:
--------------------------------
Status: Open (was: Patch Available)
Canceling the patch because of the following report:
{code}
OK. I tested it on my 8-core laptop. It seems that the problem with comma
separated HDFS files in distributed mode has not been fixed yet:
FileInputFormat.setInputPaths(job,"hdfs://localhost:9000/user/fegaras/tests/data/orders.tbl,hdfs://localhost:9000/user/fegaras/tests/data/customer.tbl");
I get the error:
java.net.URISyntaxException: Relative path in absolute URI: localhost:9000
So I can't do joins.
Queries that work on a single input file work fine in distributed mode. Their
runtime on my laptop is comparable to that of Hama 0.5.0.
{code}
> Determine the path of partition files
> -------------------------------------
>
> Key: HAMA-750
> URL: https://issues.apache.org/jira/browse/HAMA-750
> Project: Hama
> Issue Type: Bug
> Components: bsp core
> Reporter: Edward J. Yoon
> Assignee: Edward J. Yoon
> Fix For: 0.7.0
>
> Attachments: HAMA-750.patch
>
>
> The parent directory of the input file is used as the base directory for
> partition files. This breaks when the input consists of multiple files.
> {code}
> protected BSPJob partition(BSPJob job, int maxTasks) throws IOException {
>   String inputPath = job.getConfiguration().get(Constants.JOB_INPUT_DIR);
>   Path inputDir = new Path(inputPath);
>   if (fs.isFile(inputDir)) {
>     inputDir = inputDir.getParent();
>   }
>   Path partitionDir = new Path(inputDir + "/partitions");
>   if (fs.exists(partitionDir)) {
>     fs.delete(partitionDir, true);
>   }
> {code}
> We can simply create the partitions in a temporary directory instead, for
> example: /tmp/hama-partitions/{$JOB_NAME}/
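
The proposal above could be sketched roughly as follows. This is only an illustration using plain strings, not the attached patch and not Hama's API: the class name PartitionDirSketch, the helpers splitInputPaths and partitionDir, and the rule for sanitizing the job name are all hypothetical. The point is that the partition directory depends only on the job name, so it no longer matters whether the input value names one file or several comma-separated files.

```java
import java.util.ArrayList;
import java.util.List;

public class PartitionDirSketch {
    // Base directory taken from the example in the issue description.
    static final String PARTITION_BASE = "/tmp/hama-partitions";

    /** Splits a comma-separated input value into individual path strings (hypothetical helper). */
    static List<String> splitInputPaths(String inputValue) {
        List<String> paths = new ArrayList<>();
        for (String part : inputValue.split(",")) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) {
                paths.add(trimmed);
            }
        }
        return paths;
    }

    /** Builds the partition directory from the job name alone (hypothetical helper). */
    static String partitionDir(String jobName) {
        // Assumption: characters that are awkward in a directory name become '_'.
        String safeName = jobName.replaceAll("[^A-Za-z0-9._-]", "_");
        return PARTITION_BASE + "/" + safeName + "/";
    }

    public static void main(String[] args) {
        String input = "hdfs://localhost:9000/user/fegaras/tests/data/orders.tbl,"
            + "hdfs://localhost:9000/user/fegaras/tests/data/customer.tbl";
        // Each input file is handled separately; none of them decides where partitions go.
        for (String path : splitInputPaths(input)) {
            System.out.println("input: " + path);
        }
        System.out.println("partitions: " + partitionDir("join query"));
    }
}
```

A real implementation would presumably build a Hadoop Path from this string and delete any stale directory first, as the existing partition() method does. Splitting on commas also assumes that no input path contains a comma itself.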