[ 
https://issues.apache.org/jira/browse/HAMA-750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Edward J. Yoon updated HAMA-750:
--------------------------------

    Status: Open  (was: Patch Available)

Canceling because ..

{code}
OK. I tested it on my 8-core laptop. It seems that the problem with comma 
separated HDFS files in distributed mode has not been fixed yet:

FileInputFormat.setInputPaths(job,"hdfs://localhost:9000/user/fegaras/tests/data/orders.tbl,hdfs://localhost:9000/user/fegaras/tests/data/customer.tbl");


I get the error:
java.net.URISyntaxException: Relative path in absolute URI: localhost:9000

So I can't do joins.
Queries that work on a single input file work fine in distributed mode. Their 
runtime on my laptop is comparable to that of Hama 0.5.0.
{code}
                
> Determine the path of partition files
> -------------------------------------
>
>                 Key: HAMA-750
>                 URL: https://issues.apache.org/jira/browse/HAMA-750
>             Project: Hama
>          Issue Type: Bug
>          Components: bsp core
>            Reporter: Edward J. Yoon
>            Assignee: Edward J. Yoon
>             Fix For: 0.7.0
>
>         Attachments: HAMA-750.patch
>
>
> The parent directory of input file was used to determine the path of base 
> directory for partition files. The problem is when input is multiple files.
> {code}
>   protected BSPJob partition(BSPJob job, int maxTasks) throws IOException {
>     String inputPath = job.getConfiguration().get(Constants.JOB_INPUT_DIR);
>     Path inputDir = new Path(inputPath);
>     if (fs.isFile(inputDir)) {
>       inputDir = inputDir.getParent();
>     }
>     Path partitionDir = new Path(inputDir + "/partitions");
>     if (fs.exists(partitionDir)) {
>       fs.delete(partitionDir, true);
>     }
> {code}
> Simply we can create partitions on temp directory. For example, 
> /tmp/hama-partitions/{$JOB_NAME}/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to