Hello,

> 1) What is the status of the project, specifically the graph processing part
> (Angrapa?). Is it sufficiently stable to be used? Although this is an
> academic research project, it would be better to work on a stable one.

At present, we're focussing on a framework for more general-purpose BSP computing, so we are still far from a graph processing framework such as Google Pregel. We have a release plan for version 0.2.0 and we're working on it. Release 0.2.0 will include:

* BSP computing framework (no fault-tolerance mechanism, no data input-output API)
* and its examples

> 2) I haven't come across any installation/building steps for Hama. How to
> integrate with HDFS/HBase?

We'll create an input-output system that can be used to process data. You can think of it as an M/R computing framework on HDFS/HBase.

> 3) Are there more extensive performance tests say w.r.t the latest branch of
> development? Do they have better performance?

Not yet.

> 4) Can the data assigned to each partition (cluster) be split according to
> some condition i.e. can it be controlled unlike a MR split?

Do you mean whether a task can be assigned to slaves according to some other condition (i.e., not based on data locality)? If so, no. All splits should be loaded and computed locally. Otherwise, it would cause a huge and pointless data-copy overhead among servers.

Thanks :)

On Fri, Sep 10, 2010 at 7:09 AM, Raghava Mutharaju <[email protected]> wrote:
> Hi all,
>
> I am working on a research project where I faced the issues that formed the
> motivation for Hama (Hamburg) -- the splits in the data depend on each other
> and data locality issue in case of multiple MR iterations. I was thinking of
> checking other alternatives to MR when I came across Hama. I am in the
> process of checking whether Hama would fit our project needs and I need your
> help in that regard.
>
> I am interested in the graph processing part of Hama.
>
> I have the following questions
>
> 1) What is the status of the project, specifically the graph processing part
> (Angrapa?). Is it sufficiently stable to be used? Although this is an
> academic research project, it would be better to work on a stable one.
> 2) I haven't come across any installation/building steps for Hama. How to
> integrate with HDFS/HBase?
> 3) Are there more extensive performance tests say w.r.t the latest branch of
> development? Do they have better performance?
> 4) Can the data assigned to each partition (cluster) be split according to
> some condition i.e. can it be controlled unlike a MR split?
>
> Thank you.
>
> Regards,
> Raghava.
>

--
Best Regards, Edward J. Yoon
[email protected]
http://blog.udanax.org
