Re: evaluating Hama

Edward J. Yoon Thu, 09 Sep 2010 19:48:16 -0700

Hello,

> 1) What is the status of the project, specifically the graph processing part
> (Angrapa?). Is it sufficiently stable to be used? Although this is an
> academic research project, it would be better to work on a stable one.


At present, we're focusing on a framework for more general-purpose
BSP computing, so it is still far from being a graph processing
framework such as Google Pregel.

We have a release plan for version 0.2.0 and we're working on it.
Release 0.2.0 will include:

 * BSP computing framework (no fault-tolerance mechanism, no data
input-output API)
 * and its examples

> 2) I haven't come across any installation/building steps for Hama. How to
> integrate with HDFS/HBase?

We'll create an input-output system that can be used to process data.
You can think of it as an M/R computing framework on HDFS/HBase.

> 3) Are there more extensive performance tests say w.r.t the latest branch of
> development? Do they have better performance?

Not yet.

> 4) Can the data assigned to each partition (cluster) be split according to
> some condition i.e. can it be controlled unlike a MR split?

Do you mean whether it can assign tasks to slaves according to some
other condition (not based on data locality)? If so, no.

All splits should be loaded and computed locally. Otherwise, it
will cause a huge and pointless data-copy overhead among servers.
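
To put a rough number on that overhead, here is a minimal Python
sketch. It is not Hama code: the function bytes_copied, the host
names, and the 64 MB split size are all assumptions made up for
illustration. It only counts how many bytes would have to cross the
network under a given split-to-host assignment.

```python
# Hypothetical illustration only; none of these names come from Hama.

def bytes_copied(split_hosts, split_sizes, assignment):
    """Return the bytes that must cross the network for an assignment.

    split_hosts: split id -> host that stores the split
    split_sizes: split id -> size of the split in bytes
    assignment:  split id -> host that runs the task for the split
    """
    return sum(
        split_sizes[split]
        for split, host in assignment.items()
        if host != split_hosts[split]  # remote task: split must be copied
    )


# Assumed cluster layout: three splits, one per node, 64 MB each.
split_hosts = {"s0": "node1", "s1": "node2", "s2": "node3"}
split_sizes = {split: 64 * 1024 * 1024 for split in split_hosts}

# Locality-based assignment: each task runs where its split is stored.
local = dict(split_hosts)

# Assignment by some other condition: here every task ends up remote.
shifted = {"s0": "node2", "s1": "node3", "s2": "node1"}

local_cost = bytes_copied(split_hosts, split_sizes, local)
shifted_cost = bytes_copied(split_hosts, split_sizes, shifted)
```

With the local assignment nothing is copied. With the shifted one, all
three splits (192 MB in this made-up setup) have to move between
servers before any computation can start.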

Thanks :)

On Fri, Sep 10, 2010 at 7:09 AM, Raghava Mutharaju
<[email protected]> wrote:
> Hi all,
>
> I am working on a research project where I faced the issues that formed the
> motivation for Hama (Hamburg) -- the splits in the data depend on each other
> and data locality issue in case of multiple MR iterations. I was thinking of
> checking other alternatives to MR when I came across Hama. I am in the
> process of checking whether Hama would fit our project needs and I need your
> help in that regard.
>
> I am interested in the graph processing part of Hama.
>
> I have the following questions
>
> 1) What is the status of the project, specifically the graph processing part
> (Angrapa?). Is it sufficiently stable to be used? Although this is an
> academic research project, it would be better to work on a stable one.
> 2) I haven't come across any installation/building steps for Hama. How to
> integrate with HDFS/HBase?
> 3) Are there more extensive performance tests say w.r.t the latest branch of
> development? Do they have better performance?
> 4) Can the data assigned to each partition (cluster) be split according to
> some condition i.e. can it be controlled unlike a MR split?
>
> Thank you.
>
> Regards,
> Raghava.
>



-- 
Best Regards, Edward J. Yoon
[email protected]
http://blog.udanax.org
