Hello

Is there any information (post/wiki page) on how data locality works in hbase. From the documentation in the site I was able to spot following paragraph in the "Old Road Maps" section at url http://wiki.apache.org/hadoop/HBase/RoadMaps

Data-Locality Awareness The Hadoop map reduce framework does -------------------- in network I/O.

I am looking for answer of the following:

1) when using hbase, does the jobs end up where the data is stored (I will guess so); if yes then how is it done (links to related packages/ pointers).

2) Is it possible to find out where the data resides (the way one could do when using hadoop file system directly).


Also with respect to the performance charts included in Raghu's keynote at LADIS 2009 (slide 84-87)

http://www.cs.cornell.edu/projects/ladis2009/talks/ramakrishnan-keynote-ladis2009.pdf

Do we have numbers for latest releases (or these numbers are still valid for newer releases as well). The latency numbers look pretty bad for hbase.

thanks a lot




Reply via email to