Hello,
Is there any information (a post or wiki page) on how data locality
works in HBase? In the documentation on the site I was only able to find
the following paragraph, in the "Old Road Maps" section at http://wiki.apache.org/hadoop/HBase/RoadMaps
"Data-Locality Awareness: The Hadoop map reduce framework does
... in network I/O."
I am looking for answers to the following:
1) When using HBase, do the jobs end up running where the data is
stored (I would guess so)? If yes, how is it done (links to related
packages or other pointers would help)?
2) Is it possible to find out where the data resides (the way one
can when using the Hadoop file system directly)?
Also, regarding the performance charts included in Raghu's
keynote at LADIS 2009 (slides 84-87):
http://www.cs.cornell.edu/projects/ladis2009/talks/ramakrishnan-keynote-ladis2009.pdf
Do we have numbers for the latest releases, or are these numbers still
valid for newer releases as well? The latency numbers look pretty bad
for HBase.
Thanks a lot