Architectural overview of HBase internals with description of conceptual gulf
between HBase and HDFS
----------------------------------------------------------------------------------------------------
Key: HBASE-1991
URL: https://issues.apache.org/jira/browse/HBASE-1991
Project: Hadoop HBase
Issue Type: Improvement
Components: documentation
Affects Versions: 0.20.0
Reporter: Doug Meil
One of the conceptual gulfs that needs addressing in HBase documentation is
that if people are looking at the Hadoop website, they will read about HDFS
that it is for (paraphrasing) "high throughput but does not promise low latency
and is not suited for random reads."
HBase runs on top of HDFS, and it promises both low-latency *and* random reads.
How?
I'm not disputing that HBase does it... but not much is written down anywhere
other than references to "caching."
Lars George put together a great page on some of the HBase file structures as
they are stored in HDFS. Information like that would be useful to have in the
HBase documentation, etc.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.