On 17/03/11 07:05, Matthew John wrote:
Hi,

Can someone provide me some pointers on the following details of Hadoop code base:

1) breakdown of HDFS code base (approximate lines of code) into following modules:
- HDFS at the Datanodes
- Namenode
- Zookeeper
- MapReduce based
- Any other relevant split

2) breakdown of Hbase code into following modules:
- HMaster
- RegionServers
- MapReduce
- Any other relevant split

You are free to check out the source code and run whatever analysis you want. You can also look at the entire SVN history and do some really interesting analysis, especially if you have data mining tooling to hand, such as a small Hadoop cluster.
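
For a rough per-module line count, something like the following sketch could be a starting point once you have a checkout. It only counts non-blank lines in .java files and groups them by a path fragment. The function name `count_java_lines`, the module labels, and the fragments "server/datanode" and "server/namenode" are assumptions for illustration, so check them against the directory layout of the tree you actually check out.

```python
import os
import sys
from collections import Counter


def count_java_lines(root, modules):
    """Count non-blank lines in .java files under root, grouped by module.

    modules maps a label to a path fragment (hypothetical values, adjust
    them to your checkout). A directory is assigned to the first label
    whose fragment appears in its path; everything else goes to "other".
    """
    totals = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        # Normalise separators so fragments can always be written with "/".
        rel = os.path.relpath(dirpath, root).replace(os.sep, "/")
        label = next(
            (name for name, fragment in modules.items() if fragment in rel),
            "other",
        )
        for filename in filenames:
            if not filename.endswith(".java"):
                continue
            path = os.path.join(dirpath, filename)
            with open(path, encoding="utf-8", errors="replace") as handle:
                totals[label] += sum(1 for line in handle if line.strip())
    return totals


if __name__ == "__main__":
    # Assumed fragments; verify against the real source tree.
    assumed_modules = {
        "datanode": "server/datanode",
        "namenode": "server/namenode",
    }
    checkout = sys.argv[1] if len(sys.argv) > 1 else "."
    for label, lines in count_java_lines(checkout, assumed_modules).most_common():
        print(f"{label}: {lines}")
```

Save it under any name and pass the path of the checkout as the first argument. The numbers include comments and licence headers, so treat them as approximate; a dedicated line-counting tool will give a finer breakdown.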
