Recently on IRC, there was a request for a list of projects/areas that
new hbase developers could dig into. I'm posting one such list here
in case others were wondering the same:
1. InfoServers' UI extension/revamp: Expose more server attributes and
counts, lists of regions by table, graphing of regionserver loading
history, etc.
2. Help with mapfile rewrite: Experiment with
algorithms/formats/compression to improve the fundamental hbase building
block
3. Write a Pig loader for hbase and Pig Latin scripts to do bulk table
upload and backup
4. Improve hbase performance/latency (core dev is currently focused on
robustness and scaling).
5. Build an hbsck tool to run common repairs on an hbase install.
Develop referential integrity to support this tool (the tool could
be written in Java or even in Groovy; TBD).
6. Write a clone of the Google App Engine Python datastore module
Ask here (or on IRC) if you have any questions about the above.
Thanks,
St.Ack