Hello everyone in the mailing list. I want to introduce an open-source library Haeinsa which supports multi-row, multi-table transaction on HBase. Haeinsa is client-only library that support serializability and linear scalability. Here is the github repository and presentation that describes mechanism of it.
github : https://github.com/VCNC/haeinsa presentation : https://speakerdeck.com/vcnc/haeinsa-overview It was inspired by Google's percolator, but implementation detail is different. It use two-phase commit protocol and optimistic concurrency control to implement, and Haeinsa now processes more than 300M+ transactions per day in single cluster without any consistency problem for more than 2 month. I know that there has been lots of libraries and papers for HBase transaction, but within my knowledge, this is the only open-source library which support serializability and linear scalability. There is no theoretical limit of transaction throughput. It was tested against cluster on AWS until 40,000 transaction/sec. (Still testing on bigger cluster) If you find it interesting, please leave me comment. Thanks, Andrew Kim
