Performance Question

2016-05-27 Thread Benjamin Kim
I am just curious. How will Kudu compare with Aerospike (http://www.aerospike.com)? I went to a Spark Roadshow and found out about this piece of software. It appears to fit our use case perfectly since we are an ad-tech company trying to leverage our user profiles data. Plus, it already has a S

Re: Performance Question

2016-05-27 Thread Mike Percy
Have you considered whether you have a scan heavy or a random access heavy workload? Have you considered whether you always access / update a whole row vs only a partial row? Kudu is a column store so has some awesome performance characteristics when you are doing a lot of scanning of just a couple

Re: Performance Question

2016-05-27 Thread Benjamin Kim
Hi Mike, First of all, thanks for the link. It looks like an interesting read. I checked that Aerospike is currently at version 3.8.2.3, and in the article, they are evaluating version 3.5.4. The main thing that impressed me was their claim that they can beat Cassandra and HBase by 8x for writi

Re: Performance Question

2016-05-27 Thread Todd Lipcon
On Fri, May 27, 2016 at 8:20 PM, Benjamin Kim wrote: > Hi Mike, > > First of all, thanks for the link. It looks like an interesting read. I > checked that Aerospike is currently at version 3.8.2.3, and in the article, > they are evaluating version 3.5.4. The main thing that impressed me was > the

Re: Performance Question

2016-05-27 Thread Pablo Vazquez
Hi Todd. In order to reach that performance ( +1M /sec) did you change any Kudu parameters? I read some papers about flush parameters in tablet servers. I started playing with Kudu with just 2 nodes and my best was 30k per second with 30 columns row. Nothing compared with your testing but any adv