Hi,

We are interested in querying performance of Phoenix on small to large 
datasets. We have Apache Phoenix installed on an EMR with 5 instances.


The querying times we get are slow compared to the equivalent query in hbase, 
for example:


Table Documents with primary key UUID and index on profile_id


        Apache Phoenix  Hbase
400k rows dataset: select * from documents where uuid = 10-a
        0.25 sec        0.02 sec
400k rows dataset: select profile_id from documents where uuid = 10-a
        0.20 sec        0.02 sec


Hbase seems 10x faster than Phoenix, is there some tuning we can do to achieve 
better results?
We are querying the DB programatically (scala) & also using the client sqlline.

Thank you!

Reply via email to