Tanuj Khurana created PHOENIX-7319:
--------------------------------------
Summary: Leverage Bloom Filters to improve performance on write
path
Key: PHOENIX-7319
URL: https://issues.apache.org/jira/browse/PHOENIX-7319
Project: Phoenix
Issue Type: Improvement
Affects Versions: 5.2.0
Reporter: Tanuj Khurana
Assignee: Tanuj Khurana
Fix For: 5.2.1, 5.3.0
On the write path if the write is an atomic upsert or if the table has one or
more indexes Phoenix first does a read. All these reads on the data table are
point lookups. Bloom Filters can help optimize the performance of these
lookups.
* For new rows (inserts), the point lookup will not return any result. This
negative lookup is ideal for bloom filters as our read can return by just
checking the bloom filter block.
* For updates, since new updates get accumulated into memstore and then
flushed into new store files. A region can have multiple store files and when
doing a read we have to read multiple store files. Bloom filter can help
eliminate which store files should be read.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)