Duo Zhang created HBASE-20525:
---------------------------------

             Summary: Refactoring the code of read path
                 Key: HBASE-20525
                 URL: https://issues.apache.org/jira/browse/HBASE-20525
             Project: HBase
          Issue Type: Umbrella
          Components: scan
            Reporter: Duo Zhang
             Fix For: 3.0.0


The known problems of the current implementation:

1. 'Seek or skip' should be decided at StoreFileScanner level, not StoreScanner.
2. As now we support creating multiple StoreFileReader instances for a single 
HFile, we do not need to load the file info and other meta infos every time 
when creating a new StoreFileReader instance.
3. 'Pread or stream' should be decided at StoreFileScanner level, not 
StoreScanner.
4. Make sure that we can return at any point during a scan, at least when 
filterRowKey we can not stop until we reach the next row, no matter how many 
cells we need to skip...
5. Doing bytes comparing everywhere, where we need to know if there is a row 
change, a family change, a qualifier change, etc. This is a performance killer.

And the most important thing is that, the code is way too complicated now and 
become out of control...

This should be done before our 3.0.0 release.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to