[ https://issues.apache.org/jira/browse/HBASE-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15763176#comment-15763176 ]
Anoop Sam John commented on HBASE-16421: ---------------------------------------- Again sorry for me not being clear I dont mean that the way Cell pojos will get created be different. What I mean is this In case of read path of reading from off heap cache or from HDFS fetch block data, this creation of Cells are a must. And this was happening from day one. So there is no diff happened wrt that in off heaping read path work. So no possible perf impact because of that But write to memstore and read from there case is diff.. Till now we have Cell objects as such as long as those are in memstore. (Active segment or Immutable segment). So when read happens (this read can be because of user read or scan for in memory compaction / disk flush) no need to create Cell objects newly. They are already in Java heap and just return those. But once we have CellChunkMap, what that flatten will do is remove these Cell objects and keep an index instead (chunk id + offset + length). So when there is a read over the segment, we will end up creating all these Cell POJOs back in heap. There wont be any data copy. Just POJO creation. So u can see the diff it is going to make from what we have today and all these days. Am not saying this will kill perf. This is surely an added pressure wrt more garbage. But surely the CellChunkMap helps with avoiding heap pressure as it eliminate the need to keep Cells (As long no read happens).. So there is +ve and -ve sides. What I say is we just need to make sure the +ve side is more so that overall we get benefit.. I guess we will get. Just to make sure. So before splitting the code into patches and spending time in making it and review and rework on those, a small PoC and testing these with more data and load (the way ur recent tests did) can help getting these perf numbers. U might be having most part of the code CellChunkMap.. Any thing u need from us, we can provide as a PoC patch too. Just to make sure no effort will go waste am saying this. Again am NOT saying CellChunkMap will make perf impact and am NOT against doing it. Just want to make sure we know the perf numbers before doing actual patches. U get me now? :-) > Introducing the CellChunkMap as a new additional index variant in the MemStore > ------------------------------------------------------------------------------ > > Key: HBASE-16421 > URL: https://issues.apache.org/jira/browse/HBASE-16421 > Project: HBase > Issue Type: Umbrella > Reporter: Anastasia Braginsky > Attachments: CellChunkMapRevived.pdf, > IntroductiontoNewFlatandCompactMemStore.pdf > > > Follow up for HBASE-14921. This is going to be the umbrella JIRA to include > all the parts of integration of the CellChunkMap to the MemStore. -- This message was sent by Atlassian JIRA (v6.3.4#6332)