Ted Yu created HBASE-20196:
------------------------------

Summary: Maintain all regions with same size in memstore flusher
Key: HBASE-20196
URL: https://issues.apache.org/jira/browse/HBASE-20196
Project: HBase
Issue Type: Improvement
Reporter: Ted Yu
Assignee: Ted Yu

Here is the javadoc for getCopyOfOnlineRegionsSortedByOffHeapSize():
{code}
   * the biggest. If two regions are the same size, then the last one found wins; i.e. this
   * method may NOT return all regions.
{code}
Currently the value type of the Map is HRegion, so we only store one region per size.
I think we should change the value type to Collection<HRegion> so that we don't miss any region (potentially one with a big size).

e.g. Suppose there are three regions (R1, R2 and R3) with sizes 100, 100 and 1, respectively.
Using the current data structure, R2 would be stored in the Map, evicting R1 from the Map.
This means that the current code would choose to flush regions R2 and R3, releasing 101 from memory.
If the value type is changed to Collection<HRegion>, we would flush both R1 and R2, releasing 200.
This achieves faster memory reclamation.

Confirmed with [~eshcar] over in HBASE-20090
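
The R1/R2/R3 example can be reproduced with a small standalone sketch. This is not the HBase code: FlushCandidates, Region and the method names below are hypothetical stand-ins, and the sketch assumes the regions are kept in a TreeMap ordered from largest to smallest size and that the flusher picks the first two candidates.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.List;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical illustration only; Region stands in for HRegion.
final class FlushCandidates {

  static final class Region {
    final String name;
    final long size;

    Region(String name, long size) {
      this.name = name;
      this.size = size;
    }
  }

  // Current shape: one region per size, so a later region of equal size
  // replaces the earlier one.
  static SortedMap<Long, Region> bySizeSingle(List<Region> regions) {
    SortedMap<Long, Region> sorted = new TreeMap<>(Collections.<Long>reverseOrder());
    for (Region r : regions) {
      sorted.put(r.size, r);
    }
    return sorted;
  }

  // Proposed shape: every region of a given size is kept.
  static SortedMap<Long, Collection<Region>> bySizeMulti(List<Region> regions) {
    SortedMap<Long, Collection<Region>> sorted = new TreeMap<>(Collections.<Long>reverseOrder());
    for (Region r : regions) {
      sorted.computeIfAbsent(r.size, k -> new ArrayList<>()).add(r);
    }
    return sorted;
  }

  // Total size released when flushing the `count` biggest regions found
  // in the single-valued map.
  static long reclaimedSingle(List<Region> regions, int count) {
    long total = 0;
    int picked = 0;
    for (Region r : bySizeSingle(regions).values()) {
      if (picked == count) {
        break;
      }
      total += r.size;
      picked++;
    }
    return total;
  }

  // Same selection, but over the multi-valued map.
  static long reclaimedMulti(List<Region> regions, int count) {
    long total = 0;
    int picked = 0;
    for (Collection<Region> sameSize : bySizeMulti(regions).values()) {
      for (Region r : sameSize) {
        if (picked == count) {
          return total;
        }
        total += r.size;
        picked++;
      }
    }
    return total;
  }

  public static void main(String[] args) {
    List<Region> regions = Arrays.asList(
        new Region("R1", 100), new Region("R2", 100), new Region("R3", 1));
    System.out.println("single-valued map releases " + reclaimedSingle(regions, 2)); // 101
    System.out.println("multi-valued map releases " + reclaimedMulti(regions, 2));   // 200
  }
}
```

With the single-valued map R1 is overwritten by R2, so the two candidates are R2 and R3. With the multi-valued map both R1 and R2 are visible and get picked first.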