[ https://issues.apache.org/jira/browse/HBASE-15205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159305#comment-15159305 ]
ramkrishna.s.vasudevan commented on HBASE-15205: ------------------------------------------------ bq.What is 'bulk load replication'.... Can you say more on the scenario you describe above on bulk loading and replication? We have a feature for bulk load replication. In case of where there is no WAL based 'put' and only bulk loaded files we read the WAL for the WALEdit markers that denotes bulk load and reading those edits we do the replicaiton of the bulk loaded files. If the bulk load replication feature is not enabled then there is no need for the Logkey to be set with replication scopes. That is what I meant there. > Do not find the replication scope for every WAL#append() > -------------------------------------------------------- > > Key: HBASE-15205 > URL: https://issues.apache.org/jira/browse/HBASE-15205 > Project: HBase > Issue Type: Sub-task > Components: regionserver > Reporter: ramkrishna.s.vasudevan > Assignee: ramkrishna.s.vasudevan > Priority: Minor > Fix For: 2.0.0 > > Attachments: HBASE-15204_6.patch, HBASE-15205.patch, > HBASE-15205_1.patch, HBASE-15205_2.patch, HBASE-15205_3.patch, > HBASE-15205_4.patch, HBASE-15205_6.patch, HBASE-15205_6.patch, > HBASE-15205_7.patch, ScopeWALEdits.jpg, ScopeWALEdits_afterpatch.jpg > > > After the byte[] and char[] the other top contributor for lot of GC (though > it is only 2.86%) is the UTF_8.newDecoder. > This happens because for every WAL append we try to calculate the replication > scope associate with the families associated with the TableDescriptor. I > think per WAL append doing this is very costly and creates lot of garbage. -- This message was sent by Atlassian JIRA (v6.3.4#6332)