[ https://issues.apache.org/jira/browse/HBASE-12853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14282337#comment-14282337 ]
Michael Segel commented on HBASE-12853: ---------------------------------------- Sure... Just a couple of things... 1) I would like to make sure I understand the split between client/server in HBase works the way I think it does. 2) I get some free time. (Day Job, conference talks, R&D, ...) This is one issue that is specific to HBase and doesn't conflict with any prior work I may have done. > distributed write pattern to replace ad hoc 'salting' > ----------------------------------------------------- > > Key: HBASE-12853 > URL: https://issues.apache.org/jira/browse/HBASE-12853 > Project: HBase > Issue Type: New Feature > Reporter: Michael Segel > Priority: Minor > > In reviewing HBASE-11682 (Description of Hot Spotting), one of the issues is > that while 'salting' alleviated regional hot spotting, it increased the > complexity required to utilize the data. > Through the use of coprocessors, it should be possible to offer a method > which distributes the data on write across the cluster and then manages > reading the data returning a sort ordered result set, abstracting the > underlying process. > On table creation, a flag is set to indicate that this is a parallel table. > On insert in to the table, if the flag is set to true then a prefix is added > to the key. e.g. <region server#>- or <region server #|| where the region > server # is an integer between 1 and the number of region servers defined. > On read (scan) for each region server defined, a separate scan is created > adding the prefix. Since each scan will be in sort order, its possible to > strip the prefix and return the lowest value key from each of the subsets. -- This message was sent by Atlassian JIRA (v6.3.4#6332)