[ 
https://issues.apache.org/jira/browse/HADOOP-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559105#action_12559105
 ] 

Tom White commented on HADOOP-2604:
-----------------------------------

The current design of MapFile.Reader makes it difficult to write an in-memory 
implementation. For example, to implement next() it's no good having a copy of 
the keys and values in memory as you can't copy their values into the Writables 
passed into the next method. Perhaps Writable should have a 
readFields(Writable) method? Or maybe the API should change.

To write an in-memory implementation with the current design, I think you would 
need to do it at a lower level and hold the data file bytes in memory. Keys and 
values would be reconstructed each time next() or get() was called, so this 
would be less efficient than an implementation that cached keys and values.

> [hbase] Create an HBase-specific MapFile implementation
> -------------------------------------------------------
>
>                 Key: HADOOP-2604
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2604
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: contrib/hbase
>            Reporter: Bryan Duxbury
>            Priority: Minor
>
> Today, HBase uses the Hadoop MapFile class to store data persistently to 
> disk. This is convenient, as it's already done (and maintained by other 
> people :). However, it's beginning to look like there might be possible 
> performance benefits to be had from doing an HBase-specific implementation of 
> MapFile that incorporated some precise features.
> This issue should serve as a place to track discussion about what features 
> might be included in such an implementation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to