[
https://issues.apache.org/jira/browse/ACCUMULO-418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Keith Turner updated ACCUMULO-418:
----------------------------------
Fix Version/s: (was: 1.4.1)
(was: 1.3.6)
Assignee: Keith Turner (was: Eric Newton)
I suspect this will require modifications to the file format to make it more of
a local operation. With the current file format, the index (at the end of the
file and possibly in a remote block) must be read.
> Make RFiles splittable
> ----------------------
>
> Key: ACCUMULO-418
> URL: https://issues.apache.org/jira/browse/ACCUMULO-418
> Project: Accumulo
> Issue Type: New Feature
> Components: master, tserver
> Affects Versions: 1.3.5, 1.4.0, 1.5.0-SNAPSHOT
> Environment: All
> Reporter: Ivan Bella
> Assignee: Keith Turner
> Labels: RFile, hadoop, mapreduce
> Fix For: 1.5.0
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> There are times when iterating over RFiles is useful in map-reduce jobs. I
> know that RFiles logically can be split on the block boundary, however there
> is no easy way to do this currently as there is no RFile RecordReader or
> InputFormat provided.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira