[ 
https://issues.apache.org/jira/browse/HBASE-13356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512866#comment-14512866
 ] 

Ted Yu commented on HBASE-13356:
--------------------------------

MultiTableSnapshotInputFormat.java and MultiTableSnapshotInputFormatImpl.java 
need Apache license. Add annotation for audience.

There're several long lines - please limit line width to 100 characters.
{code}
125        *  Sets up the job for reading from one or more multiple table 
snapshots, with one or more scan per snapshot.
{code}
Should 'one or more multiple table snapshots' be 'one or more table snapshots' ?
nit: 'one or more scan' -> 'one or more scans'
{code}
26      public class MultiTableSnapshotInputFormatImpl {
27      
28        private static final Log LOG = 
LogFactory.getLog(MultiTableSnapshotInputFormat.class);
{code}
Classname for LOG doesn't match the real classname.
{code}
85              for (TableSnapshotInputFormatImpl.InputSplit split : splits) {
86                rtn.add(split);
87              }
{code}
Can you use 
https://docs.oracle.com/javase/7/docs/api/java/util/List.html#addAll(java.util.Collection)
 ?
{code}
177       private Map<String, Path> 
generateSnapshotToRestoreDir(Collection<String> snapshots, Path baseRestoreDir) 
{
{code}
Name the method generateSnapshotToRestoreDirMapping().


> HBase should provide an InputFormat supporting multiple scans in mapreduce 
> jobs over snapshots
> ----------------------------------------------------------------------------------------------
>
>                 Key: HBASE-13356
>                 URL: https://issues.apache.org/jira/browse/HBASE-13356
>             Project: HBase
>          Issue Type: New Feature
>          Components: mapreduce
>            Reporter: Andrew Mains
>            Assignee: Andrew Mains
>            Priority: Minor
>         Attachments: HBASE-13356.patch
>
>
> Currently, HBase supports the pushing of multiple scans to mapreduce jobs 
> over live tables (via MultiTableInputFormat) but only supports a single scan 
> for mapreduce jobs over table snapshots. It would be handy to support 
> multiple scans over snapshots as well, probably through another input format 
> (MultiTableSnapshotInputFormat?). To mimic the functionality present in 
> MultiTableInputFormat, the new input format would likely have to take in the 
> names of all snapshots used in addition to the scans.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to