[jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

Amareshwari Sriramadasu (JIRA) Fri, 07 Aug 2009 04:46:42 -0700

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12740516#action_12740516
 ]


Amareshwari Sriramadasu commented on MAPREDUCE-370:
---------------------------------------------------

bq. To achieve this, I think we could port MultipleOutputs, and change the 
semantics of getCollector() in the multi name case, so that the multi name is 
the full name of the name of the output file. This method is typically invoked 
in the reduce() method, where the key and value are available, and can be used 
to form the name.

If we do this, this will remove generate* methods from the api proposed. And 
api for writing would look like :
{code}
public <K,V> void write(String namedOutput,  K key, V value, String outputPath)
          throws IOException, InterruptedException;
public <K,V> void write(String namedOutput,  K key, V value)
          throws IOException, InterruptedException;
public <K,V> void write( K key, V value, String outputPath)
          throws IOException, InterruptedException;
{code}

let me know if this looks fine.

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> -------------------------------------------------------------------
>
>                 Key: MAPREDUCE-370
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>            Reporter: Amareshwari Sriramadasu
>            Assignee: Amareshwari Sriramadasu
>         Attachments: patch-370.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

Reply via email to