[ 
https://issues.apache.org/jira/browse/SQOOP-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14284448#comment-14284448
 ] 

Hari Shreedharan commented on SQOOP-1938:
-----------------------------------------

We'd lose some of the benefits, yes, because you'd have to wait for all map 
tasks to be done. But even with that, we'd still be doing I/O when we write 
the intermediate output before the shuffle - correct? That is still I/O, so 
we'd still get some optimization in terms of how long the map tasks 
themselves take.
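
For reference, here is a minimal sketch of the decision being discussed. It mirrors the snippet quoted in the issue description below, but it is plain Java with no Hadoop dependency. The class and method names (ReducePhaseSketch, numReduceTasks, hasShufflePhase) are hypothetical and not part of Sqoop. The only behavior assumed is the standard MapReduce one: with zero reduce tasks the job is map-only and there is no shuffle, while with one or more reduce tasks the map output is written out as intermediate data and shuffled to the reducers.

```java
// Hypothetical sketch, not Sqoop code: models how the configured number of
// loaders decides whether the job has a reduce (and therefore shuffle) phase.
final class ReducePhaseSketch {

    // Mirrors the quoted snippet: use the configured loaders as the number
    // of reducers, or 0 (map-only) when loaders are not set at all.
    static int numReduceTasks(Integer loaders) {
        return loaders != null ? loaders : 0;
    }

    // A shuffle only happens when at least one reduce task is configured.
    // In that case the reducers cannot complete until every map task is done.
    static boolean hasShufflePhase(Integer loaders) {
        return numReduceTasks(loaders) > 0;
    }

    public static void main(String[] args) {
        Integer[] samples = {null, 0, 4};
        for (Integer loaders : samples) {
            System.out.println("loaders=" + loaders
                    + " reducers=" + numReduceTasks(loaders)
                    + " shuffle=" + hasShufflePhase(loaders));
        }
    }
}
```

Note that an explicit loader count of 0 behaves the same as an unset one in this sketch: both give a map-only job.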

> DOC:update the sqoop MR engine implementation details
> -----------------------------------------------------
>
>                 Key: SQOOP-1938
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1938
>             Project: Sqoop
>          Issue Type: Sub-task
>            Reporter: Veena Basavaraj
>            Assignee: Veena Basavaraj
>             Fix For: 1.99.5
>
>
> https://cwiki.apache.org/confluence/display/SQOOP/Sqoop+MR+Execution+Engine
> 1. Why do we need SqoopWritable, and what can be done in the future?
> 2. Even though we call Sqoop map-only, is that how it always works? What 
> happens when numLoaders is non-zero?
> {code}
>       // Set number of reducers as number of configured loaders or suppress
>       // reduce phase entirely if loaders are not set at all.
>       if(request.getLoaders() != null) {
>         job.setNumReduceTasks(request.getLoaders());
>       } else {
>         job.setNumReduceTasks(0);
>       }
> {code}
> 3. Internals of SqoopNullOutputFormat and how SqoopWritable is used in it



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
