[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-06-11 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14027861#comment-14027861
 ] 

Hudson commented on MAPREDUCE-5886:
---

FAILURE: Integrated in Hadoop-Mapreduce-trunk #1798 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1798/])
MAPREDUCE-5886. Allow wordcount example job to accept multiple input paths. 
Contributed by Chris Nauroth. (cnauroth: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1601704)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/WordCount.java


> Allow wordcount example job to accept multiple input paths.
> ---
>
> Key: MAPREDUCE-5886
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5886
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: examples
> Affects Versions: 3.0.0, 2.4.0
> Reporter: Chris Nauroth
> Assignee: Chris Nauroth
> Priority: Minor
> Fix For: 3.0.0, 2.5.0
>
> Attachments: MAPREDUCE-5886.1.patch, MAPREDUCE-5886.2.patch, 
> MAPREDUCE-5886.3.patch
>
>
> It would be convenient if the wordcount example MapReduce job could accept 
> multiple input paths and run the word count on all of them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-06-11 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14027783#comment-14027783
 ] 

Hudson commented on MAPREDUCE-5886:
---

SUCCESS: Integrated in Hadoop-Hdfs-trunk #1771 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1771/])
MAPREDUCE-5886. Allow wordcount example job to accept multiple input paths. 
Contributed by Chris Nauroth. (cnauroth: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1601704)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/WordCount.java




[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-06-11 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14027635#comment-14027635
 ] 

Hudson commented on MAPREDUCE-5886:
---

FAILURE: Integrated in Hadoop-Yarn-trunk #580 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/580/])
MAPREDUCE-5886. Allow wordcount example job to accept multiple input paths. 
Contributed by Chris Nauroth. (cnauroth: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1601704)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/WordCount.java




[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-06-10 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026771#comment-14026771
 ] 

Hadoop QA commented on MAPREDUCE-5886:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
http://issues.apache.org/jira/secure/attachment/12644677/MAPREDUCE-5886.3.patch
against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this patch.
Also please list what manual steps were performed to verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core 
hadoop-mapreduce-project/hadoop-mapreduce-examples.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4645//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4645//console

This message is automatically generated.



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-06-10 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026769#comment-14026769
 ] 

Hudson commented on MAPREDUCE-5886:
---

SUCCESS: Integrated in Hadoop-trunk-Commit #5673 (See 
[https://builds.apache.org/job/Hadoop-trunk-Commit/5673/])
MAPREDUCE-5886. Allow wordcount example job to accept multiple input paths. 
Contributed by Chris Nauroth. (cnauroth: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1601704)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/WordCount.java




[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-06-10 Thread Siddharth Seth (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026709#comment-14026709
 ] 

Siddharth Seth commented on MAPREDUCE-5886:
---

+1. The original patch looks good to me. In a subsequent JIRA, does something 
need to be done about the way addInputPath(Job, Path) eventually propagates 
the additional paths (as a comma-separated property), so that commas in file 
names are handled?
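
To make the concern concrete: when several paths are stored in a single comma-separated property value, a comma inside a file name has to be escaped on the way in and unescaped on the way out. The following is a minimal, self-contained sketch of that idea. The class CommaJoinedPaths and its methods are hypothetical names chosen for illustration; this is not Hadoop's code and makes no claim about what FileInputFormat currently does.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: stores several paths in one comma-separated string. */
final class CommaJoinedPaths {
  private static final char SEP = ',';
  private static final char ESC = '\\';

  /** Escapes the separator and the escape character inside one path. */
  static String escape(String path) {
    StringBuilder sb = new StringBuilder();
    for (char c : path.toCharArray()) {
      if (c == SEP || c == ESC) {
        sb.append(ESC);
      }
      sb.append(c);
    }
    return sb.toString();
  }

  /** Appends one path to an existing property value (null means "unset"). */
  static String append(String property, String path) {
    String escaped = escape(path);
    return property == null ? escaped : property + SEP + escaped;
  }

  /** Splits on unescaped separators and removes the escaping again. */
  static List<String> split(String property) {
    List<String> paths = new ArrayList<>();
    StringBuilder current = new StringBuilder();
    boolean escaped = false;
    for (char c : property.toCharArray()) {
      if (escaped) {
        // Previous character was the escape character: take this one literally.
        current.append(c);
        escaped = false;
      } else if (c == ESC) {
        escaped = true;
      } else if (c == SEP) {
        paths.add(current.toString());
        current.setLength(0);
      } else {
        current.append(c);
      }
    }
    paths.add(current.toString());
    return paths;
  }
}
```

With this scheme, appending "/data/a,b.txt" and then "/data/c.txt" and splitting the result gives back the same two paths, because only unescaped commas act as separators.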

> Allow wordcount example job to accept multiple input paths.
> ---
>
> Key: MAPREDUCE-5886
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5886
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: examples
> Affects Versions: 3.0.0, 2.4.0
> Reporter: Chris Nauroth
> Assignee: Chris Nauroth
> Priority: Minor
> Attachments: MAPREDUCE-5886.1.patch, MAPREDUCE-5886.2.patch, 
> MAPREDUCE-5886.3.patch
>
>
> It would be convenient if the wordcount example MapReduce job could accept 
> multiple input paths and run the word count on all of them.





[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13993986#comment-13993986
 ] 

Hadoop QA commented on MAPREDUCE-5886:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
http://issues.apache.org/jira/secure/attachment/12644185/MAPREDUCE-5886.1.patch
against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this patch.
Also please list what manual steps were performed to verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-mapreduce-project/hadoop-mapreduce-examples.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4594//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4594//console

This message is automatically generated.

> Allow wordcount example job to accept multiple input paths.
> ---
>
> Key: MAPREDUCE-5886
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5886
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: examples
> Affects Versions: 3.0.0, 2.4.0
> Reporter: Chris Nauroth
> Assignee: Chris Nauroth
> Priority: Minor
> Attachments: MAPREDUCE-5886.1.patch
>
>
> It would be convenient if the wordcount example MapReduce job could accept 
> multiple input paths and run the word count on all of them.





[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-14 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995682#comment-13995682
 ] 

Hadoop QA commented on MAPREDUCE-5886:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
http://issues.apache.org/jira/secure/attachment/12644305/MAPREDUCE-5886.2.patch
against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this patch.
Also please list what manual steps were performed to verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-mapreduce-project/hadoop-mapreduce-examples.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4597//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4597//console

This message is automatically generated.

> Allow wordcount example job to accept multiple input paths.
> ---
>
> Key: MAPREDUCE-5886
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5886
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: examples
> Affects Versions: 3.0.0, 2.4.0
> Reporter: Chris Nauroth
> Assignee: Chris Nauroth
> Priority: Minor
> Attachments: MAPREDUCE-5886.1.patch, MAPREDUCE-5886.2.patch
>
>
> It would be convenient if the wordcount example MapReduce job could accept 
> multiple input paths and run the word count on all of them.





[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-14 Thread Chris Nauroth (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996830#comment-13996830
 ] 

Chris Nauroth commented on MAPREDUCE-5886:
--

Hi, [~jira.shegalov] and [~ajisakaa].  Thanks for looking at this and 
contributing some new ideas.

Regarding {{FileInputFormat#addInputPaths}}, in addition to the issue Akira 
raised about supporting commas in file names, there is another reason why I 
didn't use that method.  On Windows Command Prompt, the comma acts as an 
argument separator, much like a space.  This could confuse users on Windows.

The basic concept of the new API looks good to me.  We might instead consider 
passing varargs and no range indices.  Word count could chop the input args 
down to the correct range using {{Arrays#copyOfRange}} or {{List#subList}}.
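
As a rough illustration of the suggestion above, the sketch below splits a command line of the form "<in> [<in>...] <out>" using both JDK methods mentioned. The class WordCountArgs and all of its method names are hypothetical, and the varargs addInputPaths here is only a stand-in that records the paths it receives; it is not an existing FileInputFormat method.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch: treat every argument but the last as an input path. */
final class WordCountArgs {
  /** Stand-in for a varargs API; it only records the paths it is given. */
  static List<String> addInputPaths(String... paths) {
    return new ArrayList<>(Arrays.asList(paths));
  }

  /** Copies all arguments except the last into a new array. */
  static String[] inputsByCopy(String[] args) {
    return Arrays.copyOfRange(args, 0, args.length - 1);
  }

  /** Returns a view of all arguments except the last, backed by args. */
  static List<String> inputsByView(String[] args) {
    return Arrays.asList(args).subList(0, args.length - 1);
  }

  /** The last argument is the output path. */
  static String output(String[] args) {
    return args[args.length - 1];
  }

  public static void main(String[] ignored) {
    // Fixed sample arguments, so the demo does not depend on the command line.
    String[] sample = {"in1", "in2", "out"};
    System.out.println("inputs: " + addInputPaths(inputsByCopy(sample)));
    System.out.println("output: " + output(sample));
  }
}
```

Both helpers assume at least one argument is present; a real driver would check the argument count and print a usage message first.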

Would you mind moving all of the API work to another JIRA?  MAPREDUCE-5889 
would probably work for that.  For this issue, I was hoping to put in a quick, 
trivial patch to word count alone.  In other words, I'd like to pursue a 
binding +1 on patch v1 and commit it.

Thanks again!



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-14 Thread Gera Shegalov (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13997088#comment-13997088
 ] 

Gera Shegalov commented on MAPREDUCE-5886:
--

Chris, thanks for the JDK pointer. I am aware of the behavior.



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-13 Thread Gera Shegalov (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996980#comment-13996980
 ] 

Gera Shegalov commented on MAPREDUCE-5886:
--

I considered both {{Arrays#copyOfRange}} and {{List#subList}} but discarded 
this due to creation of throwaway objects. Thanks for the discussion, 
[~ajisakaa] and [~cnauroth]. We can move the FIF changes to another JIRA.



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-13 Thread Chris Nauroth (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13997045#comment-13997045
 ] 

Chris Nauroth commented on MAPREDUCE-5886:
--

bq. I considered both {{Arrays#copyOfRange}} and {{List#subList}} but discarded 
this due to creation of throwaway objects.

It's not too bad for {{ArrayList#subList}}.  It retains the original array and 
wraps it with different offset indices:

http://hg.openjdk.java.net/jdk6/jdk6/jdk/file/tip/src/share/classes/java/util/ArrayList.java#l891

You pay a flat cost for the extra indices and object overhead, but it's not a 
full array reallocation.
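
A small, self-contained check of that view behavior, using only the JDK. The class name SubListViewDemo and the helper firstN are hypothetical; the point is that a write through the sub-list is visible in the original list, which is what "retains the original array" implies.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical demo: List#subList returns a view, not a copy. */
final class SubListViewDemo {
  /** Returns a view of the first n elements; no elements are copied. */
  static <T> List<T> firstN(List<T> list, int n) {
    return list.subList(0, n);
  }

  public static void main(String[] args) {
    List<String> all = new ArrayList<>(Arrays.asList("in1", "in2", "out"));
    List<String> inputs = firstN(all, all.size() - 1);

    // Writing through the view changes the backing list, which shows that
    // the two share storage rather than holding separate copies.
    inputs.set(0, "changed");
    System.out.println(all.get(0).equals("changed"));
  }
}
```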



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-13 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996154#comment-13996154
 ] 

Akira AJISAKA commented on MAPREDUCE-5886:
--

Thanks [~jira.shegalov] for the comment. Filed MAPREDUCE-5889 to deprecate 
{{FIF.addInputPaths(Job job, String commaSeparatedPaths)}}.



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-12 Thread Gera Shegalov (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995949#comment-13995949
 ] 

Gera Shegalov commented on MAPREDUCE-5886:
--

bq. The v2 patch cannot handle the path which includes a comma.
Akira, thanks for chiming in! That looks like a framework bug to me then. Are 
you suggesting that we deprecate {{FIF.addInputPaths(Job job, String 
commaSeparatedPaths)}}?



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-12 Thread Gera Shegalov (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994054#comment-13994054
 ] 

Gera Shegalov commented on MAPREDUCE-5886:
--

LGTM.  Possibly we can add a method to FIF ({{FileInputFormat}}):
{code}
/**
 * Adds numArgs paths, starting at args[offset], to the input.
 */
public static void addInputPaths(Job job, String[] args, int offset, int numArgs);
{code}

Then it can be used in other jobs, and reused within FIF to implement
{code}
  public static void addInputPaths(Job job, 
   String commaSeparatedPaths
   ) throws IOException {
{code}
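
For illustration only, here is a self-contained sketch of how an offset/count method like the one proposed above might behave. It does not use Hadoop: InputPathCollector is a hypothetical stand-in for the job, paths are plain strings, and the range check is an assumption about what a careful implementation would do.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a job that collects input paths. */
final class InputPathCollector {
  private final List<String> inputPaths = new ArrayList<>();

  /** Adds numArgs paths, starting at args[offset], without copying args. */
  void addInputPaths(String[] args, int offset, int numArgs) {
    // Reject ranges that fall outside the argument array.
    if (offset < 0 || numArgs < 0 || numArgs > args.length - offset) {
      throw new IndexOutOfBoundsException(
          "offset=" + offset + ", numArgs=" + numArgs + ", length=" + args.length);
    }
    for (int i = offset; i < offset + numArgs; i++) {
      inputPaths.add(args[i]);
    }
  }

  /** Returns the paths collected so far. */
  List<String> getInputPaths() {
    return inputPaths;
  }
}
```

Indexing into the original array this way avoids creating an intermediate array or list, at the cost of two extra parameters at every call site.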



[jira] [Commented] (MAPREDUCE-5886) Allow wordcount example job to accept multiple input paths.

2014-05-12 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995428#comment-13995428
 ] 

Akira AJISAKA commented on MAPREDUCE-5886:
--

+1 (non-binding) for the v1 patch. The v2 patch cannot handle the path which 
includes a comma.
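
The failure mode can be reproduced without Hadoop. The sketch below assumes the paths are joined into one string and later split on every comma, which the thread implies but does not show; the class CommaSplitDemo and its method are hypothetical.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical demo of why splitting on commas breaks some file names. */
final class CommaSplitDemo {
  /** Naive parsing of a comma-separated path list (assumed for illustration). */
  static List<String> splitOnCommas(String commaSeparatedPaths) {
    return Arrays.asList(commaSeparatedPaths.split(","));
  }

  public static void main(String[] args) {
    // One file whose name contains a comma, plus one ordinary file.
    String[] paths = {"/data/a,b.txt", "/data/c.txt"};

    // Joining and re-splitting yields more entries than there are files.
    System.out.println(splitOnCommas(String.join(",", paths)));
    // Keeping one path per argument preserves both names.
    System.out.println(Arrays.toString(paths));
  }
}
```

Passing each path as its own argument, as the v1 patch is described as doing, sidesteps the problem because no separator character is ever parsed.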
