[jira] [Resolved] (MAPREDUCE-3410) Capacity scheduler reconfiguration of queues does not work for add sub-queues to an existing queue

2012-08-07 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy resolved MAPREDUCE-3410.
--

Resolution: Later

For now we should punt on this - this is akin to 'delete queue' as Jason 
explained.

> Capacity scheduler reconfiguration of queues does not work for add sub-queues 
> to an existing queue
> --
>
> Key: MAPREDUCE-3410
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3410
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Anupam Seth
>
> If we have an existing queue configuration such as 
> root
>  |---> A
>  |---> B
> and we attempt to reconfigure it so that we now have
> root
>  |---> A
>|---> A1
>|---> A2
>  |---> B
> we get an IOException as follows:
> java.io.IOException: Failed to re-init queues
>   at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.reinitialize(CapacityScheduler.java:197)
>   at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.TestLeafQueue.testInitializeQueue(TestLeafQueue.java:206)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>   at java.lang.reflect.Method.invoke(Method.java:597)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runNotIgnored(BlockJUnit4ClassRunner.java:79)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:71)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:49)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:42)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:184)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:236)
>   at 
> org.apache.maven.surefire.junit4.JUnit4TestSet.execute(JUnit4TestSet.java:45)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:123)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:104)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>   at java.lang.reflect.Method.invoke(Method.java:597)
>   at 
> org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray(ReflectionUtils.java:164)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:110)
>   at 
> org.apache.maven.surefire.booter.SurefireStarter.invokeProvider(SurefireStarter.java:172)
>   at 
> org.apache.maven.surefire.booter.SurefireStarter.runSuitesInProcessWhenForked(SurefireStarter.java:78)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:70)
> Caused by: java.io.IOException: Trying to reinitialize root.a from root.a
>   at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.reinitialize(LeafQueue.java:524)
>   at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.reinitialize(ParentQueue.java:360)
>   at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.reinitializeQueues(CapacityScheduler.java:240)
>   at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.reinitialize(CapacityScheduler.java:194)
>   ... 32 more
> This is apparently because the CapacityScheduler still wants to think of A as 
> a LeafQueue instead of realizing it to be updated as a ParentQueue.
> Maybe, this use case is not supposed to be supported, in which case, 

[jira] [Created] (MAPREDUCE-4524) Capacity Scheduler does not support adding sub-queues to the existing queues.

2012-08-07 Thread Kiran BC (JIRA)
Kiran BC created MAPREDUCE-4524:
---

 Summary: Capacity Scheduler does not support adding sub-queues to 
the existing queues.
 Key: MAPREDUCE-4524
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4524
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: capacity-sched, documentation
Affects Versions: 0.23.0
Reporter: Kiran BC


In-line to the issue, MAPREDUCE-3410, there should be a note stating that - 
"Capacity Scheduler does not support adding sub-queues to the existing queue"

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-806) WordCount example does not compile given the current instructions

2012-08-07 Thread sakurai (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430787#comment-13430787
 ] 

sakurai commented on MAPREDUCE-806:
---

sorry, i missed out the jar command



> WordCount example does not compile given the current instructions
> -
>
> Key: MAPREDUCE-806
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-806
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: examples
>Affects Versions: 0.20.1
> Environment: linux
>Reporter: Hector Yuen
>Priority: Trivial
>
> http://hadoop.apache.org/common/docs/r0.20.0/mapred_tutorial.html#Example%3A+WordCount+v1.0
> In this example, the classpath is missing commons-cli-2.0-SNAPSHOT.jar
> If we compile according to the instructions:
> $ javac -classpath /hadoop/core/hadoop-0.20.0-core.jar -d ioperf_classes/ 
> src/WordCount.java
> src/WordCount.java:54: cannot access org.apache.commons.cli.Options
> class file for org.apache.commons.cli.Options not found
> String[] otherArgs = new GenericOptionsParser(conf, 
> args).getRemainingArgs();
> The correct compilation should be 
> $ javac -classpath 
> /hadoop/core/hadoop-0.20.0-core.jar:/hadoop/core/lib/commons-cli-2.0-SNAPSHOT.jar
>  -d ioperf_classes/ src/WordCount.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4511) Add IFile readahead

2012-08-07 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430744#comment-13430744
 ] 

Hadoop QA commented on MAPREDUCE-4511:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12539743/MAPREDUCE-4511_branch-1_rev2.patch
  against trunk revision .

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2717//console

This message is automatically generated.

> Add IFile readahead
> ---
>
> Key: MAPREDUCE-4511
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4511
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv1, mrv2
>Reporter: Ahmed Radwan
>Assignee: Ahmed Radwan
> Attachments: MAPREDUCE-4511_branch-1_rev2.patch, 
> MAPREDUCE-4511_branch1.patch, MAPREDUCE-4511_trunk.patch, 
> MAPREDUCE-4511_trunk_rev2.patch
>
>
> This ticket is to add IFile readahead as part of HADOOP-7714.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-4511) Add IFile readahead

2012-08-07 Thread Ahmed Radwan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ahmed Radwan updated MAPREDUCE-4511:


Attachment: MAPREDUCE-4511_branch-1_rev2.patch
MAPREDUCE-4511_trunk_rev2.patch

> Add IFile readahead
> ---
>
> Key: MAPREDUCE-4511
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4511
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv1, mrv2
>Reporter: Ahmed Radwan
>Assignee: Ahmed Radwan
> Attachments: MAPREDUCE-4511_branch-1_rev2.patch, 
> MAPREDUCE-4511_branch1.patch, MAPREDUCE-4511_trunk.patch, 
> MAPREDUCE-4511_trunk_rev2.patch
>
>
> This ticket is to add IFile readahead as part of HADOOP-7714.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4511) Add IFile readahead

2012-08-07 Thread Ahmed Radwan (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430732#comment-13430732
 ] 

Ahmed Radwan commented on MAPREDUCE-4511:
-

Here are updated versions for both trunk and branch-1 adding the ability to 
disable and configure the readahead and its length, respectively.

> Add IFile readahead
> ---
>
> Key: MAPREDUCE-4511
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4511
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv1, mrv2
>Reporter: Ahmed Radwan
>Assignee: Ahmed Radwan
> Attachments: MAPREDUCE-4511_branch1.patch, MAPREDUCE-4511_trunk.patch
>
>
> This ticket is to add IFile readahead as part of HADOOP-7714.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4044) YarnClientProtocolProvider does not honor mapred.job.tracker property

2012-08-07 Thread Mohammad Kamrul Islam (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430697#comment-13430697
 ] 

Mohammad Kamrul Islam commented on MAPREDUCE-4044:
--

Is there any update on this?


> YarnClientProtocolProvider does not honor mapred.job.tracker property
> -
>
> Key: MAPREDUCE-4044
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4044
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.24.0, 0.23.3
>Reporter: Alejandro Abdelnur
>
> The YarnClientProtocolProvider/YARNRunner/ResourceMgrDelegate bootstrap only 
> looks for 'yarn.resourcemanager.address', they ignore 'mapred.job.tracker'
> This breaks backward compatibility and creates issues in Oozie.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4323) NM log aggregator leaks filesystems

2012-08-07 Thread Jonathan Eagles (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430695#comment-13430695
 ] 

Jonathan Eagles commented on MAPREDUCE-4323:


Thanks to [~jlowe] for pointing out a simple way for me to check the  socket 
leak.

{code}
while true; do ls /proc//fd/ | wc; sleep 1; done
{code}

> NM log aggregator leaks filesystems
> ---
>
> Key: MAPREDUCE-4323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4323
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: nodemanager
>Affects Versions: 0.23.0, 0.24.0, 2.0.0-alpha
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>Priority: Critical
> Attachments: MAPREDUCE-4323.patch
>
>
> The NM is exhausting its fds because it's not closing fs instances when the 
> app is finished.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4323) NM log aggregator leaks filesystems

2012-08-07 Thread Jonathan Eagles (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430692#comment-13430692
 ] 

Jonathan Eagles commented on MAPREDUCE-4323:


This patch seems like it partially addresses the problem. Something else still 
seems to be leaking sockets. 

> NM log aggregator leaks filesystems
> ---
>
> Key: MAPREDUCE-4323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4323
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: nodemanager
>Affects Versions: 0.23.0, 0.24.0, 2.0.0-alpha
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>Priority: Critical
> Attachments: MAPREDUCE-4323.patch
>
>
> The NM is exhausting its fds because it's not closing fs instances when the 
> app is finished.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3782) teragen terasort jobs fail when using webhdfs://

2012-08-07 Thread Daryn Sharp (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430658#comment-13430658
 ] 

Daryn Sharp commented on MAPREDUCE-3782:


+1 I'm not fond of how formatters have to explicitly obtain tokens, but this 
fits the current paradigm.  Thanks Jason!

> teragen terasort jobs fail when using webhdfs:// 
> -
>
> Key: MAPREDUCE-3782
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3782
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.1, 0.24.0
>Reporter: Arpit Gupta
>Assignee: Jason Lowe
>Priority: Critical
> Attachments: MAPREDUCE-3782.patch
>
>
> When running a teragen job with a webhdfs:// url the delegation token that is 
> retrieved is an hdfs delegation token. 
> And the subsequent terasort job on the output fails with java io exception

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4039) Sort Avoidance

2012-08-07 Thread Mariappan Asokan (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430587#comment-13430587
 ] 

Mariappan Asokan commented on MAPREDUCE-4039:
-

Hi Anty,
  I have submitted a patch for MAPREDUCE-2454 to support a pluggable sort for 
MR.  I have some ideas on implementing a NullSortPlugin which will be a special 
sort plugin that avoids sorting.  The NullSortPlugin can live outside the 
Hadoop MR code.  I can share my thoughts if you are interested.
Thanks.
-- Asokan


> Sort Avoidance
> --
>
> Key: MAPREDUCE-4039
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4039
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: mrv2
>Affects Versions: 0.23.2
>Reporter: anty.rao
>Assignee: anty
>Priority: Minor
> Fix For: 0.23.2
>
> Attachments: IndexedCountingSortable.java, 
> MAPREDUCE-4039-branch-0.23.2.patch, MAPREDUCE-4039-branch-0.23.2.patch, 
> MAPREDUCE-4039-branch-0.23.2.patch
>
>
> Inspired by 
> [Tenzing|http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/37200.pdf],
>  in 5.1 MapReduce Enhanceemtns:
> {quote}*Sort Avoidance*. Certain operators such as hash join
> and hash aggregation require shuffling, but not sorting. The
> MapReduce API was enhanced to automatically turn off
> sorting for these operations. When sorting is turned off, the
> mapper feeds data to the reducer which directly passes the
> data to the Reduce() function bypassing the intermediate
> sorting step. This makes many SQL operators significantly
> more ecient.{quote}
> There are a lot of applications which need aggregation only, not 
> sorting.Using sorting to achieve aggregation is costly and inefficient. 
> Without sorting, up application can make use of hash table or hash map to do 
> aggregation efficiently.But application should bear in mind that reduce 
> memory is limited, itself is committed to manage memory of reduce, guard 
> against out of memory. Map-side combiner is not supported, you can also do 
> hash aggregation in map side  as a workaround.
> the following is the main points of sort avoidance implementation
> # add a configuration parameter ??mapreduce.sort.avoidance??, boolean type, 
> to turn on/off sort avoidance workflow.Two type of workflow are coexist 
> together.
> # key/value pairs emitted by map function is sorted by partition only, using 
> a more efficient sorting algorithm: counting sort.
> # map-side merge, use a kind of byte merge, which just concatenate bytes from 
> generated spills, read in bytes, write out bytes, without overhead of 
> key/value serialization/deserailization, comparison, which current version 
> incurs.
> # reduce can start up as soon as there is any map output available, in 
> contrast to sort workflow which must wait until all map outputs are fetched 
> and merged.
> # map output in memory can be directly consumed by reduce.When reduce can't 
> catch up with the speed of incoming map outputs, in-memory merge thread will 
> kick in, merging in-memory map outputs onto disk.
> # sequentially read in on-disk files to feed reduce, in contrast to currently 
> implementation which read multiple files concurrently, result in many disk 
> seek. Map output in memory take precedence over on disk files in feeding 
> reduce function.
> I have already implement this feature based on hadoop CDH3U3 and done some 
> performance evaluation, you can reference to 
> [https://github.com/hanborq/hadoop] for details. Now,I'm willing to port it 
> into yarn. Welcome for commenting.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3782) teragen terasort jobs fail when using webhdfs://

2012-08-07 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430583#comment-13430583
 ] 

Hadoop QA commented on MAPREDUCE-3782:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12539702/MAPREDUCE-3782.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in 
hadoop-mapreduce-project/hadoop-mapreduce-examples.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2716//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2716//console

This message is automatically generated.

> teragen terasort jobs fail when using webhdfs:// 
> -
>
> Key: MAPREDUCE-3782
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3782
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.1, 0.24.0
>Reporter: Arpit Gupta
>Assignee: Jason Lowe
>Priority: Critical
> Attachments: MAPREDUCE-3782.patch
>
>
> When running a teragen job with a webhdfs:// url the delegation token that is 
> retrieved is an hdfs delegation token. 
> And the subsequent terasort job on the output fails with java io exception

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-4323) NM log aggregator leaks filesystems

2012-08-07 Thread Daryn Sharp (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daryn Sharp updated MAPREDUCE-4323:
---

Summary: NM log aggregator leaks filesystems  (was: NM leaks sockets)

> NM log aggregator leaks filesystems
> ---
>
> Key: MAPREDUCE-4323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4323
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: nodemanager
>Affects Versions: 0.23.0, 0.24.0, 2.0.0-alpha
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>Priority: Critical
> Attachments: MAPREDUCE-4323.patch
>
>
> The NM is exhausting its fds because it's not closing fs instances when the 
> app is finished.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3782) teragen terasort jobs fail when using webhdfs://

2012-08-07 Thread Jason Lowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-3782:
--

Assignee: Jason Lowe
Target Version/s: 0.23.3, 2.2.0-alpha
  Status: Patch Available  (was: Open)

> teragen terasort jobs fail when using webhdfs:// 
> -
>
> Key: MAPREDUCE-3782
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3782
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.1, 0.24.0
>Reporter: Arpit Gupta
>Assignee: Jason Lowe
>Priority: Critical
> Attachments: MAPREDUCE-3782.patch
>
>
> When running a teragen job with a webhdfs:// url the delegation token that is 
> retrieved is an hdfs delegation token. 
> And the subsequent terasort job on the output fails with java io exception

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3782) teragen terasort jobs fail when using webhdfs://

2012-08-07 Thread Jason Lowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-3782:
--

Attachment: MAPREDUCE-3782.patch

Patch for TeraOutputFormat delegation token fix.  No automated testcase, but I 
did manually test on a secure cluster and verified that teragen obtains the 
delegation token for webhdfs.

> teragen terasort jobs fail when using webhdfs:// 
> -
>
> Key: MAPREDUCE-3782
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3782
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.1, 0.24.0
>Reporter: Arpit Gupta
>Priority: Critical
> Attachments: MAPREDUCE-3782.patch
>
>
> When running a teragen job with a webhdfs:// url the delegation token that is 
> retrieved is an hdfs delegation token. 
> And the subsequent terasort job on the output fails with java io exception

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3782) teragen terasort jobs fail when using webhdfs://

2012-08-07 Thread Jason Lowe (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430564#comment-13430564
 ] 

Jason Lowe commented on MAPREDUCE-3782:
---

Took a closer look at this.  The log message about getting a delegation token 
for HDFS is not in error, as that token is needed to setup the staging 
directory for the job.  So that is working as-expected.

Teragen failing to get a delegation token for webhdfs *is* a bug in 
TeraOutputFormat, as it is responsible for grabbing any necessary delegation 
tokens in the {{checkOutputSpecs}} method.

Terasort grabs two tokens, one from webhdfs (for the input files and would also 
for the output directory once TeraOutputFormat is fixed) and one from HDFS for 
the job staging area.  We won't see two delegation token messages for the same 
filesystem since it doesn't grab a new token if it already has one.

The range error is a known problem that has been already fixed, see HDFS-3101.

So to sum up, I think there's only one remaining bug here: TeraOutputFormat is 
not grabbing a delegation token for the output directory when 
{{checkOutputSpecs}} is called.

> teragen terasort jobs fail when using webhdfs:// 
> -
>
> Key: MAPREDUCE-3782
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3782
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.1, 0.24.0
>Reporter: Arpit Gupta
>Priority: Critical
>
> When running a teragen job with a webhdfs:// url the delegation token that is 
> retrieved is an hdfs delegation token. 
> And the subsequent terasort job on the output fails with java io exception

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4323) NM leaks sockets

2012-08-07 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430560#comment-13430560
 ] 

Hadoop QA commented on MAPREDUCE-4323:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12539698/MAPREDUCE-4323.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 1 new or modified test 
files.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in 
hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2715//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2715//console

This message is automatically generated.

> NM leaks sockets
> 
>
> Key: MAPREDUCE-4323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4323
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: nodemanager
>Affects Versions: 0.23.0, 0.24.0, 2.0.0-alpha
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>Priority: Critical
> Attachments: MAPREDUCE-4323.patch
>
>
> The NM is exhausting its fds because it's not closing fs instances when the 
> app is finished.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-4323) NM leaks sockets

2012-08-07 Thread Daryn Sharp (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daryn Sharp updated MAPREDUCE-4323:
---

Attachment: MAPREDUCE-4323.patch

Patch ensures {{closeAllForUGI}} is called in the event of a service init 
failure, or when the aggregation thread is done.

> NM leaks sockets
> 
>
> Key: MAPREDUCE-4323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4323
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: nodemanager
>Affects Versions: 0.23.0, 0.24.0, 2.0.0-alpha
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>Priority: Critical
> Attachments: MAPREDUCE-4323.patch
>
>
> The NM is exhausting its fds because it's not closing fs instances when the 
> app is finished.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-4323) NM leaks sockets

2012-08-07 Thread Daryn Sharp (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daryn Sharp updated MAPREDUCE-4323:
---

Status: Patch Available  (was: Open)

> NM leaks sockets
> 
>
> Key: MAPREDUCE-4323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4323
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: nodemanager
>Affects Versions: 2.0.0-alpha, 0.23.0, 0.24.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>Priority: Critical
> Attachments: MAPREDUCE-4323.patch
>
>
> The NM is exhausting its fds because it's not closing fs instances when the 
> app is finished.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4498) Remove hsqldb jar from Hadoop runtime classpath

2012-08-07 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430334#comment-13430334
 ] 

Hudson commented on MAPREDUCE-4498:
---

Integrated in Hadoop-Hdfs-trunk #1128 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1128/])
MAPREDUCE-4498. Remove hsqldb jar from Hadoop runtime classpath. (rkanter 
via tucu) (Revision 1369906)

 Result = SUCCESS
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1369906
Files : 
* 
/hadoop/common/trunk/hadoop-assemblies/src/main/resources/assemblies/hadoop-mapreduce-dist.xml
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/pom.xml
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/pom.xml
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/DBCountPageView.java
* /hadoop/common/trunk/hadoop-mapreduce-project/pom.xml
* /hadoop/common/trunk/hadoop-project/pom.xml


> Remove hsqldb jar from Hadoop runtime classpath
> ---
>
> Key: MAPREDUCE-4498
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4498
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: build, examples
>Affects Versions: 2.2.0-alpha
>Reporter: Robert Kanter
>Assignee: Robert Kanter
>Priority: Critical
> Fix For: 2.2.0-alpha
>
> Attachments: MAPREDUCE-4498-v2.patch, MAPREDUCE-4498-v3.patch, 
> MAPREDUCE-4498.patch
>
>
> The hsqldb jar is included in hadoop for the DBCountPageView example only.  
> Currently the example is using hsqldb version 2.x; however, 2.x is 
> incompatible with 1.8.x -- having this jar in the hadoop class path conflicts 
> with dependent projects like Oozie, Hive, and Pig which still use 1.8.x.  As 
> there are no features hsqldb 2.x that are used by the example, we should 
> remove it from Hadoop's runtime classpath.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4484) Incorrect IS_MINI_YARN_CLUSTER property name in YarnConfiguration

2012-08-07 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430331#comment-13430331
 ] 

Hudson commented on MAPREDUCE-4484:
---

Integrated in Hadoop-Hdfs-trunk #1128 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1128/])
MAPREDUCE-4484. Incorrect IS_MINI_YARN_CLUSTER property name in 
YarnConfiguration. (ahmed.radwan via tucu) (Revision 1370120)

 Result = SUCCESS
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1370120
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/conf/YarnConfiguration.java


> Incorrect IS_MINI_YARN_CLUSTER property name in YarnConfiguration
> -
>
> Key: MAPREDUCE-4484
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4484
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 2.0.0-alpha
>Reporter: Ahmed Radwan
>Assignee: Ahmed Radwan
> Fix For: 2.2.0-alpha
>
> Attachments: MAPREDUCE-4484.patch
>
>
> Noticed that the IS_MINI_YARN_CLUSTER property name in YarnConfiguration 
> ended up having an extra "." after appending to YARN_PREFIX.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-4494) TestFifoScheduler failing with Metrics source QueueMetrics,q0=default already exists!

2012-08-07 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430332#comment-13430332
 ] 

Hudson commented on MAPREDUCE-4494:
---

Integrated in Hadoop-Hdfs-trunk #1128 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1128/])
MAPREDUCE-4494. TestFifoScheduler failing with Metrics source 
QueueMetrics,q0=default already exists. (ahmed.radwan via tucu) (Revision 
1370118)

 Result = SUCCESS
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1370118
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestFifoScheduler.java


> TestFifoScheduler failing with Metrics source QueueMetrics,q0=default already 
> exists!
> -
>
> Key: MAPREDUCE-4494
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4494
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, test
>Affects Versions: 2.0.0-alpha
>Reporter: Ahmed Radwan
>Assignee: Ahmed Radwan
> Fix For: 2.2.0-alpha
>
> Attachments: MAPREDUCE-4494.patch
>
>
> TestFifoScheduler is failing:
> {code}
> Running org.apache.hadoop.yarn.server.resourcemanager.TestFifoScheduler
> Tests run: 5, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 14.614 sec 
> <<< FAILURE!
> Results :
> Tests in error: 
>   test(org.apache.hadoop.yarn.server.resourcemanager.TestFifoScheduler): 
> Metrics source QueueMetrics,q0=default already exists!
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-806) WordCount example does not compile given the current instructions

2012-08-07 Thread sakurai (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13429987#comment-13429987
 ] 

sakurai commented on MAPREDUCE-806:
---

i came across here as i happened to be facing the same problem.
everything goes well after i followed the instruction here(no errors after 
this), but i cant see any wordcount.jar created..

javac -classpath 
/home/hadoop/hadoop/hadoop-core-1.0.3.jar:/home/hadoop/hadoop/lib/commons-cli-1.2.jar
 -d classes src/WordCount.java

i only have the following classes created.
-rw-rw-r-- 1 hadoop hadoop 1790 Aug  7 15:42 WordCount$TokenizerMapper.class
-rw-rw-r-- 1 hadoop hadoop 1789 Aug  7 15:42 WordCount$IntSumReducer.class
-rw-rw-r-- 1 hadoop hadoop 1911 Aug  7 15:42 WordCount.class

sorry, i am not familiar with java, can someone tell me if my compiling is 
correct?

thank you.


> WordCount example does not compile given the current instructions
> -
>
> Key: MAPREDUCE-806
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-806
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: examples
>Affects Versions: 0.20.1
> Environment: linux
>Reporter: Hector Yuen
>Priority: Trivial
>
> http://hadoop.apache.org/common/docs/r0.20.0/mapred_tutorial.html#Example%3A+WordCount+v1.0
> In this example, the classpath is missing commons-cli-2.0-SNAPSHOT.jar
> If we compile according to the instructions:
> $ javac -classpath /hadoop/core/hadoop-0.20.0-core.jar -d ioperf_classes/ 
> src/WordCount.java
> src/WordCount.java:54: cannot access org.apache.commons.cli.Options
> class file for org.apache.commons.cli.Options not found
> String[] otherArgs = new GenericOptionsParser(conf, 
> args).getRemainingArgs();
> The correct compilation should be 
> $ javac -classpath 
> /hadoop/core/hadoop-0.20.0-core.jar:/hadoop/core/lib/commons-cli-2.0-SNAPSHOT.jar
>  -d ioperf_classes/ src/WordCount.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira