date:20090820

[jira] Updated: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-745:
-

Attachment: mapred-745-yahoo-internal.patch

Attaching a patch for resolving conflicts when the yahoo-hadoop-distribution is 
rolled forward to MAPREDUCE-745 (aka SVN r806173).

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: mapred-745-yahoo-internal.patch, 
> MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, MAPREDUCE-745-v1.3.patch, 
> MAPREDUCE-745-v1.7.patch, MAPREDUCE-745-v1.8-branch-0.20.patch, 
> MAPREDUCE-745-v1.8.1-branch-0.20.patch, MAPREDUCE-745-v1.8.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-824) Support a hierarchy of queues in the capacity scheduler

2009-08-20 Thread Hemanth Yamijala (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745827#action_12745827
 ] 

Hemanth Yamijala commented on MAPREDUCE-824:


Looked at the test cases:
- Code seems duplicated between CapacitySchedulerUtils and 
CapacityTaskScheduler and TestContainerQueue.
- In some test cases, when we create a queue, it is already adding a child to 
the parent. So, why do we need additional calls to addChildren ?
- What's the difference between testConfiguredCapacity and testMinCapacity ?
- The test cases testing scheduling are nice. The comments are a bit out of 
sync, though, and will be hard to maintain. Instead, I suggest that we assert 
what we are documenting in the tests themselves, so that the assertions read as 
comments and always stay in sync.
- As discussed, getCapacity() should not return max capacity any time. It 
should always return the current capacity or limit, whichever is smaller. 
Otherwise, the sort order of queues would be affected. 
- areTasksInQueueOverLimit should be changed to something along these lines:
{code}
  if (tsi.getMaxTaskLimit() > 0) {
    if (tsi.getNumSlotsOccupied() >= tsi.getCapacity()) {
      return true;
    }
  }

  if (tsi.getMaxCapacity() > 0) {
    if (tsi.getNumSlotsOccupied() >= tsi.getMaxCapacity()) {
      return true;
    }
  }
  return false;
{code}
- At the same time, testMaxCapacity should be removed. I would instead 
recommend a test case that sets a max capacity on a queue, and checks 
scheduling honors the decision.
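
The getCapacity() point above can be illustrated with a minimal sketch. The 
names QueueCapacitySketch and effectiveCapacity are hypothetical and are not 
taken from the patch. The sketch also assumes that a limit <= 0 means "no limit 
configured", mirroring the "> 0" checks in the snippet above.

```java
// Hypothetical illustration only; this is not the CapacityTaskScheduler code.
final class QueueCapacitySketch {
  /**
   * Returns the smaller of the current capacity and the configured limit.
   * A limit <= 0 is treated as "no limit configured" (an assumption).
   */
  static int effectiveCapacity(int currentCapacity, int limit) {
    if (limit <= 0) {
      return currentCapacity;
    }
    return Math.min(currentCapacity, limit);
  }

  public static void main(String[] args) {
    System.out.println(effectiveCapacity(10, 4));
    System.out.println(effectiveCapacity(10, -1));
  }
}
```

Because the returned value never exceeds the current capacity, sorting queues 
by it would not be skewed by a large max capacity setting.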

> Support a hierarchy of queues in the capacity scheduler
> ---
>
> Key: MAPREDUCE-824
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-824
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/capacity-sched
>Reporter: Hemanth Yamijala
> Attachments: HADOOP-824-1.patch, HADOOP-824-2.patch, 
> HADOOP-824-3.patch
>
>
> Currently in Capacity Scheduler, cluster capacity is divided among the queues 
> based on the queue capacity. These queues typically represent an organization 
> and the capacity of the queue represents the capacity the organization is 
> entitled to. Most organizations are large and need to divide their capacity 
> among sub-organizations they have. Or they may want to divide the capacity 
> based on a category or type of jobs they run. This JIRA covers the 
> requirements and other details to provide the above feature.


[jira] Created: (MAPREDUCE-898) Change DistributedCache to use new api.

2009-08-20 Thread Amareshwari Sriramadasu (JIRA)

Change DistributedCache to use new api.
---

 Key: MAPREDUCE-898
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-898
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu





[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Arun C Murthy (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745824#action_12745824
 ] 

Arun C Murthy commented on MAPREDUCE-430:
-

Yep, also the worst case is that the TT will kill the stuck JVM after 
mapred.task.timeout. But, yes it would be good to watch for this.

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with OutOfMemory error, stuck in cleanup.


[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Devaraj Das (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745823#action_12745823
 ] 

Devaraj Das commented on MAPREDUCE-430:
---

+1. Let's keep a watch on situations where the communication gets stuck due to 
an OOM..

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with OutOfMemory error, stuck in cleanup.


[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Arun C Murthy (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745820#action_12745820
 ] 

Arun C Murthy commented on MAPREDUCE-430:
-

I'm thinking we can do:

{noformat}
try {
  // existing stuff
} catch (FSError e) {
  // ...
} catch (Throwable t) {
  umbilical.fatalError();
}
{noformat}

Thus we can allow the Child to exit if it can't do umbilical.fatalError(). I'm 
thinking that we'll just do a best-effort to call {umbilical.fatalError}, I 
don't think the jvm will get 'stuck' there. Thoughts?
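
A minimal sketch of this best-effort idea follows. UmbilicalSketch is a 
hypothetical stand-in for the umbilical interface and models only a 
fatalError call; ChildExitSketch and runTask are likewise made-up names, and 
the real method signature is not assumed here.

```java
// Hypothetical stand-in for the umbilical; only fatalError is modeled.
interface UmbilicalSketch {
  void fatalError(String message) throws java.io.IOException;
}

final class ChildExitSketch {
  /**
   * Runs the task body. On any Throwable it makes one best-effort attempt
   * to report the failure and returns a non-zero exit code either way.
   */
  static int runTask(Runnable taskBody, UmbilicalSketch umbilical) {
    try {
      taskBody.run();
      return 0;
    } catch (Throwable t) {
      try {
        umbilical.fatalError(String.valueOf(t));
      } catch (Throwable reportFailure) {
        // Reporting failed (for example, another OOM); exit anyway.
      }
      return 1;
    }
  }
}
```

A caller would pass the result to System.exit(), so the process exits whether 
or not the report reached the TaskTracker.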

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with OutOfMemory error, stuck in cleanup.


[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Devaraj Das (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745817#action_12745817
 ] 

Devaraj Das commented on MAPREDUCE-430:
---

I am worried that the communication to the TT might get stuck if it was an OOM. 
On the other hand, System.exit has a much better probability of making the 
process exit. So my thinking for Child.java is:

try {
  // do the existing stuff
} catch (OutOfMemoryError oom) {
  System.exit(-1);
} catch (FSError fse) {
  // do the existing stuff
} catch (Throwable t) {
  // do the existing stuff
} finally {
  // do the existing stuff
}

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with OutOfMemory error, stuck in cleanup.


[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Arun C Murthy (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745815#action_12745815
 ] 

Arun C Murthy commented on MAPREDUCE-430:
-

A suggestion for a test-case for this bug:

A job with 1 map and 1 reduce. {map|reduce}_0_0 should throw OOM, 
{map|reduce}_0_1 should throw FSError, {map|reduce}_0_2 should throw 
IOException and {map|reduce}_0_3 should succeed. We can probably also add a 
check to ensure that the job completes within a couple of minutes or so.
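
The per-attempt failures suggested above could be injected along these lines. 
This is only a sketch: FailingTaskSketch, runAttempt and outcome are 
hypothetical names, and FsErrorStandIn is a local stand-in for Hadoop's 
FSError so that the example compiles without Hadoop on the classpath.

```java
// Hypothetical failure injection keyed on the attempt number.
final class FailingTaskSketch {
  /** Local stand-in for Hadoop's FSError (an assumption for this sketch). */
  static final class FsErrorStandIn extends Error {
    FsErrorStandIn(String message) {
      super(message);
    }
  }

  /** Fails attempts 0 to 2 in different ways; later attempts succeed. */
  static void runAttempt(int attemptNumber) throws java.io.IOException {
    switch (attemptNumber) {
      case 0:
        throw new OutOfMemoryError("simulated OOM");
      case 1:
        throw new FsErrorStandIn("simulated FSError");
      case 2:
        throw new java.io.IOException("simulated IOException");
      default:
        return;
    }
  }

  /** Returns "SUCCEEDED" or the simple class name of whatever was thrown. */
  static String outcome(int attemptNumber) {
    try {
      runAttempt(attemptNumber);
      return "SUCCEEDED";
    } catch (Throwable t) {
      return t.getClass().getSimpleName();
    }
  }
}
```

A real test would call something like runAttempt from the map and reduce 
bodies with the attempt number of the running task, and assert on the final 
job state plus the elapsed time.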

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with OutOfMemory error, stuck in cleanup.


[jira] Commented: (MAPREDUCE-679) XML-based metrics as JSP servlet for JobTracker

2009-08-20 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745806#action_12745806
 ] 

Hadoop QA commented on MAPREDUCE-679:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12417204/MAPREDUCE-679.3.patch
  against trunk revision 806408.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 2 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

-1 findbugs.  The patch appears to introduce 2 new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/500/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/500/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/500/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/500/console

This message is automatically generated.

> XML-based metrics as JSP servlet for JobTracker
> ---
>
> Key: MAPREDUCE-679
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-679
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: jobtracker
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: example-jobtracker-completed-job.xml, 
> example-jobtracker-running-job.xml, MAPREDUCE-679.2.patch, 
> MAPREDUCE-679.3.patch, MAPREDUCE-679.patch
>
>
> In HADOOP-4559, a general REST API for reporting metrics was proposed but 
> work seems to have stalled. In the interim, we have a simple XML translation 
> of the existing JobTracker status page which provides the same metrics 
> (including the tables of running/completed/failed jobs) as the human-readable 
> page. This is a relatively lightweight addition to provide some 
> machine-understandable metrics reporting.


[jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

2009-08-20 Thread Amareshwari Sriramadasu (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745805#action_12745805
 ] 

Amareshwari Sriramadasu commented on MAPREDUCE-370:
---

-1 core tests : due to MAPREDUCE-880
-1 contrib tests : due to MAPREDUCE-699

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> ---
>
> Key: MAPREDUCE-370
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: patch-370-1.txt, patch-370.txt
>
>



[jira] Updated: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is disabled.

2009-08-20 Thread Hemanth Yamijala (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala updated MAPREDUCE-834:
---

   Resolution: Fixed
Fix Version/s: 0.20.1
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

I just committed this to trunk and branch 0.20. Thanks, Sreekanth !

> When TaskTracker config use old memory management values its memory 
> monitoring is disabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
>Assignee: Sreekanth Ramakrishnan
> Fix For: 0.20.1
>
> Attachments: mapred-834-20.patch, mapreduce-834-1.patch, 
> mapreduce-834-2.patch, mapreduce-834-3.patch, mapreduce-834-4.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.


[jira] Updated: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Arun C Murthy (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-430:


Status: Open  (was: Patch Available)

I had a chat with Amar to explain my take on this:

# {Map|Reduce}Task shouldn't catch Throwable except for the parts where they 
deal with threads e.g. copier threads or merge threads in which case they 
should use a Throwable variable to save the error and check for it.
# Child should catch Throwable, inform the TaskTracker via 
TaskUmbilicalProtocol.fatalError or such and then exit.

Thoughts?
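
Point 1 above (threads saving the error in a Throwable variable that the task 
then checks) might look roughly like the following. This is a sketch under 
assumptions: WorkerErrorSketch and its methods are hypothetical names and are 
not taken from the copier or merge code.

```java
// Hypothetical sketch of a worker thread handing its error to the main thread.
final class WorkerErrorSketch {
  // Written by the worker thread, read by the task's main thread.
  private volatile Throwable workerError;

  /** Starts a worker that saves any Throwable instead of swallowing it. */
  Thread startWorker(Runnable work) {
    Thread worker = new Thread(() -> {
      try {
        work.run();
      } catch (Throwable t) {
        workerError = t;
      }
    }, "worker-sketch");
    worker.start();
    return worker;
  }

  /** Returns the saved error, or null if the worker has not failed. */
  Throwable savedError() {
    return workerError;
  }

  /** Runs the work on a worker thread, waits for it and returns its error. */
  static Throwable runAndCheck(Runnable work) {
    WorkerErrorSketch sketch = new WorkerErrorSketch();
    Thread worker = sketch.startWorker(work);
    try {
      worker.join();
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
    }
    return sketch.savedError();
  }
}
```

With this shape only the thread boundary catches Throwable; the task's main 
path rethrows whatever savedError() returns, and Child decides how to report 
it and exit.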



> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with OutOfMemory error, stuck in cleanup.


[jira] Resolved: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Devaraj Das (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das resolved MAPREDUCE-745.
---

   Resolution: Fixed
Fix Version/s: (was: 0.21.0)
   0.20.1
 Assignee: Amar Kamat
 Hadoop Flags: [Reviewed]

I committed this. Thanks, Amar!

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, 
> MAPREDUCE-745-v1.3.patch, MAPREDUCE-745-v1.7.patch, 
> MAPREDUCE-745-v1.8-branch-0.20.patch, MAPREDUCE-745-v1.8.1-branch-0.20.patch, 
> MAPREDUCE-745-v1.8.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>



[jira] Commented: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Devaraj Das (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745793#action_12745793
 ] 

Devaraj Das commented on MAPREDUCE-745:
---

Yesterday, there was a problem with the Apache machine that I use for 
commits, so I wasn't sure whether this got committed. It actually did get 
committed.
Thanks, Amar!

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, 
> MAPREDUCE-745-v1.3.patch, MAPREDUCE-745-v1.7.patch, 
> MAPREDUCE-745-v1.8-branch-0.20.patch, MAPREDUCE-745-v1.8.1-branch-0.20.patch, 
> MAPREDUCE-745-v1.8.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>



[jira] Commented: (MAPREDUCE-896) Users can set non-writable permissions on temporary files for TT and can abuse disk usage.

2009-08-20 Thread Vinod K V (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745787#action_12745787
 ] 

Vinod K V commented on MAPREDUCE-896:
-


To quote an example, TT cannot properly cleanup the following dirs.
{code}
$mapred.local.dir
   `-- taskTracker
        `-- jobcache
             `-- $jobid
                  `-- work
                       `-- unwritable-dir dr--r--r--
                            `-- user-file -r--r--r--
{code}

> Users can set non-writable permissions on temporary files for TT and can 
> abuse disk usage.
> --
>
> Key: MAPREDUCE-896
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-896
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Reporter: Vinod K V
>
> As of now, irrespective of the TaskController in use, the TT itself does a 
> full delete of local files created by itself or by job tasks. Depending upon 
> the TT's umask and the permissions set on files by the user, e.g. in the 
> job-work/task-work or child.tmp directories, this step may or may not 
> complete successfully. This leaves an opportunity for TT/users to abuse disk 
> space, either accidentally or intentionally.


[jira] Commented: (MAPREDUCE-896) Users can set non-writable permissions on temporary files for TT and can abuse disk usage.

2009-08-20 Thread Vinod K V (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745788#action_12745788
 ] 

Vinod K V commented on MAPREDUCE-896:
-

This problem gets aggravated when LinuxTaskController is in use. A user can 
simply change the group ownership of the files that he/she creates to his/her 
own group, thereby preventing the TaskTracker from cleaning up those files. By 
default, all directories have the setgid bit set, so files/dirs are cleanable 
by the TT.

> Users can set non-writable permissions on temporary files for TT and can 
> abuse disk usage.
> --
>
> Key: MAPREDUCE-896
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-896
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Reporter: Vinod K V
>
> As of now, irrespective of the TaskController in use, the TT itself does a 
> full delete of local files created by itself or by job tasks. Depending upon 
> the TT's umask and the permissions set on files by the user, e.g. in the 
> job-work/task-work or child.tmp directories, this step may or may not 
> complete successfully. This leaves an opportunity for TT/users to abuse disk 
> space, either accidentally or intentionally.


[jira] Created: (MAPREDUCE-897) Provide information captured as part of JobTrackerStatistics via the Hadoop metrics API

2009-08-20 Thread Hemanth Yamijala (JIRA)

Provide information captured as part of JobTrackerStatistics via the Hadoop 
metrics API
---

 Key: MAPREDUCE-897
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-897
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: tasktracker
Reporter: Hemanth Yamijala


MAPREDUCE-467 introduced a framework to collect statistics per node on a fixed 
set of intervals. Presently there is support for collecting statistics related 
to number of task failures and also health check script failures per hour, day 
and since start of system. It is felt that this information can be made 
available via the tasktracker's metrics system as well.


[jira] Created: (MAPREDUCE-896) Users can set non-writable permissions on temporary files for TT and can abuse disk usage.

2009-08-20 Thread Vinod K V (JIRA)

Users can set non-writable permissions on temporary files for TT and can abuse 
disk usage.
--

 Key: MAPREDUCE-896
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-896
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Reporter: Vinod K V


As of now, irrespective of the TaskController in use, the TT itself does a full 
delete of local files created by itself or by job tasks. Depending upon the 
TT's umask and the permissions set on files by the user, e.g. in the 
job-work/task-work or child.tmp directories, this step may or may not complete 
successfully. This leaves an opportunity for TT/users to abuse disk space, 
either accidentally or intentionally.


[jira] Commented: (MAPREDUCE-890) After HADOOP-4491, the user who started mapred system is not able to run job.

2009-08-20 Thread Vinod K V (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745784#action_12745784
 ] 

Vinod K V commented on MAPREDUCE-890:
-

In other words, the problem is the following:
{code}
$ mkdir testing
$ ls -ld testing/
drwxr-xr-x 2 vinodkv vinodkv 4096 2009-08-21 09:54 testing/
$ touch testing/t1.txt
$ echo $?
0
$ chmod 0570 testing/
$ ls -ld testing/
dr-xrwx--- 2 vinodkv vinodkv 4096 2009-08-21 09:54 testing/
$ touch testing/t2.txt
touch: cannot touch `testing/t2.txt': Permission denied
{code}

I searched and hunted this down in the Linux Kernel filesystem code: 
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=blob;f=fs/namei.c;h=f3c5b278895a0d3e0f23fe6fd474e2728a1c6cb6;hb=6c30c53fd5ae6a99a23ad78e90c428d2c8ffb07f
 . Particularly the generic_permission() function where we check if 
current_fsuid() == inode->i_uid.
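
The effect of that check can be modeled with a small sketch. It is a 
deliberately simplified model and not the kernel code: it ignores root, 
capabilities and ACLs, and the names OwnerFirstPermissionSketch, mayAccess and 
inGroup are hypothetical.

```java
// Simplified, hypothetical model of owner-first permission checking.
final class OwnerFirstPermissionSketch {
  static final int WRITE = 2;

  /**
   * If the process fsuid equals the file owner, only the owner bits are
   * consulted; otherwise the group bits (if the process is in the file's
   * group) or the other bits are used.
   */
  static boolean mayAccess(int mode, int fileUid, int fileGid,
                           int fsuid, int[] processGids, int mask) {
    int bits;
    if (fsuid == fileUid) {
      bits = (mode >> 6) & 7;
    } else if (inGroup(fileGid, processGids)) {
      bits = (mode >> 3) & 7;
    } else {
      bits = mode & 7;
    }
    return (bits & mask) == mask;
  }

  static boolean inGroup(int gid, int[] gids) {
    for (int g : gids) {
      if (g == gid) {
        return true;
      }
    }
    return false;
  }
}
```

With mode 0570, as in the shell session above, the owner is denied write 
access even though the owner is also a member of the owning group, while a 
different user in that group is allowed.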

> After HADOOP-4491, the user who started mapred system is not able to run job.
> -
>
> Key: MAPREDUCE-890
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-890
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Reporter: Karam Singh
>Assignee: Vinod K V
>
> Even the setup and cleanup tasks of the job fail due to an exception: it 
> fails to create the job and related directories under 
> mapred.local.dir/taskTracker/jobcache
> Directories are created as:
> [dr-xrws--- mapred   hadoop  ]  job_200908190916_0002
> mapred is not able to write under this. Even manually, I failed to touch a file.
> mapred is the user who started the MR cluster.


[jira] Commented: (MAPREDUCE-323) Improve the way job history files are managed

2009-08-20 Thread Rajiv Chittajallu (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745785#action_12745785
 ] 

Rajiv Chittajallu commented on MAPREDUCE-323:
-

+1 for Tim Williamson's suggestion.

I would also request removing the jobname from the history filename. 

> Improve the way job history files are managed
> -
>
> Key: MAPREDUCE-323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-323
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Amar Kamat
>Assignee: Amar Kamat
>
> Today all the jobhistory files are dumped in one _job-history_ folder. This 
> can cause problems when there is a need to search the history folder 
> (job-recovery etc). It would be nice if we group all the jobs under a _user_ 
> folder. So all the jobs for user _amar_ will go in _history-folder/amar/_. 
> Jobs can be categorized using various features like _jobid, date, jobname_ 
> etc but using _username_ will make the search much more efficient and also 
> will not result into namespace explosion. 


[jira] Commented: (MAPREDUCE-890) After HADOOP-4491, the user who started mapred system is not able to run job.

2009-08-20 Thread Vinod K V (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745783#action_12745783
 ] 

Vinod K V commented on MAPREDUCE-890:
-

As part of MAPREDUCE-842, job directories are set to have the following 
permissions:

||permissions||user ownership||group ownership||file/dir name||
|dr-xrws---|$job-owner|$tt_group|job_200908190916_0002|

CASE I: the $job-owner is other than the $tt_user
$tt_user is part of $tt_group, and so can create attempt directories inside the 
job directory as part of the task-localization because job_directory is group 
writable.

CASE II: the $job-owner is same as the $tt_user
TT cannot create attempt directories inside the job directory! Linux seems to 
compare the fsuid of the process with the uid of the directory's owner and, 
when they match, to apply only the owner permission bits. It therefore returns 
an error because the directory is not user-writable, even though it is 
group-writable.

CASE II is what is causing the current bug.

Alternative solutions at hand:
 - Leave the code as is, and live with the fact that users cannot submit jobs 
as the mapred user
 - Set drwxrws--- on $job-dir in all cases. This means users' tasks CAN 
potentially create unwarranted files/dirs in the $job_dir
 - Set drwxrws--- if the $job-owner is same as the $tt_user, and dr-xrws--- 
otherwise. Handles both cases, but complicates the code very slightly.

Thoughts?
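
The third alternative could be sketched as below. The class and method names 
are hypothetical, and the octal constants are simply the numeric forms of the 
two permission strings listed above (setgid bit 02000 plus 0770 or 0570).

```java
// Hypothetical sketch of choosing the job directory mode by ownership.
final class JobDirModeSketch {
  static final int OWNER_WRITABLE = 02770;  // drwxrws---
  static final int OWNER_READ_ONLY = 02570; // dr-xrws---

  /** Keeps the directory owner-writable only when the TT user owns the job. */
  static int jobDirMode(String jobOwner, String ttUser) {
    return jobOwner.equals(ttUser) ? OWNER_WRITABLE : OWNER_READ_ONLY;
  }
}
```

The extra branch is the "very slight" complication mentioned in the list; 
every other job keeps the stricter dr-xrws--- mode.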

> After HADOOP-4491, the user who started mapred system is not able to run job.
> -
>
> Key: MAPREDUCE-890
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-890
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Reporter: Karam Singh
>
> Even the setup and cleanup tasks of the job fail due to an exception: it 
> fails to create the job and related directories under 
> mapred.local.dir/taskTracker/jobcache
> Directories are created as:
> [dr-xrws--- mapred   hadoop  ]  job_200908190916_0002
> mapred is not able to write under this. Even manually, I failed to touch a file.
> mapred is the user who started the MR cluster.


[jira] Assigned: (MAPREDUCE-890) After HADOOP-4491, the user who started mapred system is not able to run job.

2009-08-20 Thread Vinod K V (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V reassigned MAPREDUCE-890:
---

Assignee: Vinod K V

> After HADOOP-4491, the user who started mapred system is not able to run job.
> -
>
> Key: MAPREDUCE-890
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-890
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Reporter: Karam Singh
>Assignee: Vinod K V
>
> Even the setup and cleanup tasks of the job fail due to an exception: it 
> fails to create the job and related directories under 
> mapred.local.dir/taskTracker/jobcache
> Directories are created as:
> [dr-xrws--- mapred   hadoop  ]  job_200908190916_0002
> mapred is not able to write under this. Even manually, I failed to touch a file.
> mapred is the user who started the MR cluster.


[jira] Updated: (MAPREDUCE-894) DBInputformat not working with SQLServer

2009-08-20 Thread Budianto Lie (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Budianto Lie updated MAPREDUCE-894:
---

Attachment: MAPREDUCE-894.patch

> DBInputformat not working with SQLServer
> 
>
> Key: MAPREDUCE-894
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-894
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Budianto Lie
> Attachments: MAPREDUCE-894.patch
>
>
> org.apache.hadoop.mapreduce.lib.db.DBInputFormat
> Microsoft SQL Server doesn't support LIMIT and OFFSET.
> Fix:
> Based on MAPREDUCE-716, I have already implemented a fix by creating a new class 
> org.apache.hadoop.mapreduce.lib.db.MsSqlDBRecordReader 
> and modifying the class org.apache.hadoop.mapreduce.lib.db.DBInputFormat.
> Note: this fix works only with SQL Server 2005 or higher.
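One common way to page through rows without LIMIT and OFFSET on SQL Server 2005 or later is the ROW_NUMBER() window function. Whether the attached patch works this way is not stated in this message, so the sketch below is only an illustration; the function name, table and column names are all hypothetical.

```python
def mssql_paged_query(table, fields, order_by, offset, length):
    """Build a ROW_NUMBER()-based query selecting `length` rows after `offset`."""
    cols = ", ".join(fields)
    return (
        f"SELECT {cols} FROM ("
        f"SELECT {cols}, ROW_NUMBER() OVER (ORDER BY {order_by}) AS rn "
        f"FROM {table}) AS numbered "
        f"WHERE rn > {offset} AND rn <= {offset + length}"
    )


# Hypothetical split: rows 21..30 of an "employees" table ordered by id.
print(mssql_paged_query("employees", ["id", "name"], "id", 20, 10))
```

The string building here is for illustration only; real code should not interpolate untrusted values into SQL.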


[jira] Updated: (MAPREDUCE-712) RandomTextWriter example is CPU bound

2009-08-20 Thread Chris Douglas (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-712:


   Resolution: Fixed
Fix Version/s: 0.21.0
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

I committed this.

> RandomTextWriter example is CPU bound
> -
>
> Key: MAPREDUCE-712
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-712
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.20.1, 0.21.0
> Environment: ~200 nodes cluster
> Each node has the following configuration:
> Processors: 2 x Xeon L5420 2.50GHz (8 cores) - Harpertown C0, 64-bit, 
> quad-core (8 CPUs)
> 4 Disks
> 16 GB RAM
> Linux 2.6
> Hadoop version: trunk
>Reporter: Khaled Elmeleegy
>Assignee: Chris Douglas
> Fix For: 0.21.0
>
> Attachments: MR712-0.patch, MR712-1.patch
>
>
> Running the RandomTextWriter example job (from the examples jar) pegs the 
> machines' CPUs.


[jira] Commented: (MAPREDUCE-712) RandomTextWriter example is CPU bound

2009-08-20 Thread Hong Tang (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745762#action_12745762
 ] 

Hong Tang commented on MAPREDUCE-712:
-

Patch looks good. +1.

> RandomTextWriter example is CPU bound
> -
>
> Key: MAPREDUCE-712
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-712
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.20.1, 0.21.0
> Environment: ~200 nodes cluster
> Each node has the following configuration:
> Processors: 2 x Xeon L5420 2.50GHz (8 cores) - Harpertown C0, 64-bit, 
> quad-core (8 CPUs)
> 4 Disks
> 16 GB RAM
> Linux 2.6
> Hadoop version: trunk
>Reporter: Khaled Elmeleegy
>Assignee: Chris Douglas
> Attachments: MR712-0.patch, MR712-1.patch
>
>
> Running the RandomTextWriter example job (from the examples jar) pegs the 
> machines' CPUs.


[jira] Created: (MAPREDUCE-895) FileSystem::ListStatus will now throw FileNotFoundException, MapRed needs updated

2009-08-20 Thread Jakob Homan (JIRA)

FileSystem::ListStatus will now throw FileNotFoundException, MapRed needs 
updated
-

 Key: MAPREDUCE-895
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-895
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Jakob Homan
Assignee: Jakob Homan


HADOOP-6201 (and HDFS-538) determined that the semantics of FileSystem::ListStatus 
are not correct and that the actual file system classes vary in their 
implementations, with some throwing an exception and some returning null.  
Fixing this will require adjusting code that calls this method. 
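The kind of caller adjustment described here can be sketched as follows. This is a Python analogue, not Hadoop code: `list_status` is a hypothetical stand-in for a listing call that raises instead of returning a null value, and `list_or_empty` is a hypothetical caller that previously would have checked for null.

```python
import os


def list_status(path):
    """Hypothetical stand-in: raise when the path is missing instead of returning None."""
    if not os.path.exists(path):
        raise FileNotFoundError(path)
    return sorted(os.listdir(path))


def list_or_empty(path):
    """Caller that used to test for None and now handles the exception."""
    try:
        return list_status(path)
    except FileNotFoundError:
        return []
```

Callers that relied on a null return would silently misbehave after the change, which is why each call site needs to be reviewed.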


[jira] Updated: (MAPREDUCE-679) XML-based metrics as JSP servlet for JobTracker

2009-08-20 Thread Aaron Kimball (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-679:


Status: Patch Available  (was: Open)

> XML-based metrics as JSP servlet for JobTracker
> ---
>
> Key: MAPREDUCE-679
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-679
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: jobtracker
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: example-jobtracker-completed-job.xml, 
> example-jobtracker-running-job.xml, MAPREDUCE-679.2.patch, 
> MAPREDUCE-679.3.patch, MAPREDUCE-679.patch
>
>
> In HADOOP-4559, a general REST API for reporting metrics was proposed but 
> work seems to have stalled. In the interim, we have a simple XML translation 
> of the existing JobTracker status page which provides the same metrics 
> (including the tables of running/completed/failed jobs) as the human-readable 
> page. This is a relatively lightweight addition to provide some 
> machine-understandable metrics reporting.


[jira] Updated: (MAPREDUCE-679) XML-based metrics as JSP servlet for JobTracker

2009-08-20 Thread Aaron Kimball (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-679:


Attachment: example-jobtracker-completed-job.xml

> XML-based metrics as JSP servlet for JobTracker
> ---
>
> Key: MAPREDUCE-679
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-679
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: jobtracker
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: example-jobtracker-completed-job.xml, 
> example-jobtracker-running-job.xml, MAPREDUCE-679.2.patch, 
> MAPREDUCE-679.3.patch, MAPREDUCE-679.patch
>
>
> In HADOOP-4559, a general REST API for reporting metrics was proposed but 
> work seems to have stalled. In the interim, we have a simple XML translation 
> of the existing JobTracker status page which provides the same metrics 
> (including the tables of running/completed/failed jobs) as the human-readable 
> page. This is a relatively lightweight addition to provide some 
> machine-understandable metrics reporting.


[jira] Updated: (MAPREDUCE-679) XML-based metrics as JSP servlet for JobTracker

2009-08-20 Thread Aaron Kimball (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-679:


Attachment: example-jobtracker-running-job.xml

Attaching example xml outputs

> XML-based metrics as JSP servlet for JobTracker
> ---
>
> Key: MAPREDUCE-679
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-679
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: jobtracker
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: example-jobtracker-completed-job.xml, 
> example-jobtracker-running-job.xml, MAPREDUCE-679.2.patch, 
> MAPREDUCE-679.3.patch, MAPREDUCE-679.patch
>
>
> In HADOOP-4559, a general REST API for reporting metrics was proposed but 
> work seems to have stalled. In the interim, we have a simple XML translation 
> of the existing JobTracker status page which provides the same metrics 
> (including the tables of running/completed/failed jobs) as the human-readable 
> page. This is a relatively lightweight addition to provide some 
> machine-understandable metrics reporting.


[jira] Updated: (MAPREDUCE-679) XML-based metrics as JSP servlet for JobTracker

2009-08-20 Thread Aaron Kimball (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-679:


Status: Open  (was: Patch Available)

> XML-based metrics as JSP servlet for JobTracker
> ---
>
> Key: MAPREDUCE-679
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-679
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: jobtracker
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: example-jobtracker-completed-job.xml, 
> example-jobtracker-running-job.xml, MAPREDUCE-679.2.patch, 
> MAPREDUCE-679.3.patch, MAPREDUCE-679.patch
>
>
> In HADOOP-4559, a general REST API for reporting metrics was proposed but 
> work seems to have stalled. In the interim, we have a simple XML translation 
> of the existing JobTracker status page which provides the same metrics 
> (including the tables of running/completed/failed jobs) as the human-readable 
> page. This is a relatively lightweight addition to provide some 
> machine-understandable metrics reporting.


[jira] Updated: (MAPREDUCE-679) XML-based metrics as JSP servlet for JobTracker

2009-08-20 Thread Aaron Kimball (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-679:


Attachment: MAPREDUCE-679.3.patch

Attaching new patch

* moves /jobtracker.xml.jsp to /jobtracker.jspx
* updates test to go along with it
* updates build.xml to fix bug in webapp compilation.

> XML-based metrics as JSP servlet for JobTracker
> ---
>
> Key: MAPREDUCE-679
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-679
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: jobtracker
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: example-jobtracker-completed-job.xml, 
> example-jobtracker-running-job.xml, MAPREDUCE-679.2.patch, 
> MAPREDUCE-679.3.patch, MAPREDUCE-679.patch
>
>
> In HADOOP-4559, a general REST API for reporting metrics was proposed but 
> work seems to have stalled. In the interim, we have a simple XML translation 
> of the existing JobTracker status page which provides the same metrics 
> (including the tables of running/completed/failed jobs) as the human-readable 
> page. This is a relatively lightweight addition to provide some 
> machine-understandable metrics reporting.


[jira] Commented: (MAPREDUCE-870) Clean up the job Retire code

2009-08-20 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745736#action_12745736
 ] 

Hadoop QA commented on MAPREDUCE-870:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12417097/870_v3.patch
  against trunk revision 806288.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 13 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/499/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/499/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/499/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/499/console

This message is automatically generated.

> Clean up the job Retire code
> 
>
> Key: MAPREDUCE-870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-870
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Sharad Agarwal
>Assignee: Sharad Agarwal
> Attachments: 870_v1.patch, 870_v2.patch, 870_v3.patch
>
>
> Currently, a completed job's full data structures are kept in memory based on the 
> mapred.jobtracker.completeuserjobs.maximum, 
> mapred.jobtracker.retirejob.interval.min, 
> mapred.jobtracker.retirejob.interval and mapred.jobtracker.retirejob.check 
> settings. These controls are not very useful now, since MAPREDUCE-817 
> introduced a cache for keeping just the very basic info of the completed job. 
> These settings should be removed and the job should be purged as soon as the 
> history files are available in HDFS. 
> Going forward, clients can read the history files if they need to drill down 
> into more information (MAPREDUCE-864).


[jira] Commented: (MAPREDUCE-871) Job/Task local files have incorrect group ownership set by LinuxTaskController binary

2009-08-20 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745637#action_12745637
 ] 

Hadoop QA commented on MAPREDUCE-871:
-

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12417121/MAPREDUCE-871-20090820.1.txt
  against trunk revision 806152.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/498/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/498/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/498/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/498/console

This message is automatically generated.

> Job/Task local files have incorrect group ownership set by 
> LinuxTaskController binary
> -
>
> Key: MAPREDUCE-871
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-871
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: tasktracker
>Reporter: Vinod K V
>Assignee: Vinod K V
> Attachments: MAPREDUCE-871-20090820.1.txt
>
>
> HADOOP-4491 fixed the secure permissions of local files on a TT. While 
> testing HADOOP-4491 on a cluster, [~karams] found a bug. All the 
> files/dirs should be owned by the group corresponding to the group owner 
> of the task-controller binary (obtained via getegid()), which in turn is a 
> special group to which *only* the TT user belongs. HADOOP-4491 incorrectly set it 
> to the primary group of the TT via getgid(), and not the special group.
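The distinction between the two calls can be illustrated with Python's thin wrappers over the same POSIX functions. This is only a sketch: the helper name `current_gids` is hypothetical, and in an ordinary process (no setgid bit involved) both values are usually the same.

```python
import os


def current_gids():
    """Return the real and effective group IDs of this process."""
    return {
        # Real GID: the invoking user's group (what getgid() reports).
        "real": os.getgid(),
        # Effective GID: for a setgid binary, the group owner of the binary
        # (what getegid() reports).
        "effective": os.getegid(),
    }


print(current_gids())
```

For a setgid executable such as the task-controller binary described above, the effective GID is the one that identifies the special group.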


[jira] Updated: (MAPREDUCE-767) to remove mapreduce dependency on commons-cli2

2009-08-20 Thread Owen O'Malley (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley updated MAPREDUCE-767:


Affects Version/s: 0.20.1
Fix Version/s: 0.20.1

+1

I've committed this to trunk with the slight change that string.split("=",2) 
was missing the "2".

We need this patch for 0.20.1. Can you make one please?
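The effect of the missing limit can be shown with a small example. In Java, `"a=b=c".split("=", 2)` yields at most two pieces; the Python analogue is `split("=", 1)`, which performs at most one split. The helper name and the sample property below are hypothetical.

```python
def parse_property(arg):
    """Split 'key=value' on the first '=' only, so the value may itself contain '='."""
    key, value = arg.split("=", 1)
    return key, value


# Without the limit, a value containing '=' is broken into extra pieces.
print("mapred.child.java.opts=-Dfoo=bar".split("="))
# With the limit, the value survives intact.
print(parse_property("mapred.child.java.opts=-Dfoo=bar"))
```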

> to remove mapreduce dependency on commons-cli2
> --
>
> Key: MAPREDUCE-767
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-767
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/streaming
>Affects Versions: 0.20.1
>Reporter: Giridharan Kesavan
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-767-v1.1.patch, MAPREDUCE-767-v1.2.patch
>
>
> mapreduce, streaming and the eclipse plugin depend on commons-cli2 


[jira] Commented: (MAPREDUCE-892) command line tool to list all tasktrackers and their status

2009-08-20 Thread Dmytro Molkov (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745542#action_12745542
 ] 

Dmytro Molkov commented on MAPREDUCE-892:
-

Currently, some classes that hold information relevant to this report 
(TaskTrackerStatus, JobInProgress) are package-private, which prevents 
me from using them directly in the MRAdmin class.
Two ways to solve this are to introduce public interfaces that expose the 
information useful in reports, or to make the classes public.

> command line tool to list all tasktrackers and their status
> ---
>
> Key: MAPREDUCE-892
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-892
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: dhruba borthakur
>Assignee: Dmytro Molkov
>
> The "hadoop mradmin -report" could list all the tasktrackers that the 
> JobTracker knows about. It will also list a brief status summary for each 
> TaskTracker. (This is similar to the hadoop dfsadmin -report command that 
> lists all Datanodes.)


[jira] Commented: (MAPREDUCE-767) to remove mapreduce dependency on commons-cli2

2009-08-20 Thread Amar Kamat (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745487#action_12745487
 ] 

Amar Kamat commented on MAPREDUCE-767:
--

From the documentation of [cli2|http://commons.apache.org/sandbox/cli2/] and 
[cli1.2|http://commons.apache.org/cli/release_1_2.html], it's clear that cli1.2 
doesn't support 
[Validators|http://commons.apache.org/sandbox/cli2/manual/validators.html] and 
[PropertyOption|http://commons.apache.org/sandbox/cli2/manual/options.html]. 
But from my manual testing it seems we can work around that. 
Options like _-jobconf_ and _-cmdenv_ extend PropertyOption. Also, the _-file_ 
option uses Validators, which can be checked once the option is parsed. I didn't 
see any difference in streaming command-line parsing with the patch.
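The workaround idea (parse first, validate afterwards) can be sketched with a parser that has no built-in validators or property options. This uses Python's argparse purely as an analogue and is not the commons-cli API; the function name and the error handling are assumptions made for illustration.

```python
import argparse
import os


def parse_streaming_args(argv):
    """Parse repeatable options plainly, then validate them after parsing."""
    parser = argparse.ArgumentParser(prog="streaming-sketch", allow_abbrev=False)
    parser.add_argument("-jobconf", action="append", default=[])
    parser.add_argument("-cmdenv", action="append", default=[])
    parser.add_argument("-file", action="append", default=[])
    args = parser.parse_args(argv)

    # Property-style options: split each "key=value" on the first '=' only.
    jobconf = dict(kv.split("=", 1) for kv in args.jobconf)
    cmdenv = dict(kv.split("=", 1) for kv in args.cmdenv)

    # Validator-style check, done once the options have been parsed.
    missing = [f for f in args.file if not os.path.isfile(f)]
    if missing:
        raise ValueError("file(s) not found: " + ", ".join(missing))
    return jobconf, cmdenv, args.file
```

The checks simply move from the parser into ordinary code that runs after parsing, which matches the observation that command-line behaviour did not change.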

> to remove mapreduce dependency on commons-cli2
> --
>
> Key: MAPREDUCE-767
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-767
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/streaming
>Reporter: Giridharan Kesavan
>Assignee: Amar Kamat
> Attachments: MAPREDUCE-767-v1.1.patch, MAPREDUCE-767-v1.2.patch
>
>
> mapreduce, streaming and the eclipse plugin depend on commons-cli2 


[jira] Updated: (MAPREDUCE-318) Refactor reduce shuffle code

2009-08-20 Thread Jothi Padmanabhan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jothi Padmanabhan updated MAPREDUCE-318:


Attachment: mapred-318-20Aug.patch

Some more modifications to the previous patch:

# The shuffle status on the web UI (the number of maps being copied and the 
bandwidth) is now present. This makes it similar to the current trunk 
status.
# Modified the condition for triggering an on-disk merge to numfiles > 
(2*iosortfactor - 1), similar to the current trunk code. This ensures we do 
merges a little less aggressively.
# Modified to trigger a memory merge on stall. We cannot pull the trigger only 
when the memory threshold is crossed: that can hang when several 
fetchers just return because there is not sufficient memory for the current map 
output, but the total memory used has not crossed the threshold. 
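The two trigger conditions described in the list can be written down as small predicates. This is a sketch of the stated conditions only, not the patch itself; the function and parameter names are hypothetical.

```python
def should_merge_on_disk(num_files, io_sort_factor):
    """On-disk merge fires once the file count exceeds 2 * io_sort_factor - 1."""
    return num_files > 2 * io_sort_factor - 1


def should_merge_in_memory(used_bytes, threshold_bytes, fetchers_stalled):
    """In-memory merge fires on crossing the threshold OR when fetchers stall.

    The stall clause avoids the hang where every fetcher is waiting for
    memory while total usage is still below the threshold.
    """
    return used_bytes >= threshold_bytes or fetchers_stalled
```

With an assumed sort factor of 10, for example, 19 on-disk files do not trigger a merge and 20 do.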

> Refactor reduce shuffle code
> 
>
> Key: MAPREDUCE-318
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-318
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Owen O'Malley
>Assignee: Owen O'Malley
> Attachments: HADOOP-5233_api.patch, HADOOP-5233_part0.patch, 
> mapred-318-14Aug.patch, mapred-318-20Aug.patch, mapred-318-common.patch
>
>
> The reduce shuffle code has become very complex and entangled. I think we 
> should move it out of ReduceTask and into a separate package 
> (org.apache.hadoop.mapred.task.reduce). Details to follow.


[jira] Updated: (MAPREDUCE-873) Simplify Job Recovery

2009-08-20 Thread Sharad Agarwal (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-873:
-

Attachment: 873_v1.patch

Early patch. Testing in progress. It:
- removes the old recovery logic.
- performs recovery by submitting the job IDs found in the mapred.system dir to 
JobTracker#submitJob.
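The resubmission approach can be sketched as a directory scan followed by a submit call per job. This is an illustration under assumptions: the job ID pattern is inferred from IDs such as job_200908190916_0002 seen elsewhere in this archive, and `recover_jobs` and the `submit_job` callback are hypothetical names, not the patch's code.

```python
import os
import re

# Assumed job ID shape: "job_<timestamp>_<sequence>".
JOB_ID = re.compile(r"^job_\d+_\d+$")


def recover_jobs(system_dir, submit_job):
    """Resubmit every job ID found in system_dir; return the IDs in order."""
    job_ids = sorted(name for name in os.listdir(system_dir) if JOB_ID.match(name))
    for job_id in job_ids:
        # Each job restarts from the beginning; no per-task state is replayed.
        submit_job(job_id)
    return job_ids
```

Entries that do not look like job IDs are ignored rather than resubmitted.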

> Simplify Job Recovery
> -
>
> Key: MAPREDUCE-873
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-873
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: jobtracker
>Affects Versions: 0.20.1
>Reporter: Devaraj Das
>Assignee: Sharad Agarwal
> Fix For: 0.21.0
>
> Attachments: 873_v1.patch
>
>
> On a couple of occasions we have seen the JobTracker fail to handle 
> job recovery well, leading to cluster downtime after a restart. The 
> current design for handling job recovery is complex and prone to corner cases 
> not being handled well enough. In retrospect, it seems like the transaction 
> log based approach as was proposed on HADOOP-3245 
> (http://tinyurl.com/luh9hb), would have been a better/simpler model. However, 
> that is a big project, and it seems for the medium term, just handling job 
> re-submissions after a restart is a good tradeoff. That is, the JobTracker 
> after getting restarted, will resubmit all jobs that were running in its past 
> life. They will all start from the beginning (downside is completed tasks 
> will reexecute). In the long term, the transaction log model or some variant 
> of that should be pursued.
> Thoughts/comments welcome.


[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-20 Thread Amar Kamat (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745463#action_12745463
 ] 

Amar Kamat commented on MAPREDUCE-430:
--

Spoke to Devaraj about this offline. So the Child can ignore Errors and catch only 
Exceptions and FSError. But ReduceTask and MapTask still catch Throwable and 
swallow it. This can be serious, and hence we need to take care of it. There 
are multiple options:
- don't catch Throwable; only catch Exceptions
- catch Throwable and exit the JVM
- catch OOM and exit the JVM.
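The third option can be sketched in Python, with the caveat that the exception hierarchy differs from Java's: in Python, MemoryError is itself a subclass of Exception, so it has to be handled before the generic handler. The function name and the status strings are hypothetical.

```python
import sys


def run_task(task):
    """Run a task; fail normally on ordinary errors, exit the process on OOM."""
    try:
        task()
        return "SUCCEEDED"
    except MemoryError:
        # Analogue of "catch OOM and exit the JVM": never swallow it,
        # because the process may be in an unusable state.
        sys.exit(1)
    except Exception:
        # Ordinary failures are reported instead of killing the process.
        return "FAILED"
```

The point of the sketch is only the ordering: the fatal condition is singled out and terminates the process, while everything else is reported as a task failure.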

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Observed a task with an OutOfMemory error, stuck in cleanup.


[jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

2009-08-20 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745440#action_12745440
 ] 

Hadoop QA commented on MAPREDUCE-370:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12417117/patch-370-1.txt
  against trunk revision 806065.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/497/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/497/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/497/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/497/console

This message is automatically generated.

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> ---
>
> Key: MAPREDUCE-370
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: patch-370-1.txt, patch-370.txt
>
>



[jira] Updated: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-745:
-

Attachment: MAPREDUCE-745-v1.8.1-branch-0.20.patch

Patch for branch 0.20 with removed imports.

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, 
> MAPREDUCE-745-v1.3.patch, MAPREDUCE-745-v1.7.patch, 
> MAPREDUCE-745-v1.8-branch-0.20.patch, MAPREDUCE-745-v1.8.1-branch-0.20.patch, 
> MAPREDUCE-745-v1.8.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>



[jira] Updated: (MAPREDUCE-157) Job History log file format is not friendly for external tools.

2009-08-20 Thread Jothi Padmanabhan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jothi Padmanabhan updated MAPREDUCE-157:


Attachment: mapred-157-prelim.patch

Preliminary patch as requested by Doug. I got this to produce a history file in 
JSON format (but only after commenting out lots of code), so it is more useful for 
illustration than anything else.

My guess is that it should be fairly straightforward to port this to use 
Avro. 

> Job History log file format is not friendly for external tools.
> ---
>
> Key: MAPREDUCE-157
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-157
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Owen O'Malley
>Assignee: Jothi Padmanabhan
> Attachments: mapred-157-prelim.patch
>
>
> Currently, parsing the job history logs with external tools is very difficult 
> because of the format. The most critical problem is that newlines aren't 
> escaped in the strings. That makes using tools like grep, sed, and awk very 
> tricky.
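The newline problem goes away when each event is serialized as JSON, because JSON string encoding escapes newlines. The sketch below shows one event per line; the field names are hypothetical and are not taken from the patch.

```python
import json


def to_history_line(event):
    """Serialize one history event as a single line of JSON."""
    return json.dumps(event, sort_keys=True)


# A multi-line error message stays on one physical line after encoding.
event = {"type": "TASK_FAILED", "error": "line one\nline two"}
print(to_history_line(event))
```

With one record per line, line-oriented tools such as grep can match whole events again.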


[jira] Commented: (MAPREDUCE-807) Stray user files in mapred.system.dir with permissions other than 777 can prevent the jobtracker from starting up.

2009-08-20 Thread Amar Kamat (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745399#action_12745399
 ] 

Amar Kamat commented on MAPREDUCE-807:
--

All tests passed except TestRecoveryManager and TestReduceFetch, which both 
FAILED (timeout).

> Stray user files in mapred.system.dir with permissions other than 777 can 
> prevent the jobtracker from starting up.
> --
>
> Key: MAPREDUCE-807
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-807
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amar Kamat
>Assignee: Amar Kamat
>Priority: Blocker
> Attachments: MAPRED-807-v1.1.patch, MAPRED-807-v1.2.patch, 
> MAPRED-807-v1.3.patch, MAPRED-807-v1.4.patch
>
>
> With restart disabled, the jobtracker does a _rm -rf_ of the 
> mapred.system.dir. If the mapred.system.dir contains user files with 
> permissions other than 777 then the jobtracker gets stuck in a loop trying to 
> delete the mapred.system.dir (and each time failing with 
> AccessControlException).


[jira] Commented: (MAPREDUCE-27) Jobs with 0 maps will never get removed from the default scheduler

2009-08-20 Thread Amar Kamat (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-27?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745398#action_12745398
 ] 

Amar Kamat commented on MAPREDUCE-27:
-

MAPREDUCE-805

> Jobs with 0 maps will never get removed from the default scheduler
> --
>
> Key: MAPREDUCE-27
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-27
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amar Kamat
>Assignee: Amar Kamat
> Attachments: HADOOP-5049-v1.1.patch
>
>
> Jobs with 0 maps finish/succeed in the init phase, i.e. while the job is in 
> the _PREP_ state. {{EagerTaskInitializationListener}} removes the job after 
> initing, but {{JobQueueJobInProgressListener}} waits for a job-state change 
> event to be raised and only then removes the job from the queue; hence 
> the job will stay forever with the {{JobQueueJobInProgressListener}}. It looks 
> like {{FairScheduler}} periodically scans the job list and removes completed 
> jobs. {{CapacityScheduler}} has a concept of waiting jobs and scans the waiting 
> queue for completed jobs and purges them.
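The difference between event-driven removal and periodic scanning can be modelled in a few lines. This is a toy model with hypothetical class names, not the scheduler code: a job with no maps completes during init without raising a state-change event, so only the scanning queue ever drops it.

```python
DONE_STATES = ("SUCCEEDED", "FAILED", "KILLED")


class Job:
    def __init__(self, job_id, num_maps):
        self.job_id = job_id
        self.num_maps = num_maps
        self.state = "PREP"

    def init(self):
        # A job with no maps has nothing to run and completes during init,
        # without any state-change event being raised.
        if self.num_maps == 0:
            self.state = "SUCCEEDED"


class EventDrivenQueue:
    """Removes a job only when told about a state change."""

    def __init__(self):
        self.jobs = {}

    def job_added(self, job):
        self.jobs[job.job_id] = job

    def job_state_changed(self, job):
        if job.state in DONE_STATES:
            self.jobs.pop(job.job_id, None)


class ScanningQueue(EventDrivenQueue):
    """Additionally purges completed jobs on a periodic scan."""

    def scan(self):
        for job_id in [i for i, j in self.jobs.items() if j.state in DONE_STATES]:
            del self.jobs[job_id]
```

In this model the event-driven queue keeps the zero-map job indefinitely, while the scanning queue removes it on its next pass.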


[jira] Updated: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is diabled.

2009-08-20 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-834:
-

Attachment: mapred-834-20.patch

Attaching patch for branch 20.

> When TaskTracker config use old memory management values its memory 
> monitoring is disabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
>Assignee: Sreekanth Ramakrishnan
> Attachments: mapred-834-20.patch, mapreduce-834-1.patch, 
> mapreduce-834-2.patch, mapreduce-834-3.patch, mapreduce-834-4.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.
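
A minimal Python sketch of the symptom in the log above, for readers who want to check a configuration by hand. The four key names and the -1 value are copied from the log; the function names are hypothetical, and this is not the TaskTracker's actual logic.

```python
# Key names copied from the "is no longer used" warnings in the log.
DEPRECATED_KEYS = (
    "mapred.tasktracker.vmem.reserved",
    "mapred.tasktracker.pmem.reserved",
    "mapred.task.default.maxvmem",
    "mapred.task.limit.maxvmem",
)

# Value the log reports for totalMemoryAllottedForTasks when the
# TaskMemoryManager is disabled.
DISABLED = -1


def deprecated_keys_in(conf):
    """Return the deprecated keys present in conf, in the order above."""
    return [key for key in DEPRECATED_KEYS if key in conf]


def memory_monitoring_enabled(total_memory_allotted):
    """Hypothetical check mirroring the final WARN line of the log."""
    return total_memory_allotted != DISABLED
```

A configuration that sets only the deprecated keys would, per the log, end up with a total of -1 and therefore with monitoring switched off.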

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-745:
-

Attachment: MAPREDUCE-745-v1.8-branch-0.20.patch
MAPREDUCE-745-v1.8.patch

Attaching patches for trunk and branch-0.20

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, 
> MAPREDUCE-745-v1.3.patch, MAPREDUCE-745-v1.7.patch, 
> MAPREDUCE-745-v1.8-branch-0.20.patch, MAPREDUCE-745-v1.8.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-745:
-

Attachment: (was: MAPREDUCE-745-v1.8.patch)

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, 
> MAPREDUCE-745-v1.3.patch, MAPREDUCE-745-v1.7.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-27) Jobs with 0 maps will never get removed from the default scheduler

2009-08-20 Thread Vinod K V (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-27?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745385#action_12745385
 ] 

Vinod K V commented on MAPREDUCE-27:


Duplicated by?

> Jobs with 0 maps will never get removed from the default scheduler
> --
>
> Key: MAPREDUCE-27
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-27
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amar Kamat
>Assignee: Amar Kamat
> Attachments: HADOOP-5049-v1.1.patch
>
>
> Jobs with 0 maps finish/succeed in the init phase, i.e. while the job is in 
> the _PREP_ state. {{EagerTaskInitializationListener}} removes the job after 
> initing, but {{JobQueueJobInProgressListener}} waits for a job-state change 
> event to be raised and only then removes the job from the queue; hence 
> the job will stay forever with the {{JobQueueJobInProgressListener}}. It looks 
> like {{FairScheduler}} periodically scans the job list and removes completed 
> jobs. {{CapacityScheduler}} has a concept of waiting jobs and scans the waiting 
> queue for completed jobs and purges them.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-745) TestRecoveryManager fails sometimes

2009-08-20 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-745:
-

Attachment: MAPREDUCE-745-v1.8.patch

Attaching a patch for trunk with a minor change. Changes to 
JobTracker.getDateFormat() are reverted. 

> TestRecoveryManager fails sometimes
> ---
>
> Key: MAPREDUCE-745
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-745
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-745-v1.0.patch, MAPREDUCE-745-v1.2.patch, 
> MAPREDUCE-745-v1.3.patch, MAPREDUCE-745-v1.7.patch, MAPREDUCE-745-v1.8.patch, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt, 
> TEST-org.apache.hadoop.mapred.TestRecoveryManager.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is disabled.

2009-08-20 Thread Sreekanth Ramakrishnan (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745384#action_12745384
 ] 

Sreekanth Ramakrishnan commented on MAPREDUCE-834:
--

The patch does not change any part of {{JobTracker}} or the streaming APIs.

The issue with {{TestRecoveryManager}} timing out is reported in MAPREDUCE-880, 
and the streaming test case failures are also a known, reported issue on Hudson.



> When TaskTracker config use old memory management values its memory 
> monitoring is disabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
>Assignee: Sreekanth Ramakrishnan
> Attachments: mapreduce-834-1.patch, mapreduce-834-2.patch, 
> mapreduce-834-3.patch, mapreduce-834-4.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-870) Clean up the job Retire code

2009-08-20 Thread Sharad Agarwal (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-870:
-

Status: Patch Available  (was: Open)

> Clean up the job Retire code
> 
>
> Key: MAPREDUCE-870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-870
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Sharad Agarwal
>Assignee: Sharad Agarwal
> Attachments: 870_v1.patch, 870_v2.patch, 870_v3.patch
>
>
> Currently, a completed job's full data structures are kept in memory based on 
> the mapred.jobtracker.completeuserjobs.maximum, 
> mapred.jobtracker.retirejob.interval.min, 
> mapred.jobtracker.retirejob.interval and mapred.jobtracker.retirejob.check 
> settings. These controls are no longer very useful, since MAPREDUCE-817 
> introduced a cache that keeps just the very basic info of the completed job. 
> These settings should be removed and the job should be purged as soon as the 
> history files are available in HDFS. 
> Going forward, clients can read the history files if they need to drill down 
> into more information (MAPREDUCE-864).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is disabled.

2009-08-20 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745375#action_12745375
 ] 

Hadoop QA commented on MAPREDUCE-834:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12417108/mapreduce-834-4.patch
  against trunk revision 806065.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/496/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/496/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/496/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/496/console

This message is automatically generated.

> When TaskTracker config use old memory management values its memory 
> monitoring is disabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
>Assignee: Sreekanth Ramakrishnan
> Attachments: mapreduce-834-1.patch, mapreduce-834-2.patch, 
> mapreduce-834-3.patch, mapreduce-834-4.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-894) DBInputformat not working with SQLServer

2009-08-20 Thread Budianto Lie (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Budianto Lie updated MAPREDUCE-894:
---

Hadoop Flags:   (was: [Reviewed])

> DBInputformat not working with SQLServer
> 
>
> Key: MAPREDUCE-894
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-894
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Budianto Lie
>
> org.apache.hadoop.mapreduce.lib.db.DBInputFormat
> Microsoft SQLServer doesn't support LIMIT and OFFSET.
> Fix:
> Based on MAPREDUCE-716, I have already implemented a fix by creating a new class 
> org.apache.hadoop.mapreduce.lib.db.MsSqlDBRecordReader 
> and modifying the class org.apache.hadoop.mapreduce.lib.db.DBInputFormat.
> Note: this fix works only with SQLServer 2005 or higher.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-890) After HADOOP-4491, the user who started mapred system is not able to run job.

2009-08-20 Thread Vinod K V (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated MAPREDUCE-890:


Component/s: tasktracker
Summary: After HADOOP-4491, the user who started mapred system is not 
able to run job.  (was: Afer 4491, the user who started mapred system is not 
able to run job.)

> After HADOOP-4491, the user who started mapred system is not able to run job.
> -
>
> Key: MAPREDUCE-890
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-890
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Reporter: Karam Singh
>
> Even the setup and cleanup tasks of the job fail due to an exception: it fails to 
> create the job and related directories under mapred.local.dir/taskTracker/jobcache.
> Directories are created as:
> [dr-xrws--- mapred   hadoop  ]  job_200908190916_0002
> The mapred user cannot write under this directory. Even manually, I failed to touch a file.
> mapred is the user who started the MR cluster.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-871) Job/Task local files have incorrect group ownership set by LinuxTaskController binary

2009-08-20 Thread Vinod K V (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated MAPREDUCE-871:


Attachment: MAPREDUCE-871-20090820.1.txt

Here's the scenario.
{quote}
  TT process user/group information: mapred group1,group2,group3 (group1 is primary)
  task-controller ownership: ---Sr-s--- 1 root  group2
  job submitter: userA:groupA
{quote}

The bug is in the ownership of files created for the task, e.g.
bq. $jobid dr-xrws--- userA group1

Attaching patch to fix this issue which essentially sets the permissions of 
files created for the task as
bq. $jobid dr-xrws--- userA group2

Also modified the testcase to reflect this.

> Job/Task local files have incorrect group ownership set by 
> LinuxTaskController binary
> -
>
> Key: MAPREDUCE-871
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-871
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: tasktracker
>Reporter: Vinod K V
>Assignee: Vinod K V
> Attachments: MAPREDUCE-871-20090820.1.txt
>
>
> HADOOP-4491 fixed the secure permissions of local files on a TT. While 
> testing HADOOP-4491 on a cluster, [~karams] found a bug. All the 
> files/dirs should be owned by the group corresponding to the group owner 
> of the task-controller binary (obtained via getegid()), which in turn is a 
> special group to which *only* the TT user belongs. HADOOP-4491 incorrectly set it 
> to the primary group of the TT via getgid(), and not the special group.
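
The getgid()/getegid() distinction in the description can be sketched in Python, whose os module exposes both calls. This is an illustration under assumptions, not the task-controller code (which is a native binary): task_file_group and assign_task_group are hypothetical names. For a setgid executable, the effective gid is the group owner of the binary, while the real gid remains the invoking process's primary group.

```python
import os
import tempfile


def task_file_group() -> int:
    """Group that should own task files: the effective gid.

    os.getgid() would return the real gid instead, i.e. the primary group
    of the invoking user, which is the mistake the description reports.
    """
    return os.getegid()


def assign_task_group(path: str) -> None:
    """Set only the group of path; uid -1 leaves the owner unchanged."""
    os.chown(path, -1, task_file_group())


if __name__ == "__main__":
    # In an ordinary (non-setgid) process both values are normally equal.
    print("real gid:", os.getgid(), "effective gid:", os.getegid())
    with tempfile.TemporaryDirectory() as job_dir:
        assign_task_group(job_dir)
        print("group of", job_dir, "is", os.stat(job_dir).st_gid)
```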

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-871) Job/Task local files have incorrect group ownership set by LinuxTaskController binary

2009-08-20 Thread Vinod K V (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated MAPREDUCE-871:


Status: Patch Available  (was: Open)

> Job/Task local files have incorrect group ownership set by 
> LinuxTaskController binary
> -
>
> Key: MAPREDUCE-871
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-871
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: tasktracker
>Reporter: Vinod K V
>Assignee: Vinod K V
> Attachments: MAPREDUCE-871-20090820.1.txt
>
>
> HADOOP-4491 fixed the secure permissions of local files on a TT. While 
> testing HADOOP-4491 on a cluster, [~karams] found a bug. All the 
> files/dirs should be owned by the group corresponding to the group owner 
> of the task-controller binary (obtained via getegid()), which in turn is a 
> special group to which *only* the TT user belongs. HADOOP-4491 incorrectly set it 
> to the primary group of the TT via getgid(), and not the special group.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-894) DBInputformat not working with SQLServer

2009-08-20 Thread Budianto Lie (JIRA)

DBInputformat not working with SQLServer


 Key: MAPREDUCE-894
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-894
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Budianto Lie


org.apache.hadoop.mapreduce.lib.db.DBInputFormat
Microsoft SQLServer doesn't support LIMIT and OFFSET.

Fix:
Based on MAPREDUCE-716, I have already implemented a fix by creating a new class 
org.apache.hadoop.mapreduce.lib.db.MsSqlDBRecordReader 
and modifying the class org.apache.hadoop.mapreduce.lib.db.DBInputFormat.

Note: this fix works only with SQLServer 2005 or higher.
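
One common way to page through rows without LIMIT and OFFSET is the ROW_NUMBER() window function, which SQL Server has supported since the 2005 release. The Python sketch below only builds such a query string. It is an assumption about the general technique, not the query used by the attached implementation, and build_paged_query, row_num and numbered are hypothetical names.

```python
def build_paged_query(table, columns, order_by, offset, length):
    """Build a SELECT returning rows offset+1 .. offset+length.

    Identifiers are interpolated as-is for brevity; real code would have
    to validate or quote table, columns and order_by.
    """
    offset, length = int(offset), int(length)
    if offset < 0 or length < 0:
        raise ValueError("offset and length must be non-negative")
    cols = ", ".join(columns)
    return (
        f"SELECT {cols} FROM ("
        f"SELECT {cols}, ROW_NUMBER() OVER (ORDER BY {order_by}) AS row_num "
        f"FROM {table}) AS numbered "
        f"WHERE row_num > {offset} AND row_num <= {offset + length}"
    )
```

Each input split would then ask for its own (offset, length) window, in the same way a LIMIT/OFFSET query does on databases that support those keywords.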



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-893) Provide an ability to refresh queue configuration without restart.

2009-08-20 Thread Hemanth Yamijala (JIRA)

Provide an ability to refresh queue configuration without restart.
--

 Key: MAPREDUCE-893
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-893
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker
Reporter: Hemanth Yamijala


While administering a cluster using multiple queues, administrators feel a need 
to refresh queue properties on the fly without needing to restart the 
JobTracker. This is partially supported for some properties such as queue ACLs 
(HADOOP-5396) and state (HADOOP-5913). The idea is to extend the facility to 
refresh other queue properties as well, including scheduler properties.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-886) After 4491, when task-controller exit with some error message, LinuxTaskController only ExitCodeException but does not prints the exit code of task-controller

2009-08-20 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-886:
-

Attachment: mapreduce-886-1.patch

Adding the exit code to the logging method. The exit code is logged only if the 
process has exited with a non-zero exit code.
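
The behaviour described in this comment can be sketched in Python as follows. This is not the patch itself (LinuxTaskController is Java); run_and_log, the logger name and the message text are hypothetical.

```python
import logging
import subprocess
import sys

LOG = logging.getLogger("task_controller_sketch")


def run_and_log(command):
    """Run command and log its exit code only when it is non-zero."""
    result = subprocess.run(command, capture_output=True, text=True)
    if result.returncode != 0:
        LOG.warning("Exit code from %s is: %d", command[0], result.returncode)
    return result.returncode


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    # A child process that exits with code 3 triggers the warning.
    run_and_log([sys.executable, "-c", "raise SystemExit(3)"])
    # A child that exits cleanly logs nothing.
    run_and_log([sys.executable, "-c", "pass"])
```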

> After 4491, when task-controller exit with some error message, 
> LinuxTaskController only ExitCodeException but does not prints the exit code 
> of task-controller
> --
>
> Key: MAPREDUCE-886
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-886
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.21.0
>Reporter: Karam Singh
> Attachments: mapreduce-886-1.patch
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

2009-08-20 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-370:
--

Fix Version/s: 0.21.0
   Status: Patch Available  (was: Open)

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> ---
>
> Key: MAPREDUCE-370
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: patch-370-1.txt, patch-370.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

2009-08-20 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-370:
--

Attachment: patch-370-1.txt

Patch with review comments from Sharad incorporated.


> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> ---
>
> Key: MAPREDUCE-370
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: patch-370-1.txt, patch-370.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-818) org.apache.hadoop.mapreduce.Counters.getGroup returns null if the group name doesn't exist.

2009-08-20 Thread Amareshwari Sriramadasu (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745342#action_12745342
 ] 

Amareshwari Sriramadasu commented on MAPREDUCE-818:
---

The -1 on contrib tests is due to MAPREDUCE-699.

> org.apache.hadoop.mapreduce.Counters.getGroup returns null if the group name 
> doesn't exist.
> --
>
> Key: MAPREDUCE-818
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-818
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
>Priority: Minor
> Fix For: 0.20.1
>
> Attachments: patch-818.txt
>
>
> org.apache.hadoop.mapreduce.Counters.getGroup returns null if the group name 
> doesn't exist. But the documentation says it returns an empty group if there 
> is none with the specified name.
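
The documented contract (an empty group instead of null) can be sketched with a toy Python model. The classes below are hypothetical stand-ins, not the Hadoop Counters API, and registering the new group on first lookup is a design choice of this sketch rather than something the issue states.

```python
class CounterGroup:
    """Toy counter group holding counter name -> value."""

    def __init__(self, name):
        self.name = name
        self.counters = {}

    def __len__(self):
        return len(self.counters)


class Counters:
    """Toy container of counter groups."""

    def __init__(self):
        self._groups = {}

    def get_group(self, name):
        # setdefault creates and registers an empty group on first lookup,
        # so callers never receive None and need no null check.
        return self._groups.setdefault(name, CounterGroup(name))
```

With this shape, a caller can iterate over or size the result of get_group for an unknown name without a guard.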

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-856) Localized files from DistributedCache should have right access-control

2009-08-20 Thread Vinod K V (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated MAPREDUCE-856:


Attachment: MAPREDUCE-856-20090820.txt

Attaching patch. It has to be applied over the latest one at MAPREDUCE-476. 
This does the following:

- Changes dir structure to the following:
{code}
$mapred.local.dir
   `-- taskTracker
       `-- $user
           |-- distcache
           |
           `-- jobcache
{code}

 - Adds a new initializeDistributedCache(InitializationContext context) to 
secure distributed cache files via TaskRunner

 - Adds a new INITIALIZE_DISTRIBUTEDCACHE command for LinuxTaskController 
binary which takes a user name as argument and sets private permissions for the 
corresponding distcache files in $mapred.local.dir/taskTracker/$user/distcache

 - Adds a new INITIALIZE_USER command to set secure permissions for the 
$mapred.local.dir/taskTracker/$user directories.

 - Adds user as part of Task serialization as TT needs user name even before 
job-conf is localized. The user-name is needed for creating user specific 
directories $mapred.local.dir/taskTracker/$user.

 - Moves some of the old and newly added localization code in TaskTracker.java 
to org.apache.hadoop.mapreduce.server.tasktracker.Localizer as per 
MAPREDUCE-759 and MAPREDUCE-303. Had to change the scope of a few things to 
public, but marked them only for internal use.

 - Adds functionality to clean up stale user directories whenever possible.

 - Adds a new test in TestTaskTrackerLocalization to verify cleanup of task 
files when needed.

 - Adds a new testcase 
TestTrackerDistributedCacheManagerWithLinuxTaskController to verify permissions 
set on distributed cache files by LinuxTaskController
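
The per-user layout listed above can be expressed as a few path helpers. This Python sketch only mirrors the directory names given in the comment; the function names and the example local directory are hypothetical, and the patch's own Java code may compose these paths differently.

```python
import posixpath


def user_dir(local_dir, user):
    """$mapred.local.dir/taskTracker/$user"""
    return posixpath.join(local_dir, "taskTracker", user)


def distcache_dir(local_dir, user):
    """Per-user directory for localized distributed cache files."""
    return posixpath.join(user_dir(local_dir, user), "distcache")


def jobcache_dir(local_dir, user):
    """Per-user directory for job-local files."""
    return posixpath.join(user_dir(local_dir, user), "jobcache")
```

Keeping both caches under one $user directory is what lets a single command secure everything belonging to that user, as the INITIALIZE_USER item above describes.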

> Localized files from DistributedCache should have right access-control
> --
>
> Key: MAPREDUCE-856
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-856
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: tasktracker
>Reporter: Arun C Murthy
>Assignee: Vinod K V
>     Attachments: MAPREDUCE-856-20090820.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is disabled.

2009-08-20 Thread Hemanth Yamijala (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala updated MAPREDUCE-834:
---

Assignee: Sreekanth Ramakrishnan
  Status: Patch Available  (was: Open)

Last patch looks fine to me. +1. Running through Hudson. Let's wait for the 
hudson run before uploading the 20 patch.

> When TaskTracker config use old memory management values its memory 
> monitoring is disabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
>Assignee: Sreekanth Ramakrishnan
> Attachments: mapreduce-834-1.patch, mapreduce-834-2.patch, 
> mapreduce-834-3.patch, mapreduce-834-4.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
