date:20090819

[jira] Updated: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is diabled.

2009-08-19 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-834:
-

Attachment: mapreduce-834-4.patch

Removing an unused variable from the test case.

> When TaskTracker config use old memory management values its memory 
> monitoring is diabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
> Attachments: mapreduce-834-1.patch, mapreduce-834-2.patch, 
> mapreduce-834-3.patch, mapreduce-834-4.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-284) Improvements to RPC between Child and TaskTracker

2009-08-19 Thread Ravi Gummadi (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745332#action_12745332
 ] 

Ravi Gummadi commented on MAPREDUCE-284:


unit tests passed on my local machine.

ant test-patch gave:

 [exec] -1 overall.
 [exec]
 [exec] +1 @author.  The patch does not contain any @author tags.
 [exec]
 [exec] -1 tests included.  The patch doesn't appear to include any new 
or modified tests.
 [exec] Please justify why no new tests are needed 
for this patch.
 [exec] Also please list what manual steps were 
performed to verify this patch.
 [exec]
 [exec] +1 javadoc.  The javadoc tool did not generate any warning 
messages.
 [exec]
 [exec] +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
 [exec]
 [exec] +1 findbugs.  The patch does not introduce any new Findbugs 
warnings.
 [exec]
 [exec] +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.

I don't see a simple way to add a unit testcase for this.

> Improvements to RPC between Child and TaskTracker
> -
>
> Key: MAPREDUCE-284
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-284
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Arun C Murthy
>Assignee: Ravi Gummadi
> Fix For: 0.21.0
>
> Attachments: MR-284.patch, MR-284.v1.patch, MR-284.v2.patch
>
>
> We could improve the RPC between the Child and TaskTracker:
>* Set ping interval lower by default to 5s
>* Disable nagle's algorithm (tcp no-delay)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-284) Improvements to RPC between Child and TaskTracker

2009-08-19 Thread Ravi Gummadi (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-284:
---

Attachment: MR-284.v2.patch

As changing ipc.client.tcpnodelay affects other clients(not just child to TT) 
and that needs more discussion, am uploading a new patch that sets tcpnodelay 
only in Child.java as part of this JIRA.

Please review and provide your comments.

> Improvements to RPC between Child and TaskTracker
> -
>
> Key: MAPREDUCE-284
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-284
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Arun C Murthy
>Assignee: Ravi Gummadi
> Fix For: 0.21.0
>
> Attachments: MR-284.patch, MR-284.v1.patch, MR-284.v2.patch
>
>
> We could improve the RPC between the Child and TaskTracker:
>* Set ping interval lower by default to 5s
>* Disable nagle's algorithm (tcp no-delay)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-833) Jobclient does not print any warning message when old memory config variable used with -D option from command line

2009-08-19 Thread Hemanth Yamijala (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala resolved MAPREDUCE-833.


Resolution: Duplicate

This was fixed as a part of MAPREDUCE-832, which introduced a log message in 
JobConf's constructor. By doing so, any time a new conf object is created on 
the client, these warning messages would get printed. Hence resolving as 
duplicate.

> Jobclient does not print any warning message when old memory config variable 
> used with -D option from command line
> --
>
> Key: MAPREDUCE-833
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-833
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1
>Reporter: Karam Singh
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-832) Too many WARN messages about deprecated memorty config variables in JobTacker log

2009-08-19 Thread Hemanth Yamijala (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala updated MAPREDUCE-832:
---

   Resolution: Fixed
Fix Version/s: 0.20.1
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

I just committed this to trunk and branch 0.20. Thanks, Rahul !

> Too many WARN messages about deprecated memorty config variables in JobTacker 
> log
> -
>
> Key: MAPREDUCE-832
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-832
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1
>Reporter: Karam Singh
>Assignee: rahul k singh
> Fix For: 0.20.1
>
> Attachments: mapreduce-832-20.patch, mapreduce-832-20.patch, 
> mapreduce-832.patch
>
>
> When user submit a mapred job using old memory config vairiable 
> (mapred.task.maxmem) followinig message too many times in JobTracker logs -:
> [
> WARN org.apache.hadoop.mapred.JobConf: The variable mapred.task.maxvmem is no 
> longer used instead use  mapred.job.map.memory.mb and 
> mapred.job.reduce.memory.mb
> ]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-370) Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.

2009-08-19 Thread Sharad Agarwal (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745326#action_12745326
 ] 

Sharad Agarwal commented on MAPREDUCE-370:
--

bq. If we do this, this will remove generate* methods from the api proposed.
I am +1 on removing generate*. This will simplify the api and overriding of 
MultipleOutputs is not required in case user wants to control the output path 
name.

> Change org.apache.hadoop.mapred.lib.MultipleOutputs to use new api.
> ---
>
> Key: MAPREDUCE-370
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-370
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
> Attachments: patch-370.txt
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-832) Too many WARN messages about deprecated memorty config variables in JobTacker log

2009-08-19 Thread Hemanth Yamijala (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala updated MAPREDUCE-832:
---

Attachment: mapreduce-832-20.patch

Patch for Hadoop 0.20 branch. Only source paths have been changed.

> Too many WARN messages about deprecated memorty config variables in JobTacker 
> log
> -
>
> Key: MAPREDUCE-832
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-832
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1
>Reporter: Karam Singh
>Assignee: rahul k singh
> Attachments: mapreduce-832-20.patch, mapreduce-832-20.patch, 
> mapreduce-832.patch
>
>
> When user submit a mapred job using old memory config vairiable 
> (mapred.task.maxmem) followinig message too many times in JobTracker logs -:
> [
> WARN org.apache.hadoop.mapred.JobConf: The variable mapred.task.maxvmem is no 
> longer used instead use  mapred.job.map.memory.mb and 
> mapred.job.reduce.memory.mb
> ]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-832) Too many WARN messages about deprecated memorty config variables in JobTacker log

2009-08-19 Thread Hemanth Yamijala (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745318#action_12745318
 ] 

Hemanth Yamijala commented on MAPREDUCE-832:


bq. -1 tests included. The patch doesn't appear to include any new or modified 
tests.

The patch is adding a debug log message which is hard to write a unit test for. 
Our QE team has manually verified the patch by running tests on a cluster with 
old memory configuration and verifying that the number of WARN messages has 
come down drastically. Also, the scenario in MAPREDUCE-833 has been tested and 
found fixed.

bq. -1 core tests. The patch failed core unit tests.

org.apache.hadoop.mapred.TestRecoveryManager.testRestartCount's failure is 
independent of this patch. It happens with trunk and is being tracked in 
MAPREDUCE-880

bq. -1 contrib tests. The patch failed contrib unit tests.

The streaming test failures have been happening for a while on hudson alone. 
They pass locally.

On this basis, I am going to commit the patch to trunk and an equivalent patch 
to branch 20.

> Too many WARN messages about deprecated memorty config variables in JobTacker 
> log
> -
>
> Key: MAPREDUCE-832
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-832
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1
>Reporter: Karam Singh
>Assignee: rahul k singh
> Attachments: mapreduce-832-20.patch, mapreduce-832.patch
>
>
> When user submit a mapred job using old memory config vairiable 
> (mapred.task.maxmem) followinig message too many times in JobTracker logs -:
> [
> WARN org.apache.hadoop.mapred.JobConf: The variable mapred.task.maxvmem is no 
> longer used instead use  mapred.job.map.memory.mb and 
> mapred.job.reduce.memory.mb
> ]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-807) Stray user files in mapred.system.dir with permissions other than 777 can prevent the jobtracker from starting up.

2009-08-19 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-807:
-

Attachment: MAPRED-807-v1.4.patch

Attaching a patch that fixes the issue at hand. Result of test-patch
[exec] -1 overall.  
 [exec] 
 [exec] +1 @author.  The patch does not contain any @author tags.
 [exec] 
 [exec] -1 tests included.  The patch doesn't appear to include any new 
or modified tests.
 [exec] Please justify why no new tests are needed 
for this patch.
 [exec] Also please list what manual steps were 
performed to verify this patch.
 [exec] 
 [exec] +1 javadoc.  The javadoc tool did not generate any warning 
messages.
 [exec] 
 [exec] +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
 [exec] 
 [exec] +1 findbugs.  The patch does not introduce any new Findbugs 
warnings.
 [exec] 
 [exec] +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.

Running ant tests.

> Stray user files in mapred.system.dir with permissions other than 777 can 
> prevent the jobtracker from starting up.
> --
>
> Key: MAPREDUCE-807
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-807
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Reporter: Amar Kamat
>Assignee: Amar Kamat
>Priority: Blocker
> Attachments: MAPRED-807-v1.1.patch, MAPRED-807-v1.2.patch, 
> MAPRED-807-v1.3.patch, MAPRED-807-v1.4.patch
>
>
> With restart disabled, the jobtracker does a _rm -rf_ of the 
> mapred.system.dir. If the mapred.system.dir contains user files with 
> permissions other than 777 then the jobtracker gets stuck in a loop trying to 
> delete the mapred.system.dir (and each time failing with 
> AccessControlException).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-824) Support a hierarchy of queues in the capacity scheduler

2009-08-19 Thread Hemanth Yamijala (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745312#action_12745312
 ] 

Hemanth Yamijala commented on MAPREDUCE-824:


Some comments:
- AbstractQueue.updateContext can move into QueueSchedulingContext, as all the 
state it is operating on is in QSC. 
- Also prevMapClusterCapacity and prevReduceClusterCapacity can also be moved 
to the context. They can be private, and renamed to prev*Capacity, dropping the 
'Cluster' because for container queues, they don't reflect the entire cluster 
capacity. Same naming change would apply to variables in QueueSchedulingContext 
(like setMapClusterCapacity, etc)
- AbstractQueue.getOrderedJobQueues, its not very clear that this is looking 
through the entire hierarchy. Also, it assumes that sorting is done before 
this. So, its not a very orthogonal API. Move this to the scheduler, and 
introduce a new API like AbstractQueue.getDescendentJobQueues().
- Override AbstractQueue.addChildren in JobQueue to throw an unsupported 
exception.
- Make AbstractQueue.getChildren package private and document it is for tests.
- I suggest we modify the algorithm in distributeUnConfiguredCapacity to follow 
this pattern to make it clearer:
{code}
for (Queue q : children) {
  if (q.capacity == -1) {
unconfigured.add(q);
  }
}

// distribute capacity for all unconfigured queues.

for (Queue q : children) {
  q.distributeUnconfiguredCapacity();
}
{code}
- I would suggest we provide equals and hashCode in AbstractQueue to be based 
on the queue Name. toString in AbstractQueue should print the queue name.
- I didn't understand the need for setting the capacity in conf in 
distributeUnconfiguredCapacity. It seems like requiring the Configuration 
instance to be passed to distributeUnconfiguredCapacity is creating an 
undesirable dependency. Can you check if we can break this dependency.
- distributeUnConfiguredCapacity will throw a Divide by zero if there is no 
queues without configured capacity.
- We don't need to pass the supportsPriority variable separately to the 
JobQueue's constructor. Let's set that directly in the 
JobQueue.QueueSchedulingContext which we are already passing to JobQueue.
- In JobQueue, methods like addWaitingJob etc should be private. Also, I think 
some of the methods can be folded. For e.g. makeJobRunning just calls 
addRunningJob, so we can refactor to remove makeJobRunning and call 
addRunningJob directly.
- TaskData seems out of place in TaskSchedulingContext. The scheduling context 
contains state w.r.to scheduler. TaskData is a simple abstraction that returns 
a view of job information based on the task type. So, let's pull it out and 
call it TaskDataView which can be extended by MapTaskDataView and 
ReduceTaskDataView. There should be only one ‌instance of these per scheduler 
instance and they can be got from the scheduler itself.
- Rename TaskSchedulingContext.add to TSC.update.
- Can we pull out the whole hierarchy building logic into a separate class - 
like a QueueHierarchyBuilder ? It could be given the CapacitySchedulerConf and 
QueueManager and have an API like buildHierarchy - which would return the root 
of the queues. Capacity scheduler can thus be abstracted from how the hierarchy 
is created - it just gets the hierarchy from somewhere. For e.g. in tests, the 
hierarchy can be manually created and given to the given.
- Please remove mapScheduler.initialize() and reduceScheduler.initialize().
- tsi.getMaxCapacity() < tsi.getCapacity(): this check in 
areTasksInQueueOverLimit does not seem required. Because the check is already 
being done in tsi.getCapacity()
- totalCapacity modification in the loadContext is a no-op, because the changes 
will not be reflected in the caller method. Likewise the check for 
totalCapacity > 100.0 is a no-op in createHierarchy.
- The separator char for queues is chosen to be '.' in createHierarchy. It must 
be checked that this character doesn't appear anywhere else in the queue name.
- Call to root.sort() should be from TaskSchedulingMgr.assignTasks()
- JobQueuesManager.createQueue should be addQueue. Also, it can get the queue 
name from the job queue object directly, and doesn't need the extra parameter.
- JobQueueManager.getQueueNames can be getJobQueueNames.


> Support a hierarchy of queues in the capacity scheduler
> ---
>
> Key: MAPREDUCE-824
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-824
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/capacity-sched
>Reporter: Hemanth Yamijala
> Attachments: HADOOP-824-1.patch, HADOOP-824-2.patch, 
> HADOOP-824-3.patch
>
>
> Currently in Capacity Scheduler, cluster capacity is divided among the queues 
> based on the queue capacity. These queues typically

[jira] Updated: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is diabled.

2009-08-19 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-834:
-

Attachment: mapreduce-834-3.patch

Attaching file fixing an issue found during internal testing. Modified test 
case and code.

> When TaskTracker config use old memory management values its memory 
> monitoring is diabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
> Attachments: mapreduce-834-1.patch, mapreduce-834-2.patch, 
> mapreduce-834-3.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-870) Clean up the job Retire code

2009-08-19 Thread Sharad Agarwal (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-870:
-

Attachment: 870_v3.patch

> Clean up the job Retire code
> 
>
> Key: MAPREDUCE-870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-870
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Sharad Agarwal
>Assignee: Sharad Agarwal
> Attachments: 870_v1.patch, 870_v2.patch, 870_v3.patch
>
>
> Currently completed job's full data structures are kept in memory based on 
> mapred.jobtracker.completeuserjobs.maximum, 
> mapred.jobtracker.retirejob.interval.min, 
> mapred.jobtracker.retirejob.interval and mapred.jobtracker.retirejob.check 
> settings. These controls are not much useful now since MAPREDUCE-817 
> introduced a cache for keeping just the very basic info of the completed job. 
> These settings should be removed and the job should be purged as soon as the 
> history files are available in HDFS. 
> Going forward, clients can read the history files if they need to drill down 
> into more information (MAPREDUCE-864).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-870) Clean up the job Retire code

2009-08-19 Thread Sharad Agarwal (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-870:
-

Status: Open  (was: Patch Available)

> Clean up the job Retire code
> 
>
> Key: MAPREDUCE-870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-870
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Sharad Agarwal
>Assignee: Sharad Agarwal
> Attachments: 870_v1.patch, 870_v2.patch
>
>
> Currently completed job's full data structures are kept in memory based on 
> mapred.jobtracker.completeuserjobs.maximum, 
> mapred.jobtracker.retirejob.interval.min, 
> mapred.jobtracker.retirejob.interval and mapred.jobtracker.retirejob.check 
> settings. These controls are not much useful now since MAPREDUCE-817 
> introduced a cache for keeping just the very basic info of the completed job. 
> These settings should be removed and the job should be purged as soon as the 
> history files are available in HDFS. 
> Going forward, clients can read the history files if they need to drill down 
> into more information (MAPREDUCE-864).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-882) TestJobHistory fails sometimes

2009-08-19 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat resolved MAPREDUCE-882.
--

Resolution: Duplicate

The problem comes because taskIDsToAttemptIDs structure is shared in 
TestJobHistory across tests. So if the job-id clash across testcases then the 
checks will fail. MAPREDUCE-745 should fix this.

> TestJobHistory fails sometimes
> --
>
> Key: MAPREDUCE-882
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-882
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Amar Kamat
>
> Testcase: testDoneFolderOnHDFS took 31.892 sec
> Testcase: testJobHistoryFile took 27.901 sec
> FAILED
> Duplicate START_TIME seen for task task_200908161937_0001_m_03 in history 
> file at line 5
> junit.framework.AssertionFailedError: Duplicate START_TIME seen for task 
> task_200908161937_0001_m_03 in history file at line 5
> at 
> org.apache.hadoop.mapred.TestJobHistory$TestListener.handle(TestJobHistory.java:164)
> at org.apache.hadoop.mapred.JobHistory.parseLine(JobHistory.java:505)
> at 
> org.apache.hadoop.mapred.JobHistory.parseHistoryFromFS(JobHistory.java:469)
> at 
> org.apache.hadoop.mapred.TestJobHistory.validateJobHistoryFileFormat(TestJobHistory.java:496)
> at 
> org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile(TestJobHistory.java:964)
> Testcase: testJobHistoryUserLogLocation took 75.161 sec
> Testcase: testJobHistoryJobStatus took 156.88 sec

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Assigned: (MAPREDUCE-882) TestJobHistory fails sometimes

2009-08-19 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat reassigned MAPREDUCE-882:


Assignee: Amar Kamat

> TestJobHistory fails sometimes
> --
>
> Key: MAPREDUCE-882
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-882
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Amar Kamat
>
> Testcase: testDoneFolderOnHDFS took 31.892 sec
> Testcase: testJobHistoryFile took 27.901 sec
> FAILED
> Duplicate START_TIME seen for task task_200908161937_0001_m_03 in history 
> file at line 5
> junit.framework.AssertionFailedError: Duplicate START_TIME seen for task 
> task_200908161937_0001_m_03 in history file at line 5
> at 
> org.apache.hadoop.mapred.TestJobHistory$TestListener.handle(TestJobHistory.java:164)
> at org.apache.hadoop.mapred.JobHistory.parseLine(JobHistory.java:505)
> at 
> org.apache.hadoop.mapred.JobHistory.parseHistoryFromFS(JobHistory.java:469)
> at 
> org.apache.hadoop.mapred.TestJobHistory.validateJobHistoryFileFormat(TestJobHistory.java:496)
> at 
> org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile(TestJobHistory.java:964)
> Testcase: testJobHistoryUserLogLocation took 75.161 sec
> Testcase: testJobHistoryJobStatus took 156.88 sec

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-891) Streaming tests fail with NPE in MiniDFSCluster

2009-08-19 Thread Vinod K V (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V resolved MAPREDUCE-891.
-

Resolution: Duplicate

Duplicate of MAPREDUCE-699

> Streaming tests fail with NPE in MiniDFSCluster
> ---
>
> Key: MAPREDUCE-891
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-891
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/streaming, test
>Reporter: Arun C Murthy
> Fix For: 0.21.0
>
>
> Streaming testcases' usage of MiniDFSCluster.startDatanodes causes NPE in 
> GenericOptionsParser:
> {noformat}
> java.lang.NullPointerException
>   at org.apache.commons.cli.GnuParser.flatten(GnuParser.java:110)
>   at org.apache.commons.cli.Parser.parse(Parser.java:143)
>   at 
> org.apache.hadoop.util.GenericOptionsParser.parseGeneralOptions(GenericOptionsParser.java:374)
>   at 
> org.apache.hadoop.util.GenericOptionsParser.(GenericOptionsParser.java:153)
>   at 
> org.apache.hadoop.util.GenericOptionsParser.(GenericOptionsParser.java:138)
>   at 
> org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:1314)
>   at 
> org.apache.hadoop.hdfs.MiniDFSCluster.startDataNodes(MiniDFSCluster.java:414)
>   at org.apache.hadoop.hdfs.MiniDFSCluster.(MiniDFSCluster.java:278)
>   at org.apache.hadoop.hdfs.MiniDFSCluster.(MiniDFSCluster.java:119)
>   at 
> org.apache.hadoop.streaming.TestDumpTypedBytes.testDumping(TestDumpTypedBytes.java:40)
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-157) Job History log file format is not friendly for external tools.

2009-08-19 Thread eric baldeschwieler (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745279#action_12745279
 ] 

eric baldeschwieler commented on MAPREDUCE-157:
---

Re: AVRO conversion

Doug and I chatted.  My concern is that we are working through a job history 
refactor with a bunch of moving parts.  I want to get those all into 21 and 
stable.  Until that is done I don't want to consider AVRO since it might put 
that rework at risk (since our team has already invested in JSON).  That said, 
I'm not against binary AVRO here.  It could have advantages.  If someone else 
can put in the time to demonstrate that this will work, I think that might be a 
better approach.  I just hope we can do that as a distinct patch that follows 
this one.

(Or collaborate to make one patch, we just don't have the resources before 21 
freeze)

(For the wider context of the refactor, see MAPREDUCE-863)

> Job History log file format is not friendly for external tools.
> ---
>
> Key: MAPREDUCE-157
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-157
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Owen O'Malley
>Assignee: Jothi Padmanabhan
>
> Currently, parsing the job history logs with external tools is very difficult 
> because of the format. The most critical problem is that newlines aren't 
> escaped in the strings. That makes using tools like grep, sed, and awk very 
> tricky.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-198) Log job history events to a common dump file

2009-08-19 Thread eric baldeschwieler (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

eric baldeschwieler resolved MAPREDUCE-198.
---

Resolution: Won't Fix

We've abandon this idea.  We will not be working on it. 


> Log job history events to a common dump file
> 
>
> Key: MAPREDUCE-198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>Reporter: Amar Kamat
>Assignee: Amar Kamat
>
> As of today all the jobhistory events are logged to separate files. It would 
> be nice to also dump all this info into a common file so that external tools 
> (e.g Chukwa) can harvest history info. Job configuration should also be 
> dumped. Whether to use a same log file for history dumps and configuration 
> dumps should be configurable (by default everything goes to one file). 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-751) Rumen: a tool to extract job characterization data from job tracker logs

2009-08-19 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745240#action_12745240
 ] 

Hadoop QA commented on MAPREDUCE-751:
-

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12417041/2009-08-19--1030.patch
  against trunk revision 805324.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 40 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

-1 javac.  The applied patch generated 2239 javac compiler warnings (more 
than the trunk's current 2232 warnings).

-1 findbugs.  The patch appears to introduce 8 new Findbugs warnings.

-1 release audit.  The applied patch generated 217 release audit warnings 
(more than the trunk's current 202 warnings).

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/495/testReport/
Release audit warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/495/artifact/trunk/patchprocess/releaseAuditDiffWarnings.txt
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/495/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/495/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/495/console

This message is automatically generated.

> Rumen: a tool to extract job characterization data from job tracker logs
> 
>
> Key: MAPREDUCE-751
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-751
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1, 0.21.0
>Reporter: Dick King
> Fix For: 0.20.1, 0.21.0
>
> Attachments: 2009-08-19--1030.patch, mapreduce-751--2009-07-23.patch
>
>
>  We propose a new map/reduce component, rumen, which can be used to process 
> job history logs to produce any or all of the following:
>   * Retrospective info describing the statistical behavior of the
> amount of time it would have taken to launch a job into a certain
> percentage of the number of mapper slots in the log's cluster, given the
> load over the period covered by the log
>   * Statistical info as to the runtimes and shuffle times, etc. of
> the tasks and jobs covered by the log
>   * files describing detailed job trace information, and the
> network topology as inferred from the host locations and rack IDs that
> arise in the job tracker log.  In addition to this facility, rumen
> includes readers for this information to return job and detailed task
> information to other tools.
> These other tools include a more advanced version of gridmix, and 
> also includes mumak: see blocked issues.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-157) Job History log file format is not friendly for external tools.

2009-08-19 Thread Doug Cutting (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745211#action_12745211
 ] 

Doug Cutting commented on MAPREDUCE-157:


Jothi, if you have an early version of this patch, please post it.  That way we 
can better evaluate converting it to use Avro.  Thanks!

> Job History log file format is not friendly for external tools.
> ---
>
> Key: MAPREDUCE-157
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-157
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Reporter: Owen O'Malley
>Assignee: Jothi Padmanabhan
>
> Currently, parsing the job history logs with external tools is very difficult 
> because of the format. The most critical problem is that newlines aren't 
> escaped in the strings. That makes using tools like grep, sed, and awk very 
> tricky.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-892) command line tool to list all tasktrackers and their status

2009-08-19 Thread dhruba borthakur (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

dhruba borthakur updated MAPREDUCE-892:
---

Assignee: Dmytro Molkov
 Summary: command line tool to list all tasktrackers and their status  
(was: command line tool to list all tasktracers and their status)

> command line tool to list all tasktrackers and their status
> ---
>
> Key: MAPREDUCE-892
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-892
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: dhruba borthakur
>Assignee: Dmytro Molkov
>
> The "hadoop mradmin -report" could list all the tasktrackers that the 
> JobTracker knows about. It will also list a brief status summary for each of 
> the TaskTracker. (This is similar to the hadop dfsadmin -report command that 
> lists all Datanodes)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-892) command line tool to list all tasktracers and their status

2009-08-19 Thread dhruba borthakur (JIRA)

command line tool to list all tasktracers and their status
--

 Key: MAPREDUCE-892
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-892
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: dhruba borthakur


The "hadoop mradmin -report" could list all the tasktrackers that the 
JobTracker knows about. It will also list a brief status summary for each of 
the TaskTracker. (This is similar to the hadop dfsadmin -report command that 
lists all Datanodes)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-886) After 4491, when task-controller exit with some error message, LinuxTaskController only ExitCodeException but does not prints the exit code of task-controller

2009-08-19 Thread Tsz Wo (Nicholas), SZE (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745155#action_12745155
 ] 

Tsz Wo (Nicholas), SZE commented on MAPREDUCE-886:
--

Do you mean MAPREDUCE-842 for "4491"?

> After 4491, when task-controller exit with some error message, 
> LinuxTaskController only ExitCodeException but does not prints the exit code 
> of task-controller
> --
>
> Key: MAPREDUCE-886
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-886
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.21.0
>Reporter: Karam Singh
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-751) Rumen: a tool to extract job characterization data from job tracker logs

2009-08-19 Thread Dick King (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dick King updated MAPREDUCE-751:


 Tags: rumen,mumakil,job tracker logs
Fix Version/s: 0.21.0
   0.20.1
Affects Version/s: 0.21.0
   0.20.1
   Status: Patch Available  (was: Open)

This patch implements Rumen as described by this issue.  Rumen consumes job 
tracker log directories and produces the job traces that mumakil and GridMMIX 
cosume.

> Rumen: a tool to extract job characterization data from job tracker logs
> 
>
> Key: MAPREDUCE-751
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-751
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1, 0.21.0
>Reporter: Dick King
> Fix For: 0.20.1, 0.21.0
>
> Attachments: 2009-08-19--1030.patch, mapreduce-751--2009-07-23.patch
>
>
>  We propose a new map/reduce component, rumen, which can be used to process 
> job history logs to produce any or all of the following:
>   * Retrospective info describing the statistical behavior of the
> amount of time it would have taken to launch a job into a certain
> percentage of the number of mapper slots in the log's cluster, given the
> load over the period covered by the log
>   * Statistical info as to the runtimes and shuffle times, etc. of
> the tasks and jobs covered by the log
>   * files describing detailed job trace information, and the
> network topology as inferred from the host locations and rack IDs that
> arise in the job tracker log.  In addition to this facility, rumen
> includes readers for this information to return job and detailed task
> information to other tools.
> These other tools include a more advanced version of gridmix, and 
> also includes mumak: see blocked issues.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-751) Rumen: a tool to extract job characterization data from job tracker logs

2009-08-19 Thread Dick King (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dick King updated MAPREDUCE-751:


Attachment: 2009-08-19--1030.patch

This is the patch that implements Rumen.  It is licensed to Apache.

> Rumen: a tool to extract job characterization data from job tracker logs
> 
>
> Key: MAPREDUCE-751
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-751
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Dick King
> Attachments: 2009-08-19--1030.patch, mapreduce-751--2009-07-23.patch
>
>
>  We propose a new map/reduce component, rumen, which can be used to process 
> job history logs to produce any or all of the following:
>   * Retrospective info describing the statistical behavior of the
> amount of time it would have taken to launch a job into a certain
> percentage of the number of mapper slots in the log's cluster, given the
> load over the period covered by the log
>   * Statistical info as to the runtimes and shuffle times, etc. of
> the tasks and jobs covered by the log
>   * files describing detailed job trace information, and the
> network topology as inferred from the host locations and rack IDs that
> arise in the job tracker log.  In addition to this facility, rumen
> includes readers for this information to return job and detailed task
> information to other tools.
> These other tools include a more advanced version of gridmix, and 
> also includes mumak: see blocked issues.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-336) The logging level of the tasks should be configurable by the job

2009-08-19 Thread Arun C Murthy (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745145#action_12745145
 ] 

Arun C Murthy commented on MAPREDUCE-336:
-

TestJobHistory failure is being tracked at MAPREDUCE-882.

I've opened MAPREDUCE-891 to track streaming test-failures.



> The logging level of the tasks should be configurable by the job
> 
>
> Key: MAPREDUCE-336
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-336
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Owen O'Malley
>Assignee: Arun C Murthy
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-336_0_20090818.patch
>
>
> It would be nice to be able to configure the logging level of the Task JVM's 
> separately from the server JVM's. Reducing logging substantially increases 
> performance and reduces the consumption of local disk on the task trackers.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-891) Streaming tests fail with NPE in MiniDFSCluster

2009-08-19 Thread Arun C Murthy (JIRA)

Streaming tests fail with NPE in MiniDFSCluster
---

 Key: MAPREDUCE-891
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-891
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/streaming, test
Reporter: Arun C Murthy
 Fix For: 0.21.0


Streaming testcases' usage of MiniDFSCluster.startDatanodes causes NPE in 
GenericOptionsParser:

{noformat}
java.lang.NullPointerException
at org.apache.commons.cli.GnuParser.flatten(GnuParser.java:110)
at org.apache.commons.cli.Parser.parse(Parser.java:143)
at 
org.apache.hadoop.util.GenericOptionsParser.parseGeneralOptions(GenericOptionsParser.java:374)
at 
org.apache.hadoop.util.GenericOptionsParser.(GenericOptionsParser.java:153)
at 
org.apache.hadoop.util.GenericOptionsParser.(GenericOptionsParser.java:138)
at 
org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:1314)
at 
org.apache.hadoop.hdfs.MiniDFSCluster.startDataNodes(MiniDFSCluster.java:414)
at org.apache.hadoop.hdfs.MiniDFSCluster.(MiniDFSCluster.java:278)
at org.apache.hadoop.hdfs.MiniDFSCluster.(MiniDFSCluster.java:119)
at 
org.apache.hadoop.streaming.TestDumpTypedBytes.testDumping(TestDumpTypedBytes.java:40)
{noformat}


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-876) Sqoop import of large tables can time out

2009-08-19 Thread Aaron Kimball (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745139#action_12745139
 ] 

Aaron Kimball commented on MAPREDUCE-876:
-

Failures are in capacity scheduler and streaming; unrelated.

> Sqoop import of large tables can time out
> -
>
> Key: MAPREDUCE-876
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-876
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/sqoop
>Reporter: Aaron Kimball
>Assignee: Aaron Kimball
> Attachments: MAPREDUCE-876.2.patch, MAPREDUCE-876.patch
>
>
> Related to MAPREDUCE-875, Sqoop should use a background thread to ensure that 
> progress is being reported while a database does external work for the 
> MapReduce task.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-777) A method for finding and tracking jobs from the new API

2009-08-19 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745124#action_12745124
 ] 

Hadoop QA commented on MAPREDUCE-777:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12416974/patch-777-2.txt
  against trunk revision 805324.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 9 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

-1 javac.  The applied patch generated 2272 javac compiler warnings (more 
than the trunk's current 2232 warnings).

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/494/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/494/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/494/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/494/console

This message is automatically generated.

> A method for finding and tracking jobs from the new API
> ---
>
> Key: MAPREDUCE-777
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-777
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: client
>Reporter: Owen O'Malley
>Assignee: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: patch-777-1.txt, patch-777-2.txt, patch-777.txt
>
>
> We need to create a replacement interface for the JobClient API in the new 
> interface. In particular, the user needs to be able to query and track jobs 
> that were launched by other processes.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-890) Afer 4491, the user who started mapred system is not able to run job.

2009-08-19 Thread Karam Singh (JIRA)

Afer 4491, the user who started mapred system is not able to run job.
-

 Key: MAPREDUCE-890
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-890
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Karam Singh


Even setup and cleanup task of job fails due exception -: It fails to create 
job and related directories under mapred.local.dir/taskTracker/jobcache
Directories are created as -:
[dr-xrws--- mapred   hadoop  ]  job_200908190916_0002
mapred is not wrtie under this. Even manually I failed to touch file.
mapred is use of started mr cluster 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-832) Too many WARN messages about deprecated memorty config variables in JobTacker log

2009-08-19 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745014#action_12745014
 ] 

Hadoop QA commented on MAPREDUCE-832:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12416966/mapreduce-832.patch
  against trunk revision 805324.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/493/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/493/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/493/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/493/console

This message is automatically generated.

> Too many WARN messages about deprecated memorty config variables in JobTacker 
> log
> -
>
> Key: MAPREDUCE-832
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-832
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1
>Reporter: Karam Singh
>Assignee: rahul k singh
> Attachments: mapreduce-832-20.patch, mapreduce-832.patch
>
>
> When user submit a mapred job using old memory config vairiable 
> (mapred.task.maxmem) followinig message too many times in JobTracker logs -:
> [
> WARN org.apache.hadoop.mapred.JobConf: The variable mapred.task.maxvmem is no 
> longer used instead use  mapred.job.map.memory.mb and 
> mapred.job.reduce.memory.mb
> ]

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-889) binary communication formats added to Streaming by HADOOP-1722 should be documented

2009-08-19 Thread Amareshwari Sriramadasu (JIRA)

binary communication formats added to Streaming by HADOOP-1722 should be 
documented
---

 Key: MAPREDUCE-889
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-889
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Reporter: Amareshwari Sriramadasu
Priority: Blocker
 Fix For: 0.21.0


binary communication formats added to Streaming by HADOOP-1722 should be 
documented in forrest

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-888) TestJobHistory sometimes fails while validating history.

2009-08-19 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat resolved MAPREDUCE-888.
--

Resolution: Duplicate

MAPREDUCE-882

> TestJobHistory sometimes fails while validating history.
> 
>
> Key: MAPREDUCE-888
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-888
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: test
>Reporter: Amar Kamat
>
> Error Message
> Duplicate START_TIME seen for task task_200908190021_0001_m_03 in history 
> file at line 5
> Stacktrace
> junit.framework.AssertionFailedError: Duplicate START_TIME seen for task 
> task_200908190021_0001_m_03 in history file at line 5
>   at 
> org.apache.hadoop.mapred.TestJobHistory$TestListener.handle(TestJobHistory.java:163)
>   at org.apache.hadoop.mapred.JobHistory.parseLine(JobHistory.java:497)
>   at 
> org.apache.hadoop.mapred.JobHistory.parseHistoryFromFS(JobHistory.java:463)
>   at 
> org.apache.hadoop.mapred.TestJobHistory.validateJobHistoryFileFormat(TestJobHistory.java:486)
>   at 
> org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile(TestJobHistory.java:955)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-888) TestJobHistory sometimes fails while validating history.

2009-08-19 Thread Amar Kamat (JIRA)

TestJobHistory sometimes fails while validating history.


 Key: MAPREDUCE-888
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-888
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: test
Reporter: Amar Kamat


Error Message

Duplicate START_TIME seen for task task_200908190021_0001_m_03 in history 
file at line 5

Stacktrace

junit.framework.AssertionFailedError: Duplicate START_TIME seen for task 
task_200908190021_0001_m_03 in history file at line 5
at 
org.apache.hadoop.mapred.TestJobHistory$TestListener.handle(TestJobHistory.java:163)
at org.apache.hadoop.mapred.JobHistory.parseLine(JobHistory.java:497)
at 
org.apache.hadoop.mapred.JobHistory.parseHistoryFromFS(JobHistory.java:463)
at 
org.apache.hadoop.mapred.TestJobHistory.validateJobHistoryFileFormat(TestJobHistory.java:486)
at 
org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile(TestJobHistory.java:955)


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-27) Jobs with 0 maps will never get removed from the default scheduler

2009-08-19 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-27?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat resolved MAPREDUCE-27.
-

Resolution: Duplicate

> Jobs with 0 maps will never get removed from the default scheduler
> --
>
> Key: MAPREDUCE-27
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-27
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amar Kamat
>Assignee: Amar Kamat
> Attachments: HADOOP-5049-v1.1.patch
>
>
> Jobs' with 0 maps finish/succeed in the init phase i.e while the job is in 
> the _PREP_ state. {{EagerTaskInitializationListener}} removes the job after 
> initing but {{JobQueueJobInProgressListener}} waits for a job-state change 
> event to be raised and aonly then removes the job from the queue and hence 
> the job will stay forever with the {{JobQueueJobInProgressListener}}. Looks 
> like {{FairScheduler}} periodically scans the job list and removes completed 
> jobs. {{CapacityScheduler}} has a concept of waiting jobs and scans waiting 
> queue for completed jobs and purges them.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-849) Renaming of configuration property names in mapreduce

2009-08-19 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-849:
--

Attachment: Config changes.xls

Attaching the document listing oldname and newname with proposed naming 
convention.



> Renaming of configuration property names in mapreduce
> -
>
> Key: MAPREDUCE-849
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-849
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
> Fix For: 0.21.0
>
> Attachments: Config changes.xls
>
>
> In-line with HDFS-531, property names in configuration files should be 
> standardized in MAPREDUCE. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-19 Thread Amar Kamat (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12744986#action_12744986
 ] 

Amar Kamat commented on MAPREDUCE-430:
--

bq. ant tests (core + core) passed except TestJobHistory. 
I meant core + contrib

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Obesrved a task with OutOfMemory error, stuck in cleanup.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-19 Thread Amar Kamat (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12744985#action_12744985
 ] 

Amar Kamat commented on MAPREDUCE-430:
--

ant tests (core + core) passed except TestJobHistory.

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Obesrved a task with OutOfMemory error, stuck in cleanup.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-336) The logging level of the tasks should be configurable by the job

2009-08-19 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12744960#action_12744960
 ] 

Hadoop QA commented on MAPREDUCE-336:
-

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12416955/MAPREDUCE-336_0_20090818.patch
  against trunk revision 805324.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/492/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/492/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/492/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/492/console

This message is automatically generated.

> The logging level of the tasks should be configurable by the job
> 
>
> Key: MAPREDUCE-336
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-336
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Reporter: Owen O'Malley
>Assignee: Arun C Murthy
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-336_0_20090818.patch
>
>
> It would be nice to be able to configure the logging level of the Task JVM's 
> separately from the server JVM's. Reducing logging substantially increases 
> performance and reduces the consumption of local disk on the task trackers.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-516) Fix the 'cluster drain' problem in the Capacity Scheduler wrt High RAM Jobs

2009-08-19 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-516:
-

Attachment: mr516-ydist.patch

Attaching Yahoo! distribution patch.

> Fix the 'cluster drain' problem in the Capacity Scheduler wrt High RAM Jobs
> ---
>
> Key: MAPREDUCE-516
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-516
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.1
>Reporter: Arun C Murthy
>Assignee: Arun C Murthy
> Fix For: 0.21.0
>
> Attachments: HADOOP-5964-11.patch, HADOOP-5964-12.patch, 
> HADOOP-5964_0_20090602.patch, HADOOP-5964_10_20090622.patch, 
> HADOOP-5964_11_20090623.patch, HADOOP-5964_1_20090608.patch, 
> HADOOP-5964_1_20090623_yhadoop.patch, HADOOP-5964_2_20090609.patch, 
> HADOOP-5964_2_20090629_yhadoop.patch, HADOOP-5964_2_20090629_yhadoop.patch, 
> HADOOP-5964_4_20090615.patch, HADOOP-5964_6_20090617.patch, 
> HADOOP-5964_7_20090618.patch, HADOOP-5964_8_20090618.patch, 
> HADOOP-5964_9_20090619.patch, HADOOP-5964_9_20090619.patch, 
> MAPREDUCE-516-Y20.patch, mr516-ydist.patch
>
>
> When a HighRAMJob turns up at the head of the queue, the current 
> implementation of support for HighRAMJobs in the Capacity Scheduler has 
> problem in that the scheduler stops assigning tasks to all TaskTrackers in 
> the cluster until a HighRAMJob finds a suitable TaskTrackers for all its 
> tasks.
> This causes a severe utilization problem since effectively no new tasks are 
> allowed to run until the HighRAMJob (at the head of the queue) gets slots.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-887) After 4491, task cleaup directory some gets created under the ownershiptasktracker user instread job submitting.

2009-08-19 Thread Karam Singh (JIRA)

After 4491, task cleaup directory some gets created under the 
ownershiptasktracker user instread job submitting.


 Key: MAPREDUCE-887
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-887
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Affects Versions: 0.21.0
Reporter: Karam Singh


Some time, when task is killed, task cleanup directory is created under the 
ownership tasktracker launching user instead job submitting user.
dr-xrws--- karams   hadoop  ]  job_200908170914_0020
 |-- [drwxr-sr-x mapred   hadoop  ]  
attempt_200908170914_0020_m_02_0.cleanup
 `-- [drwxrws--- karams   hadoop  ]  
attempt_200908170914_0020_m_12_0


Here karams is user who submitted job and mapred is the use who launched TT. 
taskattrempt.cleanup created with mapred  user not with karams user.
This issue is intermittent, not always reproducible. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-886) After 4491, when task-controller exit with some error message, LinuxTaskController only ExitCodeException but does not prints the exit code of task-controller

2009-08-19 Thread Karam Singh (JIRA)

After 4491, when task-controller exit with some error message, 
LinuxTaskController only ExitCodeException but does not prints the exit code of 
task-controller
--

 Key: MAPREDUCE-886
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-886
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Affects Versions: 0.21.0
Reporter: Karam Singh




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-181) mapred.system.dir should be accessible only to hadoop daemons

2009-08-19 Thread Devaraj Das (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12744944#action_12744944
 ] 

Devaraj Das commented on MAPREDUCE-181:
---

Some more details on the split file handling:
1) The FileSystem used for writing the split bytes would be the same filesystem 
where mapred.system.dir is located.
2) The split info (actual split bytes) would get written to the user's home 
directory on that filesystem (e.g., /user//.mapreduce/jobid)
3) The split info can be cleaned up by the cleanup task of the job.
For now, let's postpone the special handling for the JobConf, and instead put a 
cap on the max size (like 1 MB).

> mapred.system.dir should be accessible only to hadoop daemons 
> --
>
> Key: MAPREDUCE-181
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-181
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amar Kamat
>Assignee: Amar Kamat
> Attachments: hadoop-3578-branch-20-example-2.patch, 
> hadoop-3578-branch-20-example.patch, HADOOP-3578-v2.6.patch, 
> HADOOP-3578-v2.7.patch
>
>
> Currently the jobclient accesses the {{mapred.system.dir}} to add job 
> details. Hence the {{mapred.system.dir}} has the permissions of 
> {{rwx-wx-wx}}. This could be a security loophole where the job files might 
> get overwritten/tampered after the job submission. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-834) When TaskTracker config use old memory management values its memory monitoring is diabled.

2009-08-19 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-834:
-

Attachment: mapreduce-834-2.patch

Attaching patch fixing using the correct key and converting the same to MB 
while allotting total memory alloted to the tasks.

> When TaskTracker config use old memory management values its memory 
> monitoring is diabled.
> --
>
> Key: MAPREDUCE-834
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-834
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Karam Singh
> Attachments: mapreduce-834-1.patch, mapreduce-834-2.patch
>
>
> TaskTracker memory config values -:
> mapred.tasktracker.vmem.reserved=8589934592
> mapred.task.default.maxvmem=2147483648
> mapred.task.limit.maxvmem=4294967296
> mapred.tasktracker.pmem.reserved=2147483648
> TaskTracker start as -:
>2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.vmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.tasktracker.pmem.reserved is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable 
> mapred.task.default.maxvmem is no longer used
>   2009-08-05 12:39:03,308 WARN 
> org.apache.hadoop.mapred.TaskTracker: The variable mapred.task.limit.maxvmem 
> is no longer used
>   2009-08-05 12:39:03,308 INFO 
> org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for 
> all reduce tasks on 
>   2009-08-05 12:39:03,309 INFO 
> org.apache.hadoop.mapred.TaskTracker:  Using MemoryCalculatorPlugin : 
> org.apache.hadoop.util.linuxmemorycalculatorplu...@19be4777
>   2009-08-05 12:39:03,311 WARN 
> org.apache.hadoop.mapred.TaskTracker: TaskTracker's 
> totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-430) Task stuck in cleanup with OutOfMemoryErrors

2009-08-19 Thread Amar Kamat (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-430:
-

Attachment: MAPREDUCE-430-v1.8.patch

Attaching a patch that avoid finally being called upon errors. Result of 
test-patch
[exec] +1 overall.  
 [exec] 
 [exec] +1 @author.  The patch does not contain any @author tags.
 [exec] 
 [exec] +1 tests included.  The patch appears to include 3 new or 
modified tests.
 [exec] 
 [exec] +1 javadoc.  The javadoc tool did not generate any warning 
messages.
 [exec] 
 [exec] +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
 [exec] 
 [exec] +1 findbugs.  The patch does not introduce any new Findbugs 
warnings.
 [exec] 
 [exec] +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.

Testing the patch.

> Task stuck in cleanup with OutOfMemoryErrors
> 
>
> Key: MAPREDUCE-430
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-430
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Amareshwari Sriramadasu
>Assignee: Amar Kamat
> Fix For: 0.20.1
>
> Attachments: MAPREDUCE-430-v1.6-branch-0.20.patch, 
> MAPREDUCE-430-v1.6.patch, MAPREDUCE-430-v1.7.patch, MAPREDUCE-430-v1.8.patch
>
>
> Obesrved a task with OutOfMemory error, stuck in cleanup.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

45 matches

Mail list logo