date:20100423

[jira] Commented: (MAPREDUCE-1545) Add 'first-task-launched' to job-summary

2010-04-23 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860479#action_12860479
 ] 

Hadoop QA commented on MAPREDUCE-1545:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12440269/mr-1545-trunk-v2.patch
  against trunk revision 937201.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/134/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/134/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/134/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/134/console

This message is automatically generated.

> Add 'first-task-launched' to job-summary
> 
>
> Key: MAPREDUCE-1545
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1545
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: jobtracker
>Reporter: Arun C Murthy
>Assignee: Luke Lu
> Fix For: 0.22.0
>
> Attachments: mr-1545-trunk-v1.patch, mr-1545-trunk-v2.patch, 
> mr-1545-y20s-v1.patch, mr-1545-y20s-v2.patch, mr-1545-y20s-v3.patch
>
>
> It would be useful to track 'first-task-launched' time to job-summary for 
> better reporting.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1718) job conf key for the services name of DelegationToken for HFTP url is constructed incorrectly in HFTPFileSystem

2010-04-23 Thread Boris Shkolnik (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boris Shkolnik updated MAPREDUCE-1718:
--

Attachment: MAPREDUCE-1718-BP20-2.patch

moved utility method to SecurityUtil.java

> job conf key for the services name of DelegationToken for HFTP url is 
> constructed incorrectly in HFTPFileSystem
> ---
>
> Key: MAPREDUCE-1718
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1718
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Boris Shkolnik
>Assignee: Boris Shkolnik
> Attachments: MAPREDUCE-1718-BP20-1.patch, MAPREDUCE-1718-BP20-2.patch
>
>
> the key (build in TokenCache) is hdfs.service.host_HOSTNAME.PORT, but 
> in HftpFileSystem it is sometimes built as hdfs.service.host_IP.PORT.
> Fix. change it to always be IP.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1718) job conf key for the services name of DelegationToken for HFTP url is constructed incorrectly in HFTPFileSystem

2010-04-23 Thread Boris Shkolnik (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860461#action_12860461
 ] 

Boris Shkolnik commented on MAPREDUCE-1718:
---

shouldn't. it is used as a key, so it just needs to be the same in both cases.

> job conf key for the services name of DelegationToken for HFTP url is 
> constructed incorrectly in HFTPFileSystem
> ---
>
> Key: MAPREDUCE-1718
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1718
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Boris Shkolnik
>Assignee: Boris Shkolnik
> Attachments: MAPREDUCE-1718-BP20-1.patch, MAPREDUCE-1718-BP20-2.patch
>
>
> the key (build in TokenCache) is hdfs.service.host_HOSTNAME.PORT, but 
> in HftpFileSystem it is sometimes built as hdfs.service.host_IP.PORT.
> Fix. change it to always be IP.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1641) Job submission should fail if same uri is added for mapred.cache.files and mapred.cache.archives

2010-04-23 Thread Dick King (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dick King updated MAPREDUCE-1641:
-

Attachment: duped-files-archives--off-0-20-101--2010-04-23--1819.patch

This patch applies to 0.20.101, not to trunk, and needs to be forward ported.  
However, it is licensed.

> Job submission should fail if same uri is added for mapred.cache.files and 
> mapred.cache.archives
> 
>
> Key: MAPREDUCE-1641
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1641
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: distributed-cache
>Reporter: Amareshwari Sriramadasu
>Assignee: Dick King
> Fix For: 0.22.0
>
> Attachments: BZ-3539321--off-0-20-101--2010-04-20.patch, 
> duped-files-archives--off-0-20-101--2010-04-21.patch, 
> duped-files-archives--off-0-20-101--2010-04-23--1819.patch
>
>
> The behavior of mapred.cache.files and mapred.cache.archives is different 
> during localization in the following way:
> If a jar file is added to mapred.cache.files,  it will be localized under 
> TaskTracker under a unique path. 
> If a jar file is added to mapred.cache.archives, it will be localized under a 
> unique path in a directory named the jar file name, and will be unarchived 
> under the same directory.
> If same jar file is passed for both the configurations, the behavior 
> undefined. Thus the job submission should fail.
> Currently, since distributed cache processes files before archives, the jar 
> file will be just localized and not unarchived.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1545) Add 'first-task-launched' to job-summary

2010-04-23 Thread Luke Lu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Luke Lu updated MAPREDUCE-1545:
---

Status: Patch Available  (was: Open)

> Add 'first-task-launched' to job-summary
> 
>
> Key: MAPREDUCE-1545
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1545
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: jobtracker
>Reporter: Arun C Murthy
>Assignee: Luke Lu
> Fix For: 0.22.0
>
> Attachments: mr-1545-trunk-v1.patch, mr-1545-trunk-v2.patch, 
> mr-1545-y20s-v1.patch, mr-1545-y20s-v2.patch, mr-1545-y20s-v3.patch
>
>
> It would be useful to track 'first-task-launched' time to job-summary for 
> better reporting.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1650) Exclude Private elements from generated MapReduce Javadoc

2010-04-23 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860394#action_12860394
 ] 

Hadoop QA commented on MAPREDUCE-1650:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12442685/MAPREDUCE-1650.patch
  against trunk revision 937201.

+1 @author.  The patch does not contain any @author tags.

+0 tests included.  The patch appears to be a documentation patch that 
doesn't require tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/133/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/133/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/133/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/133/console

This message is automatically generated.

> Exclude Private elements from generated MapReduce Javadoc
> -
>
> Key: MAPREDUCE-1650
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1650
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: documentation
>Reporter: Tom White
>Assignee: Tom White
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-1650.patch, MAPREDUCE-1650.patch, 
> MAPREDUCE-1650.patch
>
>
> Exclude elements annotated with InterfaceAudience.Private or 
> InterfaceAudience.LimitedPrivate from Javadoc and JDiff.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-1694) streaming documentation appears to be wrong on overriding settings w/-D

2010-04-23 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-1694.
-

Fix Version/s: 0.21.0
   Resolution: Fixed

Marking this as fixed in 0.21, hoping that it comes out sooner rather than 
later.

> streaming documentation appears to be wrong on overriding settings w/-D
> ---
>
> Key: MAPREDUCE-1694
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1694
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/streaming, documentation
>Affects Versions: 0.20.2
>Reporter: Allen Wittenauer
> Fix For: 0.21.0
>
>
> Throughout http://hadoop.apache.org/common/docs/current/streaming.html , 
> there are many examples that do "hadoop jar streaming blah 
> -Dsomething=something".  None of these examples appear to work anymore.  
> Moving the "-Dsomething=something" to be after "hadoop jar streaming" works.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1725) Fix MapReduce API incompatibilities between 0.20 and 0.21

2010-04-23 Thread Tom White (JIRA)

Fix MapReduce API incompatibilities between 0.20 and 0.21
-

 Key: MAPREDUCE-1725
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1725
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Reporter: Tom White
Assignee: Tom White
Priority: Blocker


A few API compatibilities have crept in since 0.20 (they are being tracked in 
MAPREDUCE-1623). These should be fixed before 0.21 is released.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1568) TrackerDistributedCacheManager should do deleteLocalPath asynchronously

2010-04-23 Thread Scott Chen (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860354#action_12860354
 ] 

Scott Chen commented on MAPREDUCE-1568:
---

Amareshwari, Actually, I do not have a strong preference on whether to use 
another cleanup thread or not. I think it is more elegant because 
getLocalCache() will actually just do get local cache instead of checking and 
start the cleaning thread. But on the other hand the current change is simple 
and safe.

What do you think?

> TrackerDistributedCacheManager should do deleteLocalPath asynchronously
> ---
>
> Key: MAPREDUCE-1568
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1568
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-1568.txt
>
>
> TrackerDistributedCacheManager.deleteCache() has been improved:
> MAPREDUCE-1302 makes TrackerDistributedCacheManager rename the caches in the 
> main thread and then delete them in the background 
> MAPREDUCE-1098 avoids global locking while do the renaming (renaming lots of 
> directories can also takes a long time)
> But the deleteLocalCache is still in the main thread of TaskRunner.run(). So 
> it will still slow down the task which triggers the deletion (originally this 
> will blocks all tasks, but it is fixed by MAPREDUCE-1098). Other tasks do not 
> wait for the deletion. The task which triggers the deletion should not wait 
> for this either. TrackerDistributedCacheManager should do deleteLocalPath() 
> asynchronously.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1568) TrackerDistributedCacheManager should do deleteLocalPath asynchronously

2010-04-23 Thread Scott Chen (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860350#action_12860350
 ] 

Scott Chen commented on MAPREDUCE-1568:
---

Hey Amareshwari,

deleteCache will first get the global lock of all cache and put the one needs 
with zero reference count in toBeDeleted (this is done by you guys in 
MAPREDUCE-1098). And the asynchronous deletion will start from there. 

When the deletion condition is valid, only one task will get the global lock 
and after it comes out of the global lock the deletion condition will no longer 
valid. So there cannot be two threads deleting same set of cache at the same 
moment.

{code}
  private void deleteCache(Configuration conf) throws IOException {
Collection toBeDeleted = new LinkedList();
synchronized (cachedArchives) {  // Global lock of all caches
// Find cache Status with refcount of zero and put them in to toBeDeleted
}

// do the deletion asynchronously, after releasing the global lock
...
cacheFileCleaner.start();
  }
{code}

A separate cleanup thread is another option. I think that will work fine as 
well. But that will require more change. I think the good thing about the 
current patch is that it is simple and safe.

> TrackerDistributedCacheManager should do deleteLocalPath asynchronously
> ---
>
> Key: MAPREDUCE-1568
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1568
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-1568.txt
>
>
> TrackerDistributedCacheManager.deleteCache() has been improved:
> MAPREDUCE-1302 makes TrackerDistributedCacheManager rename the caches in the 
> main thread and then delete them in the background 
> MAPREDUCE-1098 avoids global locking while do the renaming (renaming lots of 
> directories can also takes a long time)
> But the deleteLocalCache is still in the main thread of TaskRunner.run(). So 
> it will still slow down the task which triggers the deletion (originally this 
> will blocks all tasks, but it is fixed by MAPREDUCE-1098). Other tasks do not 
> wait for the deletion. The task which triggers the deletion should not wait 
> for this either. TrackerDistributedCacheManager should do deleteLocalPath() 
> asynchronously.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1664) Job Acls affect Queue Acls

2010-04-23 Thread Ravi Gummadi (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-1664:


Attachment: 1664.20S.3.4.patch

Attaching patch for earlier version of hadoop. Not for commit here.

> Job Acls affect Queue Acls
> --
>
> Key: MAPREDUCE-1664
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1664
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: security
>Affects Versions: 0.22.0
>Reporter: Ravi Gummadi
>Assignee: Ravi Gummadi
> Fix For: 0.22.0
>
> Attachments: 1664.20S.3.4.patch
>
>
> MAPREDUCE-1307 introduced job ACLs for securing job level operations. So in 
> current trunk, queue ACLs and job ACLs are checked(with AND for both acls) 
> for allowing job level operations. So for doing operations like killJob, 
> killTask and setJobPriority user should be part of both 
> mapred.queue.{queuename}.acl-administer-jobs and in 
> mapreduce.job.acl-modify-job. This needs to change so that users who are part 
> of mapred.queue.{queuename}.acl-administer-jobs will be able to do 
> killJob,killTask,setJobPriority and users part of 
> mapreduce.job.acl-modify-job will be able to do 
> killJob,killTask,setJobPriority.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1612) job conf file is not accessible from job history web page

2010-04-23 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860346#action_12860346
 ] 

Hadoop QA commented on MAPREDUCE-1612:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12442668/MR-1619-1.patch
  against trunk revision 937201.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/132/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/132/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/132/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/132/console

This message is automatically generated.

> job conf file is not accessible from job history web page
> -
>
> Key: MAPREDUCE-1612
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1612
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Affects Versions: 0.22.0
>Reporter: Ravi Gummadi
>Assignee: Ravi Gummadi
> Fix For: 0.22.0
>
> Attachments: jobconf_history_jsp.fix.20S.patch, MR-1619-1.patch
>
>
> Clicking on conf file link from job history web page is causing an NPE if 
> history file(and the job conf file) are stored on DFS. This NPE is from 
> jobconf_history.jsp because jobConf built from path on DFS is not having any 
> properties.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1723) Capacity Scheduler should allow configuration of Map & Reduce task slots independently per queue

2010-04-23 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860340#action_12860340
 ] 

Allen Wittenauer commented on MAPREDUCE-1723:
-

FWIW, if pre-emption = kill rather than freeze, then I don't think that is 
efficient from a system throughput perspective.

> Capacity Scheduler should allow configuration of Map & Reduce task slots 
> independently per queue
> 
>
> Key: MAPREDUCE-1723
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1723
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/capacity-sched
>Affects Versions: 0.20.1
> Environment: all
>Reporter: Subramaniam Krishnan
> Fix For: 0.20.3
>
>
> The Capacity Scheduler allows configuration of percentage of task slots per 
> queue. We have a scenario in which our biggest queue (50% quota) has Jobs 
> with mainly Map tasks & we need to enforce strict capacity limits per queue 
> due to SLA requirements. So other smaller queues which require Reduce tasks 
> gets starved even though the Reduce slots are idle. The Grid can be more 
> efficiently utilized if Capacity Scheduler allows configuration of Map & 
> Reduce task slots capacity independently per queue.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1288) DistributedCache localizes only once per cache URI

2010-04-23 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860339#action_12860339
 ] 

Allen Wittenauer commented on MAPREDUCE-1288:
-

That sounds like really bad behavior.

Why should an old job fail because of what is, essentially, an external event?  

This still sounds like a blocker to me.

> DistributedCache localizes only once per cache URI
> --
>
> Key: MAPREDUCE-1288
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1288
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: security, tasktracker
>Affects Versions: 0.21.0
>Reporter: Devaraj Das
>Priority: Blocker
> Fix For: 0.21.0
>
>
> As part of the file localization the distributed cache localizer creates a 
> copy of the file in the corresponding user's private directory. The 
> localization in DistributedCache assumes the key as the URI of the cachefile 
> and if it already exists in the map, the localization is not done again. This 
> means that another user cannot access the same distributed cache file. We 
> should change the key to include the username so that localization is done 
> for every user.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1687) Stress submission policy does not always stress the cluster.

2010-04-23 Thread Hong Tang (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hong Tang updated MAPREDUCE-1687:
-

Attachment: mr-1687-yhadoop-20.1xx-20100423-2.patch

Fixed a minor bug in StressReaderThread.run() where the condition of the while 
loop waiting for cluster to be underloaded is incorrect.

> Stress submission policy does not always stress the cluster.
> 
>
> Key: MAPREDUCE-1687
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1687
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/gridmix
>Reporter: Hong Tang
> Attachments: mr-1687-yhadoop-20.1xx-20100416.patch, 
> mr-1687-yhadoop-20.1xx-20100423-2.patch, mr-1687-yhadoop-20.1xx-20100423.patch
>
>
> Currently, the rough idea of stress submission policy is to continue 
> submitting jobs until the pending map tasks reach 2x of the cluster capacity. 
> This proves to be inadequate and we saw a large job could monopolize the 
> whole cluster.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1650) Exclude Private elements from generated MapReduce Javadoc

2010-04-23 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White updated MAPREDUCE-1650:
-

Attachment: MAPREDUCE-1650.patch

> Exclude Private elements from generated MapReduce Javadoc
> -
>
> Key: MAPREDUCE-1650
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1650
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: documentation
>Reporter: Tom White
>Assignee: Tom White
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-1650.patch, MAPREDUCE-1650.patch, 
> MAPREDUCE-1650.patch
>
>
> Exclude elements annotated with InterfaceAudience.Private or 
> InterfaceAudience.LimitedPrivate from Javadoc and JDiff.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1650) Exclude Private elements from generated MapReduce Javadoc

2010-04-23 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White updated MAPREDUCE-1650:
-

   Status: Patch Available  (was: In Progress)
Fix Version/s: 0.21.0

> Exclude Private elements from generated MapReduce Javadoc
> -
>
> Key: MAPREDUCE-1650
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1650
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: documentation
>Reporter: Tom White
>Assignee: Tom White
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-1650.patch, MAPREDUCE-1650.patch, 
> MAPREDUCE-1650.patch
>
>
> Exclude elements annotated with InterfaceAudience.Private or 
> InterfaceAudience.LimitedPrivate from Javadoc and JDiff.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Work started: (MAPREDUCE-1650) Exclude Private elements from generated MapReduce Javadoc

2010-04-23 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAPREDUCE-1650 started by Tom White.

> Exclude Private elements from generated MapReduce Javadoc
> -
>
> Key: MAPREDUCE-1650
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1650
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: documentation
>Reporter: Tom White
>Assignee: Tom White
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-1650.patch, MAPREDUCE-1650.patch, 
> MAPREDUCE-1650.patch
>
>
> Exclude elements annotated with InterfaceAudience.Private or 
> InterfaceAudience.LimitedPrivate from Javadoc and JDiff.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1219) JobTracker Metrics causes undue load on JobTracker

2010-04-23 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860276#action_12860276
 ] 

Hadoop QA commented on MAPREDUCE-1219:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12442659/MR-1219-2.patch
  against trunk revision 937201.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/131/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/131/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/131/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/131/console

This message is automatically generated.

> JobTracker Metrics causes undue load on JobTracker
> --
>
> Key: MAPREDUCE-1219
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1219
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.21.0
>Reporter: Jothi Padmanabhan
>Assignee: Sreekanth Ramakrishnan
> Attachments: MAPREDUCE-1219.patch, MR-1219-1.patch, MR-1219-2.patch, 
> patch-1219-ydist.txt
>
>
> JobTrackerMetricsInst.doUpdates updates job-level counters of all running 
> jobs into JobTracker's metrics causing very bad performance and hampers 
> heartbeats. Since Job level metrics are better served by JobHistory, it may 
> be a good idea to remove these from the metrics framework.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1397) NullPointerException observed during task failures

2010-04-23 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860230#action_12860230
 ] 

Hadoop QA commented on MAPREDUCE-1397:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12442658/patch-1397-3.txt
  against trunk revision 936166.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/130/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/130/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/130/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h4.grid.sp2.yahoo.net/130/console

This message is automatically generated.

> NullPointerException observed during task failures
> --
>
> Key: MAPREDUCE-1397
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1397
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.20.1
>Reporter: Ramya R
>Assignee: Amareshwari Sriramadasu
>Priority: Minor
> Fix For: 0.22.0
>
> Attachments: patch-1397-1.txt, patch-1397-2.txt, patch-1397-3.txt, 
> patch-1397-ydist.txt, patch-1397.txt
>
>
> In an environment where many jobs are killed simultaneously, NPEs are 
> observed in the TT/JT logs when a task fails. The situation is aggravated 
> when the taskcontroller.cfg is not configured properly. Below is the 
> exception obtained:
> {noformat}
> INFO org.apache.hadoop.mapred.TaskInProgress: Error from :
> java.lang.Throwable: Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:529)
> Caused by: java.lang.NullPointerException
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.getDetails(JvmManager.java:329)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.reapJvm(JvmManager.java:315)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.access$000(JvmManager.java:146)
> at org.apache.hadoop.mapred.JvmManager.launchJvm(JvmManager.java:109)
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:502)
>  {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1612) job conf file is not accessible from job history web page

2010-04-23 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-1612:
--

Attachment: MR-1619-1.patch

Attaching patch for trunk.

> job conf file is not accessible from job history web page
> -
>
> Key: MAPREDUCE-1612
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1612
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Affects Versions: 0.22.0
>Reporter: Ravi Gummadi
>Assignee: Ravi Gummadi
> Fix For: 0.22.0
>
> Attachments: jobconf_history_jsp.fix.20S.patch, MR-1619-1.patch
>
>
> Clicking on conf file link from job history web page is causing an NPE if 
> history file(and the job conf file) are stored on DFS. This NPE is from 
> jobconf_history.jsp because jobConf built from path on DFS is not having any 
> properties.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1612) job conf file is not accessible from job history web page

2010-04-23 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-1612:
--

Status: Patch Available  (was: Open)

> job conf file is not accessible from job history web page
> -
>
> Key: MAPREDUCE-1612
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1612
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: jobtracker
>Affects Versions: 0.22.0
>Reporter: Ravi Gummadi
>Assignee: Ravi Gummadi
> Fix For: 0.22.0
>
> Attachments: jobconf_history_jsp.fix.20S.patch, MR-1619-1.patch
>
>
> Clicking on conf file link from job history web page is causing an NPE if 
> history file(and the job conf file) are stored on DFS. This NPE is from 
> jobconf_history.jsp because jobConf built from path on DFS is not having any 
> properties.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1724) JobTracker balks at empty String for locations

2010-04-23 Thread Craig Macdonald (JIRA)

JobTracker balks at empty String for locations
--

 Key: MAPREDUCE-1724
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1724
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Affects Versions: 0.20.1
 Environment: CDH 2 hadoop-0.20.1+169.68
Reporter: Craig Macdonald
Priority: Minor


If a split has locations which are "" (empty String), then the JobTracker will 
get upset during initialization:

2010-04-22 19:09:20,395 ERROR org.apache.hadoop.mapred.JobTracker: Job 
initialization failed:
java.lang.StringIndexOutOfBoundsException: String index out of range: 0
at java.lang.String.charAt(String.java:687)
at org.apache.hadoop.net.NetUtils.normalizeHostName(NetUtils.java:420)
at org.apache.hadoop.net.NetUtils.normalizeHostNames(NetUtils.java:443)
at 
org.apache.hadoop.net.CachedDNSToSwitchMapping.resolve(CachedDNSToSwitchMapping.java:42)
at 
org.apache.hadoop.mapred.JobTracker.resolveAndAddToTopology(JobTracker.java:2411)
at 
org.apache.hadoop.mapred.JobInProgress.createCache(JobInProgress.java:360)
at 
org.apache.hadoop.mapred.JobInProgress.initTasks(JobInProgress.java:462)
at org.apache.hadoop.mapred.JobTracker.initJob(JobTracker.java:3183)
at 
org.apache.hadoop.mapred.EagerTaskInitializationListener$InitJob.run(EagerTaskInitializationListener.java:79)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:885)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:907)
at java.lang.Thread.run(Thread.java:637)

Two key points:
 * This is different from Hadoop 0.18
 * CombineFileSplit has a constructor where String[] location is not specified, 
and hence the location array is populated with empty Strings.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1693) Process tree clean up of either a failed task or killed task tests.

2010-04-23 Thread Vinay Kumar Thota (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinay Kumar Thota updated MAPREDUCE-1693:
-

Attachment: taskchildskilling_1693.patch

Method name changes in utility class based on Iyappan review comments and this 
patch got effected.So uploading the latest path.

> Process tree clean up of either a failed task or killed task tests.
> ---
>
> Key: MAPREDUCE-1693
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1693
> Project: Hadoop Map/Reduce
>  Issue Type: Task
>  Components: test
>Reporter: Vinay Kumar Thota
>Assignee: Vinay Kumar Thota
> Attachments: taskchildskilling_1693.diff, 
> taskchildskilling_1693.diff, taskchildskilling_1693.patch, 
> taskchildskilling_1693.patch, taskchildskilling_1693.patch, 
> taskchildskilling_1693.patch, taskchildskilling_1693.patch
>
>
> The following scenarios covered in the test.
> 1. Run a job which spawns subshells in the tasks. Kill one of the task. All 
> the child process of the killed task must be killed.
> 2. Run a job which spawns subshells in tasks. Fail one of the task. All the 
> child process of the killed task must be killed along with the task after its 
> failure.
> 3. Check process tree cleanup on paritcular task-tracker when we use 
> -kill-task and -fail-task with both map and reduce.
> 4. Submit a job which would spawn child processes and each of the child 
> processes exceeds the memory limits. Let the job complete . Check if all the 
> child processes are killed, the overall job should fail.
> l)Submit a job which would spawn child processes and each of the child 
> processes exceeds the memory limits. Kill/fail the job while in progress. 
> Check if all the child processes are killed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1710) Process tree clean up of exceeding memory limit tasks.

2010-04-23 Thread Vinay Kumar Thota (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinay Kumar Thota updated MAPREDUCE-1710:
-

Attachment: memorylimittask_1710.patch

method signatures changed in Utility class based on Iyappan review comments and 
and it effects to this patch.So uploading the new patch for it.

> Process tree clean up of exceeding memory limit tasks.
> --
>
> Key: MAPREDUCE-1710
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1710
> Project: Hadoop Map/Reduce
>  Issue Type: Task
>  Components: test
>Reporter: Vinay Kumar Thota
>Assignee: Vinay Kumar Thota
> Attachments: memorylimittask_1710.patch, memorylimittask_1710.patch, 
> memorylimittask_1710.patch, memorylimittask_1710.patch
>
>
> 1. Submit a job which would spawn child processes and each of the child 
> processes exceeds the memory limits. Let the job complete . Check if all the 
> child processes are killed, the overall job should fail.
> 2. Submit a job which would spawn child processes and each of the child 
> processes exceeds the memory limits. Kill/fail the job while in progress. 
> Check if all the child processes are killed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1711) Gridmix should provide an option to submit jobs to the same queues as specified in the trace.

2010-04-23 Thread rahul k singh (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

rahul k singh updated MAPREDUCE-1711:
-

Attachment: MR-1711-yhadoop-20-1xx-4.patch

> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.
> -
>
> Key: MAPREDUCE-1711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1711
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/gridmix
>Reporter: Hong Tang
> Attachments: MR-1711-yhadoop-20-1xx-2.patch, 
> MR-1711-yhadoop-20-1xx-3.patch, MR-1711-yhadoop-20-1xx-4.patch, 
> MR-1711-yhadoop-20-1xx.patch, mr-1711-yhadoop-20.1xx-20100416.patch
>
>
> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1711) Gridmix should provide an option to submit jobs to the same queues as specified in the trace.

2010-04-23 Thread rahul k singh (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

rahul k singh updated MAPREDUCE-1711:
-

Attachment: MR-1711-yhadoop-20-1xx-3.patch

Implemented the changes suggested by hong

> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.
> -
>
> Key: MAPREDUCE-1711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1711
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/gridmix
>Reporter: Hong Tang
> Attachments: MR-1711-yhadoop-20-1xx-2.patch, 
> MR-1711-yhadoop-20-1xx-3.patch, MR-1711-yhadoop-20-1xx.patch, 
> mr-1711-yhadoop-20.1xx-20100416.patch
>
>
> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1723) Capacity Scheduler should allow configuration of Map & Reduce task slots independently per queue

2010-04-23 Thread Subramaniam Krishnan (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860198#action_12860198
 ] 

Subramaniam Krishnan commented on MAPREDUCE-1723:
-


Yes...I was just about to update my comment clarifying the same...:)

> Capacity Scheduler should allow configuration of Map & Reduce task slots 
> independently per queue
> 
>
> Key: MAPREDUCE-1723
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1723
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/capacity-sched
>Affects Versions: 0.20.1
> Environment: all
>Reporter: Subramaniam Krishnan
> Fix For: 0.20.3
>
>
> The Capacity Scheduler allows configuration of percentage of task slots per 
> queue. We have a scenario in which our biggest queue (50% quota) has Jobs 
> with mainly Map tasks & we need to enforce strict capacity limits per queue 
> due to SLA requirements. So other smaller queues which require Reduce tasks 
> gets starved even though the Reduce slots are idle. The Grid can be more 
> efficiently utilized if Capacity Scheduler allows configuration of Map & 
> Reduce task slots capacity independently per queue.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1723) Capacity Scheduler should allow configuration of Map & Reduce task slots independently per queue

2010-04-23 Thread Hemanth Yamijala (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860197#action_12860197
 ] 

Hemanth Yamijala commented on MAPREDUCE-1723:
-

bq. the proportion of Map & Reduce slots in our cluster would ideally be 3:4

You mean, 3 maps out of 4 slots ?

> Capacity Scheduler should allow configuration of Map & Reduce task slots 
> independently per queue
> 
>
> Key: MAPREDUCE-1723
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1723
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/capacity-sched
>Affects Versions: 0.20.1
> Environment: all
>Reporter: Subramaniam Krishnan
> Fix For: 0.20.3
>
>
> The Capacity Scheduler allows configuration of percentage of task slots per 
> queue. We have a scenario in which our biggest queue (50% quota) has Jobs 
> with mainly Map tasks & we need to enforce strict capacity limits per queue 
> due to SLA requirements. So other smaller queues which require Reduce tasks 
> gets starved even though the Reduce slots are idle. The Grid can be more 
> efficiently utilized if Capacity Scheduler allows configuration of Map & 
> Reduce task slots capacity independently per queue.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1723) Capacity Scheduler should allow configuration of Map & Reduce task slots independently per queue

2010-04-23 Thread Subramaniam Krishnan (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860195#action_12860195
 ] 

Subramaniam Krishnan commented on MAPREDUCE-1723:
-


Hemanth, the proportion of Map & Reduce slots in our cluster would ideally be 
3:4 (tending to even 4:5) but right now we have configured it at 2:3 to 
mitigate the effect described above.

> Capacity Scheduler should allow configuration of Map & Reduce task slots 
> independently per queue
> 
>
> Key: MAPREDUCE-1723
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1723
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/capacity-sched
>Affects Versions: 0.20.1
> Environment: all
>Reporter: Subramaniam Krishnan
> Fix For: 0.20.3
>
>
> The Capacity Scheduler allows configuration of percentage of task slots per 
> queue. We have a scenario in which our biggest queue (50% quota) has Jobs 
> with mainly Map tasks & we need to enforce strict capacity limits per queue 
> due to SLA requirements. So other smaller queues which require Reduce tasks 
> gets starved even though the Reduce slots are idle. The Grid can be more 
> efficiently utilized if Capacity Scheduler allows configuration of Map & 
> Reduce task slots capacity independently per queue.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1695) capacity scheduler is not included in findbugs/javadoc targets

2010-04-23 Thread Hemanth Yamijala (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala updated MAPREDUCE-1695:


   Status: Resolved  (was: Patch Available)
 Hadoop Flags: [Reviewed]
Fix Version/s: 0.22.0
   Resolution: Fixed

I just committed this. Thanks, Hong !

> capacity scheduler is not included in findbugs/javadoc targets
> --
>
> Key: MAPREDUCE-1695
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1695
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/capacity-sched
>Reporter: Hong Tang
>Assignee: Hong Tang
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-1695-2.patch, mapreduce-1695-20100420-2.patch, 
> mapreduce-1695-20100420-2.patch, mapreduce-1695-20100420.patch, 
> mapreduce-1695-20100421.patch, mapreduce-1695-20100422.patch, 
> MAPREDUCE-1695-3.patch, MAPREDUCE-1695.patch, 
> mr1695-hadoop-findbugs-report-1.html, mr1695-hadoop-findbugs-report-2.html
>
>
> Capacity Scheduler is not included in findbugs/javadoc targets.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1695) capacity scheduler is not included in findbugs/javadoc targets

2010-04-23 Thread Hemanth Yamijala (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860184#action_12860184
 ] 

Hemanth Yamijala commented on MAPREDUCE-1695:
-

+1. Looks good to me. This is ready to go.

> capacity scheduler is not included in findbugs/javadoc targets
> --
>
> Key: MAPREDUCE-1695
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1695
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/capacity-sched
>Reporter: Hong Tang
>Assignee: Hong Tang
> Attachments: MAPREDUCE-1695-2.patch, mapreduce-1695-20100420-2.patch, 
> mapreduce-1695-20100420-2.patch, mapreduce-1695-20100420.patch, 
> mapreduce-1695-20100421.patch, mapreduce-1695-20100422.patch, 
> MAPREDUCE-1695-3.patch, MAPREDUCE-1695.patch, 
> mr1695-hadoop-findbugs-report-1.html, mr1695-hadoop-findbugs-report-2.html
>
>
> Capacity Scheduler is not included in findbugs/javadoc targets.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1219) JobTracker Metrics causes undue load on JobTracker

2010-04-23 Thread Amareshwari Sriramadasu (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860180#action_12860180
 ] 

Amareshwari Sriramadasu commented on MAPREDUCE-1219:


+1 patch looks good.

> JobTracker Metrics causes undue load on JobTracker
> --
>
> Key: MAPREDUCE-1219
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1219
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.21.0
>Reporter: Jothi Padmanabhan
>Assignee: Sreekanth Ramakrishnan
> Attachments: MAPREDUCE-1219.patch, MR-1219-1.patch, MR-1219-2.patch, 
> patch-1219-ydist.txt
>
>
> JobTrackerMetricsInst.doUpdates updates job-level counters of all running 
> jobs into JobTracker's metrics causing very bad performance and hampers 
> heartbeats. Since Job level metrics are better served by JobHistory, it may 
> be a good idea to remove these from the metrics framework.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1219) JobTracker Metrics causes undue load on JobTracker

2010-04-23 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-1219:
--

Status: Patch Available  (was: Open)

> JobTracker Metrics causes undue load on JobTracker
> --
>
> Key: MAPREDUCE-1219
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1219
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.21.0
>Reporter: Jothi Padmanabhan
>Assignee: Sreekanth Ramakrishnan
> Attachments: MAPREDUCE-1219.patch, MR-1219-1.patch, MR-1219-2.patch, 
> patch-1219-ydist.txt
>
>
> JobTrackerMetricsInst.doUpdates updates job-level counters of all running 
> jobs into JobTracker's metrics causing very bad performance and hampers 
> heartbeats. Since Job level metrics are better served by JobHistory, it may 
> be a good idea to remove these from the metrics framework.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1219) JobTracker Metrics causes undue load on JobTracker

2010-04-23 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-1219:
--

Status: Open  (was: Patch Available)

> JobTracker Metrics causes undue load on JobTracker
> --
>
> Key: MAPREDUCE-1219
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1219
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.21.0
>Reporter: Jothi Padmanabhan
>Assignee: Sreekanth Ramakrishnan
> Attachments: MAPREDUCE-1219.patch, MR-1219-1.patch, MR-1219-2.patch, 
> patch-1219-ydist.txt
>
>
> JobTrackerMetricsInst.doUpdates updates job-level counters of all running 
> jobs into JobTracker's metrics causing very bad performance and hampers 
> heartbeats. Since Job level metrics are better served by JobHistory, it may 
> be a good idea to remove these from the metrics framework.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1219) JobTracker Metrics causes undue load on JobTracker

2010-04-23 Thread Sreekanth Ramakrishnan (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-1219:
--

Attachment: MR-1219-2.patch

Incorporating Amareshwaris comment.

> JobTracker Metrics causes undue load on JobTracker
> --
>
> Key: MAPREDUCE-1219
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1219
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.21.0
>Reporter: Jothi Padmanabhan
>Assignee: Sreekanth Ramakrishnan
> Attachments: MAPREDUCE-1219.patch, MR-1219-1.patch, MR-1219-2.patch, 
> patch-1219-ydist.txt
>
>
> JobTrackerMetricsInst.doUpdates updates job-level counters of all running 
> jobs into JobTracker's metrics causing very bad performance and hampers 
> heartbeats. Since Job level metrics are better served by JobHistory, it may 
> be a good idea to remove these from the metrics framework.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1397) NullPointerException observed during task failures

2010-04-23 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-1397:
---

Attachment: patch-1397-3.txt

Patch for trunk with changes in testcase.

> NullPointerException observed during task failures
> --
>
> Key: MAPREDUCE-1397
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1397
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.20.1
>Reporter: Ramya R
>Assignee: Amareshwari Sriramadasu
>Priority: Minor
> Fix For: 0.22.0
>
> Attachments: patch-1397-1.txt, patch-1397-2.txt, patch-1397-3.txt, 
> patch-1397-ydist.txt, patch-1397.txt
>
>
> In an environment where many jobs are killed simultaneously, NPEs are 
> observed in the TT/JT logs when a task fails. The situation is aggravated 
> when the taskcontroller.cfg is not configured properly. Below is the 
> exception obtained:
> {noformat}
> INFO org.apache.hadoop.mapred.TaskInProgress: Error from :
> java.lang.Throwable: Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:529)
> Caused by: java.lang.NullPointerException
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.getDetails(JvmManager.java:329)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.reapJvm(JvmManager.java:315)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.access$000(JvmManager.java:146)
> at org.apache.hadoop.mapred.JvmManager.launchJvm(JvmManager.java:109)
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:502)
>  {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1397) NullPointerException observed during task failures

2010-04-23 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-1397:
---

Status: Patch Available  (was: Open)

> NullPointerException observed during task failures
> --
>
> Key: MAPREDUCE-1397
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1397
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.20.1
>Reporter: Ramya R
>Assignee: Amareshwari Sriramadasu
>Priority: Minor
> Fix For: 0.22.0
>
> Attachments: patch-1397-1.txt, patch-1397-2.txt, patch-1397-3.txt, 
> patch-1397-ydist.txt, patch-1397.txt
>
>
> In an environment where many jobs are killed simultaneously, NPEs are 
> observed in the TT/JT logs when a task fails. The situation is aggravated 
> when the taskcontroller.cfg is not configured properly. Below is the 
> exception obtained:
> {noformat}
> INFO org.apache.hadoop.mapred.TaskInProgress: Error from :
> java.lang.Throwable: Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:529)
> Caused by: java.lang.NullPointerException
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.getDetails(JvmManager.java:329)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.reapJvm(JvmManager.java:315)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.access$000(JvmManager.java:146)
> at org.apache.hadoop.mapred.JvmManager.launchJvm(JvmManager.java:109)
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:502)
>  {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1397) NullPointerException observed during task failures

2010-04-23 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-1397:
---

Attachment: patch-1397-ydist.txt

Patch for yahoo! distribution with a couple of bug fixes in test case.

> NullPointerException observed during task failures
> --
>
> Key: MAPREDUCE-1397
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1397
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.20.1
>Reporter: Ramya R
>Assignee: Amareshwari Sriramadasu
>Priority: Minor
> Fix For: 0.22.0
>
> Attachments: patch-1397-1.txt, patch-1397-2.txt, 
> patch-1397-ydist.txt, patch-1397.txt
>
>
> In an environment where many jobs are killed simultaneously, NPEs are 
> observed in the TT/JT logs when a task fails. The situation is aggravated 
> when the taskcontroller.cfg is not configured properly. Below is the 
> exception obtained:
> {noformat}
> INFO org.apache.hadoop.mapred.TaskInProgress: Error from :
> java.lang.Throwable: Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:529)
> Caused by: java.lang.NullPointerException
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.getDetails(JvmManager.java:329)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.reapJvm(JvmManager.java:315)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.access$000(JvmManager.java:146)
> at org.apache.hadoop.mapred.JvmManager.launchJvm(JvmManager.java:109)
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:502)
>  {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1397) NullPointerException observed during task failures

2010-04-23 Thread Amareshwari Sriramadasu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-1397:
---

Status: Open  (was: Patch Available)

> NullPointerException observed during task failures
> --
>
> Key: MAPREDUCE-1397
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1397
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.20.1
>Reporter: Ramya R
>Assignee: Amareshwari Sriramadasu
>Priority: Minor
> Fix For: 0.22.0
>
> Attachments: patch-1397-1.txt, patch-1397-2.txt, patch-1397.txt
>
>
> In an environment where many jobs are killed simultaneously, NPEs are 
> observed in the TT/JT logs when a task fails. The situation is aggravated 
> when the taskcontroller.cfg is not configured properly. Below is the 
> exception obtained:
> {noformat}
> INFO org.apache.hadoop.mapred.TaskInProgress: Error from :
> java.lang.Throwable: Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:529)
> Caused by: java.lang.NullPointerException
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.getDetails(JvmManager.java:329)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.reapJvm(JvmManager.java:315)
> at 
> org.apache.hadoop.mapred.JvmManager$JvmManagerForType.access$000(JvmManager.java:146)
> at org.apache.hadoop.mapred.JvmManager.launchJvm(JvmManager.java:109)
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:502)
>  {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1225) TT successfully localizes a task even though the corresponding cache-file has already changed on DFS.

2010-04-23 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860160#action_12860160
 ] 

Hadoop QA commented on MAPREDUCE-1225:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12442402/MAPREDUCE-1225.patch
  against trunk revision 936166.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/360/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/360/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/360/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/360/console

This message is automatically generated.

> TT successfully localizes a task even though the corresponding cache-file has 
> already changed on DFS.
> -
>
> Key: MAPREDUCE-1225
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1225
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: tasktracker
>Affects Versions: 0.21.0, 0.22.0
>Reporter: Vinod K V
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-1225.patch, MAPREDUCE-1225.patch, 
> MAPREDUCE-1225.patch, MAPREDUCE-1225.patch
>
>
> This happens with the first task of a job being localized on this TT. TT 
> doesn't check if the file on DFS is fresh according to the timestamps set in 
> job-conf during submission. After the first task incorrectly gets localized, 
> all further tasks fail on this TT as expected.
> Found this issue while trying to improve test-case for MAPREUCE-913.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (MAPREDUCE-1687) Stress submission policy does not always stress the cluster.

2010-04-23 Thread Hong Tang (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hong Tang updated MAPREDUCE-1687:
-

Attachment: mr-1687-yhadoop-20.1xx-20100423.patch

Patch that fine tunes the overload determination heuristics. 

> Stress submission policy does not always stress the cluster.
> 
>
> Key: MAPREDUCE-1687
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1687
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/gridmix
>Reporter: Hong Tang
> Attachments: mr-1687-yhadoop-20.1xx-20100416.patch, 
> mr-1687-yhadoop-20.1xx-20100423.patch
>
>
> Currently, the rough idea of stress submission policy is to continue 
> submitting jobs until the pending map tasks reach 2x of the cluster capacity. 
> This proves to be inadequate and we saw a large job could monopolize the 
> whole cluster.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1711) Gridmix should provide an option to submit jobs to the same queues as specified in the trace.

2010-04-23 Thread Hong Tang (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860152#action_12860152
 ] 

Hong Tang commented on MAPREDUCE-1711:
--

{noformat}
+assertEquals(" Improper queue for  " + job.getJobName() + " " ,
+  job.getConfiguration().get("mapred.job.queue.name"),
+ "q" + (Integer.valueOf(job.getJobName().substring(
+   job.getJobName().length() - 1))+1));
{noformat}
This code is fragile, why not using GridmixJob.getJobSeqId(job)?

> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.
> -
>
> Key: MAPREDUCE-1711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1711
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/gridmix
>Reporter: Hong Tang
> Attachments: MR-1711-yhadoop-20-1xx-2.patch, 
> MR-1711-yhadoop-20-1xx.patch, mr-1711-yhadoop-20.1xx-20100416.patch
>
>
> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1711) Gridmix should provide an option to submit jobs to the same queues as specified in the trace.

2010-04-23 Thread Hong Tang (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860150#action_12860150
 ] 

Hong Tang commented on MAPREDUCE-1711:
--

- We should generate data only once.
- We may want to increase the # of jobs submitted more than 3. To avoid 
lengthening too much of unit test run time, we should consider running sleep 
jobs, or processing less bytes in that specific case.
- There is another problem I missed which is that JobQueueTaskScheduler does 
not really understand the notion of queues while we may not be able to have the 
unit test depending on capacity scheduler. This issue is probably outside the 
scope of this jira, though.

> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.
> -
>
> Key: MAPREDUCE-1711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1711
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/gridmix
>Reporter: Hong Tang
> Attachments: MR-1711-yhadoop-20-1xx-2.patch, 
> MR-1711-yhadoop-20-1xx.patch, mr-1711-yhadoop-20.1xx-20100416.patch
>
>
> Gridmix should provide an option to submit jobs to the same queues as 
> specified in the trace.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1723) Capacity Scheduler should allow configuration of Map & Reduce task slots independently per queue

2010-04-23 Thread Hemanth Yamijala (JIRA)


[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860147#action_12860147
 ] 

Hemanth Yamijala commented on MAPREDUCE-1723:
-

Hmm. In HADOOP-3445 (God, I am surprised I still remember the number, *smile*) 
which introduced the capacity scheduler, Vivek had argued to have separate 
percentages for map and reduce capacities. At the time though, consensus drove 
towards having a single number.  I think a big factor driving that decision was 
the absence of limits and presence of pre-emption. At that time, queues could 
not impose limits and hence spare capacity could be always used elsewhere; and 
pre-emption was meant to ensure that queues could get their 'guaranteed' 
capacity when required.

With time, limits have come in and pre-emption has gone out. There is this 
valid use case that has come up. To me it seems like there are two ways to 
approach this problem. One is to do the enhancement proposed in the JIRA. Two 
is to re-introduce pre-emption. Clearly the first option is simple and easy to 
understand; I can think of ways we can keep the spec and implementation simple 
for the default case and still support this special requirement. The only thing 
bothering me is that it seems to be handling a specific type of cluster setup 
(i.e. the kind of queue and job profile that is described). The second option 
is clearly quite complicated. But we've had repeated cases from people asking 
for pre-emption in the scheduler, and I think it is a topic that's going to die 
only when it gets implemented. *smile*.

As a side note while we are still discussing this, Subramaniam, what is the 
proportion of map and reduce slots in your cluster ? Are they the same ?

> Capacity Scheduler should allow configuration of Map & Reduce task slots 
> independently per queue
> 
>
> Key: MAPREDUCE-1723
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1723
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: contrib/capacity-sched
>Affects Versions: 0.20.1
> Environment: all
>Reporter: Subramaniam Krishnan
> Fix For: 0.20.3
>
>
> The Capacity Scheduler allows configuration of percentage of task slots per 
> queue. We have a scenario in which our biggest queue (50% quota) has Jobs 
> with mainly Map tasks & we need to enforce strict capacity limits per queue 
> due to SLA requirements. So other smaller queues which require Reduce tasks 
> gets starved even though the Reduce slots are idle. The Grid can be more 
> efficiently utilized if Capacity Scheduler allows configuration of Map & 
> Reduce task slots capacity independently per queue.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

45 matches

Mail list logo