[jira] Updated: (MAPREDUCE-1152) JobTrackerInstrumentation.killed{Map/Reduce} is never called

2009-12-04 Thread Chris Douglas (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-1152:
-

  Resolution: Fixed
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

+1

I committed this. Thanks, Sharad!

 JobTrackerInstrumentation.killed{Map/Reduce} is never called
 

 Key: MAPREDUCE-1152
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1152
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.22.0
Reporter: Sharad Agarwal
 Fix For: 0.22.0

 Attachments: 1152.patch, 1152.patch, 1152_v2.patch, 1152_v3.patch


 The JobTrackerInstrumentation.killed{Map/Reduce} metrics added as part of 
 MAPREDUCE-1103 are not captured.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-372) Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-372:
--

Status: Patch Available  (was: Open)

 Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.
 ---

 Key: MAPREDUCE-372
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-372
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.21.0

 Attachments: mapred-372.patch, mapred-372.patch, mapred-372.patch, 
 patch-372-1.txt, patch-372-2.txt, patch-372.txt







[jira] Updated: (MAPREDUCE-372) Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-372:
--

Attachment: patch-372-2.txt

Patch with review comments incorporated.

 Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.
 ---

 Key: MAPREDUCE-372
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-372
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.21.0

 Attachments: mapred-372.patch, mapred-372.patch, mapred-372.patch, 
 patch-372-1.txt, patch-372-2.txt, patch-372.txt







[jira] Updated: (MAPREDUCE-1174) Sqoop improperly handles table/column names which are reserved sql words

2009-12-04 Thread Chris Douglas (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-1174:
-

Status: Open  (was: Patch Available)

Unfortunately, the patch has gone stale. Could you regenerate it?

 Sqoop improperly handles table/column names which are reserved sql words
 

 Key: MAPREDUCE-1174
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1174
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: Aaron Kimball
 Attachments: MAPREDUCE-1174.patch


 In some databases it is legal to name tables and columns with terms that 
 overlap SQL reserved keywords (e.g., {{CREATE}}, {{table}}, etc.). In such 
 cases, the database allows you to escape the table and column names. We 
 should always escape table and column names when possible.
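Escaping can be sketched in plain Java. This is a generic illustration, not Sqoop's actual code, and the class and method names below are hypothetical: the standard ANSI SQL approach is to wrap the identifier in double quotes and double any embedded quotes.

```java
public class IdentifierEscaper {
    // Wrap an identifier in double quotes (the ANSI SQL delimiter) and
    // double any embedded quotes, so a reserved word such as CREATE
    // becomes a legal, unambiguous table or column name.
    public static String escapeIdentifier(String name) {
        return "\"" + name.replace("\"", "\"\"") + "\"";
    }

    public static void main(String[] args) {
        System.out.println(escapeIdentifier("CREATE")); // prints "CREATE" with quotes
        System.out.println(escapeIdentifier("table"));
    }
}
```

Note that some databases use a different delimiter (e.g. backticks in MySQL), so a real implementation would delegate the choice of delimiter to the connection manager for the database in use.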




[jira] Resolved: (MAPREDUCE-1234) getJobID() returns null on org.apache.hadoop.mapreduce.Job after job was submitted

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu resolved MAPREDUCE-1234.


Resolution: Duplicate

Duplicate of MAPREDUCE-118

 getJobID() returns null on org.apache.hadoop.mapreduce.Job after job was 
 submitted
 --

 Key: MAPREDUCE-1234
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1234
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Affects Versions: 0.20.1
 Environment: Run on Win XP, but will probably occur on any system
Reporter: Thomas Kathmann
Priority: Minor
   Original Estimate: 0.5h
  Remaining Estimate: 0.5h

 After an instance of org.apache.hadoop.mapreduce.Job is submitted via 
 submit() the method getJobID() returns null.
 The code of the submit() method should include something like:
 setJobID(info.getJobID());
 after
 info = jobClient.submitJobInternal(conf);




[jira] Updated: (MAPREDUCE-118) Job.getJobID() will always return null

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-118:
--

 Priority: Blocker  (was: Major)
Affects Version/s: 0.20.1
Fix Version/s: 0.20.2

 Job.getJobID() will always return null
 --

 Key: MAPREDUCE-118
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-118
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
Priority: Blocker
 Fix For: 0.20.2


 JobContext provides a read-only view of the job's info; hence all the 
 read-only fields in JobContext are set in the constructor. Job extends 
 JobContext. When a Job is created, the job id is not known, and hence there 
 is no way to set the JobID once the Job is created. The JobID is obtained 
 only when the JobClient queries the JobTracker for a job-id, which happens 
 later, i.e., upon job submission.




[jira] Commented: (MAPREDUCE-1257) Ability to grab the number of spills

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785822#action_12785822
 ] 

Hadoop QA commented on MAPREDUCE-1257:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12426868/mapreduce-1257.txt
  against trunk revision 887061.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/288/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/288/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/288/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/288/console

This message is automatically generated.

 Ability to grab the number of spills
 

 Key: MAPREDUCE-1257
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1257
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Affects Versions: 0.22.0
Reporter: Sriranjan Manjunath
Assignee: Todd Lipcon
 Fix For: 0.22.0

 Attachments: mapreduce-1257.txt


 The counters should have information about the number of spills in addition 
 to the number of spill records.




[jira] Updated: (MAPREDUCE-118) Job.getJobID() will always return null

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-118:
--

Attachment: patch-118-0.20.txt

Patch for branch 0.20

 Job.getJobID() will always return null
 --

 Key: MAPREDUCE-118
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-118
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
Priority: Blocker
 Fix For: 0.20.2

 Attachments: patch-118-0.20.txt


 JobContext provides a read-only view of the job's info; hence all the 
 read-only fields in JobContext are set in the constructor. Job extends 
 JobContext. When a Job is created, the job id is not known, and hence there 
 is no way to set the JobID once the Job is created. The JobID is obtained 
 only when the JobClient queries the JobTracker for a job-id, which happens 
 later, i.e., upon job submission.




[jira] Updated: (MAPREDUCE-118) Job.getJobID() will always return null

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-118:
--

Attachment: patch-118.txt
patch-118-0.21.txt

Patch for branch 0.21 and trunk, renaming getID to getJobID so that it 
overrides the method in JobContext.
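The override/shadowing distinction behind this fix can be shown with a minimal self-contained model. This is a simplification: the real classes live in org.apache.hadoop.mapreduce, and the literal job id below is made up.

```java
// Minimal model of the bug and the fix; only mirrors the shape of the
// real org.apache.hadoop.mapreduce classes.
class JobContext {
    private final String jobId;            // read-only, set in constructor
    JobContext(String jobId) { this.jobId = jobId; }
    public String getJobID() { return jobId; }
}

class Job extends JobContext {
    private String submittedId;            // learned only at submission
    Job() { super(null); }                 // id unknown when Job is created
    void submit() { submittedId = "job_200912040001_0001"; }

    // The fix: same name and signature as the parent, so this overrides
    // JobContext.getJobID(). Before the fix this method was named getID(),
    // which does NOT override, so callers of getJobID() always got the
    // null stored by the constructor.
    @Override
    public String getJobID() { return submittedId; }
}

public class GetJobIdDemo {
    static String idAfterSubmit() {
        Job job = new Job();
        job.submit();
        return job.getJobID();
    }

    public static void main(String[] args) {
        System.out.println(new Job().getJobID()); // null before submission
        System.out.println(idAfterSubmit());      // the id after submission
    }
}
```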

 Job.getJobID() will always return null
 --

 Key: MAPREDUCE-118
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-118
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
Priority: Blocker
 Fix For: 0.20.2

 Attachments: patch-118-0.20.txt, patch-118-0.21.txt, patch-118.txt


 JobContext provides a read-only view of the job's info; hence all the 
 read-only fields in JobContext are set in the constructor. Job extends 
 JobContext. When a Job is created, the job id is not known, and hence there 
 is no way to set the JobID once the Job is created. The JobID is obtained 
 only when the JobClient queries the JobTracker for a job-id, which happens 
 later, i.e., upon job submission.




[jira] Assigned: (MAPREDUCE-1084) Implementing aspects development and fault injection framework for MapReduce

2009-12-04 Thread Sreekanth Ramakrishnan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan reassigned MAPREDUCE-1084:
-

Assignee: Sreekanth Ramakrishnan

 Implementing aspects development and fault injection framework for MapReduce
 

 Key: MAPREDUCE-1084
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1084
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build, test
Reporter: Konstantin Boudnik
Assignee: Sreekanth Ramakrishnan

 Similar to HDFS-435 and HADOOP-6204 this JIRA will track the introduction of 
 injection framework for MapReduce.
 After HADOOP-6204 is in place this particular modification should be very 
 trivial and would take importing (via svn:external) of src/test/build and 
 some tweaking of the build.xml file




[jira] Updated: (MAPREDUCE-118) Job.getJobID() will always return null

2009-12-04 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-118:
--

Status: Patch Available  (was: Open)

 Job.getJobID() will always return null
 --

 Key: MAPREDUCE-118
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-118
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
Priority: Blocker
 Fix For: 0.20.2

 Attachments: patch-118-0.20.txt, patch-118-0.21.txt, patch-118.txt


 JobContext provides a read-only view of the job's info; hence all the 
 read-only fields in JobContext are set in the constructor. Job extends 
 JobContext. When a Job is created, the job id is not known, and hence there 
 is no way to set the JobID once the Job is created. The JobID is obtained 
 only when the JobClient queries the JobTracker for a job-id, which happens 
 later, i.e., upon job submission.




[jira] Updated: (MAPREDUCE-1084) Implementing aspects development and fault injection framework for MapReduce

2009-12-04 Thread Sreekanth Ramakrishnan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-1084:
--

Attachment: mapreduce-1084-1-withoutsvnexternals.patch
mapreduce-1084-1.patch

Attaching the patch implementing fault injection in the mapreduce project.

There are two patches: one with svn externals and one without. When the 
svn-externals patch is applied over a workspace, it does not create the 
appropriate folder structure with links, even though the property and folder 
are added into version control.


 Implementing aspects development and fault injection framework for MapReduce
 

 Key: MAPREDUCE-1084
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1084
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build, test
Reporter: Konstantin Boudnik
Assignee: Sreekanth Ramakrishnan
 Attachments: mapreduce-1084-1-withoutsvnexternals.patch, 
 mapreduce-1084-1.patch


 Similar to HDFS-435 and HADOOP-6204 this JIRA will track the introduction of 
 injection framework for MapReduce.
 After HADOOP-6204 is in place this particular modification should be very 
 trivial and would take importing (via svn:external) of src/test/build and 
 some tweaking of the build.xml file




[jira] Commented: (MAPREDUCE-1254) job.xml should add crc check in tasktracker and sub jvm.

2009-12-04 Thread ZhuGuanyin (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785838#action_12785838
 ] 

ZhuGuanyin commented on MAPREDUCE-1254:
---

Because the local inexpensive disks are not reliable: we once found a non-zero 
file had become zero length, yet the OS kernel log showed no warning; only 
some minutes later did the kernel report the disk failures. During that time, 
read operations returned success without throwing any IOException.

In the current implementation, an IOException is thrown if job.xml is missing, 
but there is no way to detect that the configuration file has been corrupted 
or truncated.
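The idea behind the request can be sketched with java.util.zip.CRC32. This is not Hadoop's ChecksumFileSystem code, just a self-contained illustration of verifying a stored checksum when the file is read back.

```java
import java.util.zip.CRC32;

public class CrcCheckDemo {
    // Compute a CRC32 over the file's bytes, as would be stored alongside
    // job.xml at write time.
    static long crcOf(byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(data, 0, data.length);
        return crc.getValue();
    }

    // Verify the stored checksum before trusting the configuration bytes.
    // A silently truncated or zeroed file yields a mismatch even though
    // the read itself reported no IOException.
    static boolean verify(byte[] data, long storedCrc) {
        return crcOf(data) == storedCrc;
    }

    public static void main(String[] args) {
        byte[] jobXml = "<configuration>...</configuration>".getBytes();
        long stored = crcOf(jobXml);
        System.out.println(verify(jobXml, stored));       // intact file
        System.out.println(verify(new byte[0], stored));  // truncated file
    }
}
```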

 job.xml should add crc check in tasktracker and sub jvm.
 

 Key: MAPREDUCE-1254
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1254
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: task, tasktracker
Affects Versions: 0.22.0
Reporter: ZhuGuanyin

 Currently job.xml in the tasktracker and sub-JVM is written to local disk 
 through ChecksumFileSystem and already has CRC checksum information, but the 
 job.xml file is loaded without a CRC check. This can cause a MapReduce job 
 to finish successfully but with wrong data because of a disk error. Example: 
 the tasktracker and sub-task JVM load the default configuration if they do 
 not successfully load job.xml, which may replace the mapper with 
 IdentityMapper.




[jira] Updated: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Chris Douglas (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-1114:
-

Status: Open  (was: Patch Available)

The patch is stale.

The long build times are a problem and ivy's a big part of that, but I agree 
with your assessment: this is a hack. I don't think the 15 second payoff 
justifies the maintenance cost of a custom caching layer for ivy.

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Updated: (MAPREDUCE-1161) NotificationTestCase should not lock current thread

2009-12-04 Thread Chris Douglas (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-1161:
-

  Resolution: Fixed
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

I committed this. Thanks, Owen!

 NotificationTestCase should not lock current thread
 ---

 Key: MAPREDUCE-1161
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1161
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Fix For: 0.21.0

 Attachments: mr-1161.patch


 There are 3 instances where NotificationTestCase locks 
 Thread.currentThread() and calls sleep on it. There is also 
 a method stdPrintln that doesn't do anything.
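Why locking the current thread is ineffective can be illustrated with a minimal, self-contained example (not the test's actual code): each thread locks a different object, its own Thread instance, so no thread ever waits for another.

```java
public class CurrentThreadLockDemo {
    // Returns true while holding the monitor of the calling thread's own
    // Thread object -- the pattern used in NotificationTestCase.
    static boolean lockOwnThread() {
        synchronized (Thread.currentThread()) {
            return Thread.holdsLock(Thread.currentThread());
        }
    }

    public static void main(String[] args) throws InterruptedException {
        // Each thread locks a DIFFERENT object (its own Thread instance),
        // so the two "critical sections" never exclude each other and the
        // synchronization provides no mutual exclusion.
        Thread a = new Thread(() -> System.out.println("a: " + lockOwnThread()));
        Thread b = new Thread(() -> System.out.println("b: " + lockOwnThread()));
        a.start(); b.start();
        a.join(); b.join();
        // Note: Thread.sleep() never releases a held monitor, so
        // "synchronized (Thread.currentThread()) { sleep(...); }" merely
        // delays the sleeping thread while pointlessly holding its own lock.
    }
}
```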




[jira] Commented: (MAPREDUCE-1241) JobTracker should not crash when mapred-queues.xml does not exist

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785860#action_12785860
 ] 

Hadoop QA commented on MAPREDUCE-1241:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12426838/mapreduce-1241.txt
  against trunk revision 887061.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

-1 release audit.  The applied patch generated 160 release audit warnings 
(more than the trunk's current 159 warnings).

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/163/testReport/
Release audit warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/163/artifact/trunk/patchprocess/releaseAuditDiffWarnings.txt
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/163/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/163/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/163/console

This message is automatically generated.

 JobTracker should not crash when mapred-queues.xml does not exist
 -

 Key: MAPREDUCE-1241
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1241
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Todd Lipcon
Priority: Blocker
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1241.txt


 Currently, if you bring up the JobTracker on an old configuration directory, 
 it gets a NullPointerException looking for the mapred-queues.xml file. It 
 should just assume a default queue and continue.
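The suggested fallback behavior can be sketched as follows. The names here are hypothetical and this is not the JobTracker's actual code; it only illustrates treating a missing queue configuration as a single default queue rather than dereferencing a null result.

```java
import java.io.File;
import java.util.Collections;
import java.util.List;

public class QueueConfigLoader {
    // Return the configured queue names, falling back to a single
    // "default" queue when mapred-queues.xml is absent, instead of
    // failing with a NullPointerException later during startup.
    static List<String> loadQueues(File queuesXml) {
        if (!queuesXml.exists()) {
            return Collections.singletonList("default");
        }
        return parseQueues(queuesXml);
    }

    // Parsing of a real mapred-queues.xml is elided in this sketch.
    static List<String> parseQueues(File f) {
        return Collections.singletonList("default");
    }

    public static void main(String[] args) {
        // An old configuration directory with no queues file still yields
        // a usable queue list.
        System.out.println(loadQueues(new File("missing/mapred-queues.xml")));
    }
}
```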




[jira] Commented: (MAPREDUCE-1075) getQueue(String queue) in JobTracker would return NPE for invalid queue name

2009-12-04 Thread Hemanth Yamijala (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785874#action_12785874
 ] 

Hemanth Yamijala commented on MAPREDUCE-1075:
-

In an offline discussion with Vinod, we concluded that there is no provision 
to marshal exceptions in Hadoop's RPC right now. Hence, we are deciding in 
favor of returning null in the queue APIs.

With this context I looked at the new patch. One minor nit: I would suggest 
we test the API JobClient.getQueueInfo instead of Cluster.getQueue, as it 
covers more of the changed code path. Can you please make this change and run 
the patch through Hudson so I can commit it once it passes?

 getQueue(String queue) in JobTracker would return NPE for invalid queue name
 

 Key: MAPREDUCE-1075
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1075
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: V.V.Chaitanya Krishna
Assignee: V.V.Chaitanya Krishna
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1075-1.patch, MAPREDUCE-1075-2.patch, 
 MAPREDUCE-1075-3.patch, MAPREDUCE-1075-4.patch, MAPREDUCE-1075-5.patch, 
 MAPREDUCE-1075-6.patch







[jira] Commented: (MAPREDUCE-1152) JobTrackerInstrumentation.killed{Map/Reduce} is never called

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785878#action_12785878
 ] 

Hudson commented on MAPREDUCE-1152:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #144 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/144/]).
Distinguish between failed and killed tasks in
JobTrackerInstrumentation. Contributed by Sharad Agarwal


 JobTrackerInstrumentation.killed{Map/Reduce} is never called
 

 Key: MAPREDUCE-1152
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1152
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.22.0
Reporter: Sharad Agarwal
 Fix For: 0.22.0

 Attachments: 1152.patch, 1152.patch, 1152_v2.patch, 1152_v3.patch


 The JobTrackerInstrumentation.killed{Map/Reduce} metrics added as part of 
 MAPREDUCE-1103 are not captured.




[jira] Commented: (MAPREDUCE-1161) NotificationTestCase should not lock current thread

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785877#action_12785877
 ] 

Hudson commented on MAPREDUCE-1161:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #144 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/144/]).
Remove ineffective synchronization in NotificationTestCase.
Contributed by Owen O'Malley


 NotificationTestCase should not lock current thread
 ---

 Key: MAPREDUCE-1161
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1161
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Fix For: 0.21.0

 Attachments: mr-1161.patch


 There are 3 instances where NotificationTestCase locks 
 Thread.currentThread() and calls sleep on it. There is also 
 a method stdPrintln that doesn't do anything.




[jira] Commented: (MAPREDUCE-372) Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.

2009-12-04 Thread Sharad Agarwal (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785897#action_12785897
 ] 

Sharad Agarwal commented on MAPREDUCE-372:
--

Looked at the ChainBlockingQueue part of the code. Some comments:
1. Can we avoid the casting in Chain#stopAllThreads? One way could be to 
override interrupt() in MapRunner and ReduceRunner. Also, interruptAllThreads 
would be a better name IMO.
2. I think that instead of interrupting the runners and then calling 
interrupt on both readers and writers, it would be simpler to directly 
interrupt all the blocking queues.
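The first suggestion can be sketched with a self-contained model. The names MapRunner and ChainBlockingQueue come from the comment, but their bodies here are illustrative only, not the patch's code.

```java
public class ChainInterruptDemo {
    // Stand-in for the chain's blocking queue, with an explicit interrupt
    // hook as the comment proposes.
    static class ChainBlockingQueue {
        volatile boolean interrupted;
        void interrupt() { interrupted = true; }
    }

    // A runner thread that knows its own queues. Overriding interrupt()
    // means a stop-all-threads method can call plain Thread.interrupt()
    // on every runner, with no instanceof checks or casts.
    static class MapRunner extends Thread {
        private final ChainBlockingQueue in, out;
        MapRunner(ChainBlockingQueue in, ChainBlockingQueue out) {
            this.in = in;
            this.out = out;
        }
        @Override
        public void interrupt() {
            in.interrupt();
            out.interrupt();
            super.interrupt();
        }
    }

    static boolean queuesInterrupted() {
        ChainBlockingQueue in = new ChainBlockingQueue();
        ChainBlockingQueue out = new ChainBlockingQueue();
        Thread runner = new MapRunner(in, out); // held as a plain Thread
        runner.interrupt();                     // also unblocks both queues
        return in.interrupted && out.interrupted;
    }

    public static void main(String[] args) {
        System.out.println(queuesInterrupted()); // true
    }
}
```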

 Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.
 ---

 Key: MAPREDUCE-372
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-372
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.21.0

 Attachments: mapred-372.patch, mapred-372.patch, mapred-372.patch, 
 patch-372-1.txt, patch-372-2.txt, patch-372.txt







[jira] Commented: (MAPREDUCE-1082) Command line UI for queues' information is broken with hierarchical queues.

2009-12-04 Thread Hemanth Yamijala (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785899#action_12785899
 ] 

Hemanth Yamijala commented on MAPREDUCE-1082:
-

Looking close. Some final comments:
- We are assuming the job statuses cannot be null in QueueInfo. I think we 
should check this in setJobStatuses; if it is null, we can set an empty array.
- The test case should call APIs like setRootQueues. getQueue does not pass 
through the code path change you made in JobTracker.getQueueInfoArray.
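The null-handling suggestion amounts to a defensive setter. A sketch with simplified field types (the real QueueInfo stores job status objects, reduced to strings here for a self-contained example):

```java
public class QueueInfoSketch {
    private String[] jobStatuses = new String[0];

    // Per the review comment: never store null, so readers and the
    // serialization code can rely on a non-null array.
    public void setJobStatuses(String[] statuses) {
        this.jobStatuses = (statuses == null) ? new String[0] : statuses;
    }

    public String[] getJobStatuses() { return jobStatuses; }

    static int lengthAfterNullSet() {
        QueueInfoSketch q = new QueueInfoSketch();
        q.setJobStatuses(null);
        return q.getJobStatuses().length;
    }

    public static void main(String[] args) {
        System.out.println(lengthAfterNullSet()); // 0 rather than an NPE later
    }
}
```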

 Command line UI for queues' information is broken with hierarchical queues.
 ---

 Key: MAPREDUCE-1082
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1082
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client, jobtracker
Affects Versions: 0.21.0
Reporter: Vinod K V
Assignee: V.V.Chaitanya Krishna
Priority: Blocker
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1082-1.txt, MAPREDUCE-1082-2.patch, 
 MAPREDUCE-1082-3.patch


 When the command ./bin/mapred --config ~/tmp/conf/ queue -list is run, it 
 just hangs. I can see the following in the JT logs:
 {code}
 2009-10-08 13:19:26,762 INFO org.apache.hadoop.ipc.Server: IPC Server handler 
 1 on 5 caught: java.lang.NullPointerException
 at org.apache.hadoop.mapreduce.QueueInfo.write(QueueInfo.java:217)
 at org.apache.hadoop.mapreduce.QueueInfo.write(QueueInfo.java:223)
 at 
 org.apache.hadoop.io.ObjectWritable.writeObject(ObjectWritable.java:159)
 at 
 org.apache.hadoop.io.ObjectWritable.writeObject(ObjectWritable.java:126)
 at org.apache.hadoop.io.ObjectWritable.write(ObjectWritable.java:70)
 at org.apache.hadoop.ipc.Server.setupResponse(Server.java:1074)
 at org.apache.hadoop.ipc.Server.access$2400(Server.java:77)
 at org.apache.hadoop.ipc.Server$Handler.run(Server.java:983)
 {code}
 Same is the case with ./bin/mapred --config ~/tmp/conf/ queue -info 
 any-container-queue




[jira] Commented: (MAPREDUCE-181) Secure job submission

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785905#action_12785905
 ] 

Hadoop QA commented on MAPREDUCE-181:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12426876/181-4.patch
  against trunk revision 887096.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 78 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/289/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/289/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/289/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/289/console

This message is automatically generated.

 Secure job submission 
 --

 Key: MAPREDUCE-181
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-181
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amar Kamat
Assignee: Devaraj Das
 Fix For: 0.22.0

 Attachments: 181-1.patch, 181-2.patch, 181-3.patch, 181-3.patch, 
 181-4.patch, hadoop-3578-branch-20-example-2.patch, 
 hadoop-3578-branch-20-example.patch, HADOOP-3578-v2.6.patch, 
 HADOOP-3578-v2.7.patch, MAPRED-181-v3.32.patch, MAPRED-181-v3.8.patch


 Currently the jobclient accesses the {{mapred.system.dir}} to add job 
 details. Hence the {{mapred.system.dir}} has the permissions of 
 {{rwx-wx-wx}}. This could be a security loophole where the job files might 
 get overwritten/tampered after the job submission. 




[jira] Commented: (MAPREDUCE-1185) URL to JT webconsole for running job and job history should be the same

2009-12-04 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785911#action_12785911
 ] 

Arun C Murthy commented on MAPREDUCE-1185:
--

+1

 URL to JT webconsole for running job and job history should be the same
 ---

 Key: MAPREDUCE-1185
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1185
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker
Reporter: Sharad Agarwal
Assignee: Sharad Agarwal
 Attachments: 1185_v1.patch, 1185_v2.patch, 1185_v3.patch, 
 1185_v4.patch, 1185_v5.patch, 1185_v6.patch, 1185_v7.patch, 
 patch-1185-1-ydist.txt, patch-1185-2-ydist.txt, patch-1185-3-ydist.txt, 
 patch-1185-ydist.txt


 The tracking URL for running jobs and for retired jobs is different. This 
 creates a problem for clients which cache the running job's URL, because it 
 becomes invalid as soon as the job is retired.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1264) Error Recovery failed, task will continue but run forever as new data only comes in very very slowly

2009-12-04 Thread Thibaut (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thibaut updated MAPREDUCE-1264:
---

  Description: 
Hi,

Sometimes some of my jobs will not finish and will run forever (it normally 
happens in the reducers, on a random basis). I have to manually fail the task 
so that it gets restarted and finishes.

The error log on the node is full of entries like:
java.io.IOException: Error Recovery for block blk_-8036012205502614140_21582139 
failed  because recovery from primary datanode 192.168.0.3:50011 failed 6 
times.  Pipeline was 192.168.0.3:50011. Aborting...
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2582)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$1600(DFSClient.java:2076)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2239)
java.io.IOException: Error Recovery for block blk_-8036012205502614140_21582139 
failed  because recovery from primary datanode 192.168.0.3:50011 failed 6 
times.  Pipeline was 192.168.0.3:50011. Aborting...
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2582)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$1600(DFSClient.java:2076)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2239)
java.io.IOException: Error Recovery for block blk_-8036012205502614140_21582139 
failed  because recovery from primary datanode 192.168.0.3:50011 failed 6 
times.  Pipeline was 192.168.0.3:50011. Aborting...
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2582)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$1600(DFSClient.java:2076)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2239)
The error entries all refer to the same data block.

Unfortunately, the reduce function still seems to be called in the reducer with 
valid data (although very slowly), so the task will never be killed and 
restarted and will take forever to run!

If I kill the task, the job will finish without any problems. 

I experienced the same problem under version 0.20.0 as well.


Thanks,
Thibaut

  was:
Hi,

Sometimes, some of my jobs (It normally always happens in the reducers and on 
random basis) will not finish and will run forever. I have to manually fail the 
task so the task will be started and be finished.

The error log on the node is full of entries like:
java.io.IOException: Error Recovery for block blk_-8036012205502614140_21582139 
failed  because recovery from primary datanode 192.168.0.3:50011 failed 6 
times.  Pipeline was 192.168.0.3:50011. Aborting...
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2582)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$1600(DFSClient.java:2076)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2239)
java.io.IOException: Error Recovery for block blk_-8036012205502614140_21582139 
failed  because recovery from primary datanode 192.168.0.3:50011 failed 6 
times.  Pipeline was 192.168.0.3:50011. Aborting...
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2582)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$1600(DFSClient.java:2076)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2239)
java.io.IOException: Error Recovery for block blk_-8036012205502614140_21582139 
failed  because recovery from primary datanode 192.168.0.3:50011 failed 6 
times.  Pipeline was 192.168.0.3:50011. Aborting...
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2582)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$1600(DFSClient.java:2076)
at 
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2239)
The error entries all refer to the same data block.

Unfortunally, the reduce function still seems to be called in the reducer with 
valid data (allthough very very slowly), so the task will never been killed and 
restarted and will take forever to run!

I experienced the same problem under version 0.20.0 as well.


Thanks,
Thibaut

Fix Version/s: 0.20.2

 Error Recovery failed, task will continue but run forever as new data only 
 comes in very very slowly
 

 Key: MAPREDUCE-1264
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1264
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Thibaut

[jira] Updated: (MAPREDUCE-1230) Vertica streaming adapter doesn't handle nulls in all cases

2009-12-04 Thread Omer Trajman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Omer Trajman updated MAPREDUCE-1230:


Fix Version/s: 0.21.0

 Vertica streaming adapter doesn't handle nulls in all cases
 ---

 Key: MAPREDUCE-1230
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1230
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.0+
Reporter: Omer Trajman
Assignee: Omer Trajman
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1230.patch


 Test user reported that the Vertica adapter throws an NPE when retrieving 
 null values for certain types (binary and numeric both reported). There is 
 no special-case handling when serializing nulls.
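
Since the report points at a missing null special case during serialization, a minimal illustration of the shape of such a fix might look like the following. This is a hypothetical sketch, not the actual Vertica adapter code; the class name, delimiter, and null marker are all assumptions.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class NullSafeRowSerializer {
    // Assumed wire format: pipe-delimited fields with \N as the null marker.
    static final String DELIMITER = "|";
    static final String NULL_MARKER = "\\N";

    // Emit a marker for null values instead of calling toString() on a null
    // reference, which is the kind of NPE reported above.
    static String serializeRow(List<Object> values) {
        return values.stream()
                     .map(v -> v == null ? NULL_MARKER : v.toString())
                     .collect(Collectors.joining(DELIMITER));
    }

    public static void main(String[] args) {
        // A row with null numeric and binary fields mixed in.
        System.out.println(serializeRow(Arrays.asList("abc", null, 42, null)));
    }
}
```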

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1230) Vertica streaming adapter doesn't handle nulls in all cases

2009-12-04 Thread Omer Trajman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Omer Trajman updated MAPREDUCE-1230:


Status: Patch Available  (was: Open)

 Vertica streaming adapter doesn't handle nulls in all cases
 ---

 Key: MAPREDUCE-1230
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1230
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.0+
Reporter: Omer Trajman
Assignee: Omer Trajman
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1230.patch


 Test user reported that the Vertica adapter throws an NPE when retrieving 
 null values for certain types (binary and numeric both reported). There is 
 no special-case handling when serializing nulls.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1230) Vertica streaming adapter doesn't handle nulls in all cases

2009-12-04 Thread Omer Trajman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Omer Trajman updated MAPREDUCE-1230:


Status: Open  (was: Patch Available)

 Vertica streaming adapter doesn't handle nulls in all cases
 ---

 Key: MAPREDUCE-1230
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1230
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.0+
Reporter: Omer Trajman
Assignee: Omer Trajman
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1230.patch


 Test user reported that the Vertica adapter throws an NPE when retrieving 
 null values for certain types (binary and numeric both reported). There is 
 no special-case handling when serializing nulls.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-896) Users can set non-writable permissions on temporary files for TT and can abuse disk usage.

2009-12-04 Thread Ravi Gummadi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785948#action_12785948
 ] 

Ravi Gummadi commented on MAPREDUCE-896:


The new TaskController command is ENABLE_TASK_FOR_CLEANUP.

There is a change in JVMManager where the workdir for the last task was being 
deleted inline, but now we delete it asynchronously. This should be fine.

The change in setupWorkDir fixes the issue of trying to delete workDir, which 
is the current working dir. Only the contents of workDir are deleted, leaving 
workDir empty. A test case is added to validate this cleanup of workDir.

Removing check_group, as this wouldn't work if the user changes the group of 
workDir.

createFileAndSetPermissions sets a=rx for subDir and the file in subDir so 
that no one can delete them without first doing a chmod.

I am fine with the other comments.
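
The setupWorkDir change described above amounts to "delete the contents, not the directory itself". A minimal sketch of that behavior (illustrative names only, not the actual TaskTracker/TaskController code):

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;

public class WorkDirCleanup {
    // Delete everything inside workDir, but keep workDir itself, since it
    // is the task JVM's current working directory.
    static void cleanContents(File workDir) {
        File[] entries = workDir.listFiles();
        if (entries == null) return; // not a directory or I/O error
        for (File entry : entries) {
            deleteRecursively(entry);
        }
    }

    // Depth-first delete: children first, then the entry itself.
    static void deleteRecursively(File f) {
        File[] children = f.listFiles();
        if (children != null) {
            for (File child : children) {
                deleteRecursively(child);
            }
        }
        f.delete();
    }

    public static void main(String[] args) throws IOException {
        File workDir = Files.createTempDirectory("workdir").toFile();
        File sub = new File(workDir, "sub");
        sub.mkdir();
        Files.write(new File(sub, "attempt.tmp").toPath(), new byte[]{1});
        cleanContents(workDir);
        // workDir survives, and it is now empty.
        System.out.println(workDir.isDirectory() + " "
                + (workDir.listFiles().length == 0));
    }
}
```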



 Users can set non-writable permissions on temporary files for TT and can 
 abuse disk usage.
 --

 Key: MAPREDUCE-896
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-896
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Affects Versions: 0.21.0
Reporter: Vinod K V
Assignee: Ravi Gummadi
 Fix For: 0.21.0

 Attachments: MR-896.patch, MR-896.v1.patch


 As of now, irrespective of the TaskController in use, the TT itself does a 
 full delete on local files created by itself or by job tasks. This step, 
 depending upon the TT's umask and the permissions set on files by the user 
 (e.g. in job-work/task-work or child.tmp directories), may or may not 
 complete successfully. This leaves an opportunity for disk space usage to be 
 abused, either accidentally or intentionally, by the TT/users.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-372) Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785954#action_12785954
 ] 

Hadoop QA commented on MAPREDUCE-372:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12426877/patch-372-2.txt
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 9 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/164/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/164/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/164/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/164/console

This message is automatically generated.

 Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.
 ---

 Key: MAPREDUCE-372
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-372
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.21.0

 Attachments: mapred-372.patch, mapred-372.patch, mapred-372.patch, 
 patch-372-1.txt, patch-372-2.txt, patch-372.txt




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-118) Job.getJobID() will always return null

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785974#action_12785974
 ] 

Hadoop QA commented on MAPREDUCE-118:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12426883/patch-118.txt
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 18 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/290/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/290/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/290/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/290/console

This message is automatically generated.

 Job.getJobID() will always return null
 --

 Key: MAPREDUCE-118
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-118
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
Priority: Blocker
 Fix For: 0.20.2

 Attachments: patch-118-0.20.txt, patch-118-0.21.txt, patch-118.txt


 JobContext is used as a read-only view of a job's info; hence all the 
 read-only fields in JobContext are set in the constructor. Job extends 
 JobContext. When a Job is created, the job id is not known, so there is no 
 way to set the JobID once the Job is created. The JobID is obtained only 
 when the JobClient queries the JobTracker for a job id, which happens 
 later, i.e. upon job submission.
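
A stripped-down sketch of the constructor-only design described above (simplified stand-in classes, not the real Hadoop ones) shows why the id can never be filled in later:

```java
public class JobIdDemo {
    // Stand-in for the read-only context: every read-only field is fixed
    // at construction time.
    static class JobContext {
        private final String jobId;
        JobContext(String jobId) { this.jobId = jobId; }
        String getJobID() { return jobId; }
    }

    // Stand-in for Job: the id is unknown when the Job is created, and
    // there is no setter, so getJobID() can only ever return null.
    static class Job extends JobContext {
        Job() { super(null); }
    }

    public static void main(String[] args) {
        Job job = new Job();
        // The id assigned by the JobTracker at submission time has nowhere
        // to go in this design.
        System.out.println(job.getJobID()); // null
    }
}
```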

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1249) mapreduce.reduce.shuffle.read.timeout's default value should be 3 minutes, in mapred-default.xml

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786043#action_12786043
 ] 

Hudson commented on MAPREDUCE-1249:
---

Integrated in Hadoop-Mapreduce-trunk #164 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/164/])
. Update config default value for socket read timeout to
match code default. Contributed by Amareshwari Sriramadasu


 mapreduce.reduce.shuffle.read.timeout's default value should be 3 minutes, in 
 mapred-default.xml
 

 Key: MAPREDUCE-1249
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1249
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: task
Affects Versions: 0.21.0
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
Priority: Blocker
 Fix For: 0.21.0

 Attachments: patch-1249-1.txt, patch-1249.txt


 mapreduce.reduce.shuffle.read.timeout has a value of 30,000 (30 seconds) in 
 mapred-default.xml, whereas the default value in the Fetcher code is 3 
 minutes. It should be 3 minutes by default, as it was before MAPREDUCE-353.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1161) NotificationTestCase should not lock current thread

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786042#action_12786042
 ] 

Hudson commented on MAPREDUCE-1161:
---

Integrated in Hadoop-Mapreduce-trunk #164 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/164/])
. Remove ineffective synchronization in NotificationTestCase.
Contributed by Owen O'Malley


 NotificationTestCase should not lock current thread
 ---

 Key: MAPREDUCE-1161
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1161
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Fix For: 0.21.0

 Attachments: mr-1161.patch


 There are 3 instances in NotificationTestCase where Thread.currentThread() 
 is being locked and sleep is called on it. There is also a method stdPrintln 
 that doesn't do anything.
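
The ineffective pattern at issue can be illustrated as follows (a sketch, not the actual test code): synchronizing on Thread.currentThread() adds no coordination, because no other thread ever contends for the current thread's monitor, so the block reduces to a plain sleep.

```java
public class SleepDemo {
    // Anti-pattern: the current thread's monitor is never locked by anyone
    // else here, so the synchronized block provides no synchronization.
    static void lockedSleep(long millis) throws InterruptedException {
        synchronized (Thread.currentThread()) {
            Thread.sleep(millis);
        }
    }

    // Equivalent, clearer form.
    static void plainSleep(long millis) throws InterruptedException {
        Thread.sleep(millis);
    }

    public static void main(String[] args) throws InterruptedException {
        long start = System.nanoTime();
        lockedSleep(20);
        plainSleep(20);
        long elapsedMs = (System.nanoTime() - start) / 1_000_000;
        // Both forms simply sleep; together they take at least ~40 ms.
        System.out.println(elapsedMs >= 35);
    }
}
```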

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1152) JobTrackerInstrumentation.killed{Map/Reduce} is never called

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786045#action_12786045
 ] 

Hudson commented on MAPREDUCE-1152:
---

Integrated in Hadoop-Mapreduce-trunk #164 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/164/])
. Distinguish between failed and killed tasks in
JobTrackerInstrumentation. Contributed by Sharad Agarwal


 JobTrackerInstrumentation.killed{Map/Reduce} is never called
 

 Key: MAPREDUCE-1152
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1152
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.22.0
Reporter: Sharad Agarwal
 Fix For: 0.22.0

 Attachments: 1152.patch, 1152.patch, 1152_v2.patch, 1152_v3.patch


 The JobTrackerInstrumentation.killed{Map/Reduce} metrics added as part of 
 MAPREDUCE-1103 are not captured.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1260) Update Eclipse configuration to match changes to Ivy configuration

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786041#action_12786041
 ] 

Hudson commented on MAPREDUCE-1260:
---

Integrated in Hadoop-Mapreduce-trunk #164 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/164/])


 Update Eclipse configuration to match changes to Ivy configuration
 --

 Key: MAPREDUCE-1260
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1260
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Affects Versions: 0.22.0
Reporter: Edwin Chan
 Fix For: 0.22.0

 Attachments: mapReduceClasspath.patch


 The .eclipse_templates/.classpath file doesn't match the Ivy configuration, 
 so I've updated it to match.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1119) When tasks fail to report status, show tasks's stack dump before killing

2009-12-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786044#action_12786044
 ] 

Hudson commented on MAPREDUCE-1119:
---

Integrated in Hadoop-Mapreduce-trunk #164 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/164/])
. When tasks fail to report status, show tasks's stack dump before killing. 
Contributed by Aaron Kimball.


 When tasks fail to report status, show tasks's stack dump before killing
 

 Key: MAPREDUCE-1119
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1119
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Aaron Kimball
 Fix For: 0.22.0

 Attachments: MAPREDUCE-1119.2.patch, MAPREDUCE-1119.3.patch, 
 MAPREDUCE-1119.4.patch, MAPREDUCE-1119.5.patch, MAPREDUCE-1119.6.patch, 
 MAPREDUCE-1119.patch


 When the TT kills tasks that haven't reported status, it should somehow 
 gather a stack dump for the task. This could be done either by sending a 
 SIGQUIT (so the dump ends up in stdout) or perhaps by using something like 
 JDI to gather the stack directly from Java. This may be somewhat tricky, 
 since the child may be running as another user (so the SIGQUIT would have 
 to go through LinuxTaskController). This feature would make debugging these 
 kinds of failures much easier, especially if we could somehow get it into 
 the TaskDiagnostic message.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1265) Include tasktracker name in the task attempt error log

2009-12-04 Thread Scott Chen (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Chen updated MAPREDUCE-1265:
--

Description: 
When a task attempt receives an error, TaskInProgress logs the task attempt id 
and the diagnosis string in the JobTracker log.
Ex:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: Java 
heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
failed to report status for 601 seconds. Killing!

When we want to debug a machine, for example a blacklisted node, we have to 
use the task attempt id to find this information. This is not very convenient.

It would be nice if we could also log the tasktracker which causes this error. 
This way we can just grep the hostname to quickly find all the relevant error 
messages.

  was:
When task attempt receive an error, TaskInProgress will log the task attempt id 
and diagnosis string in the JobTracker log.
Ex:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: Java 
heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
failed to report status for 601 seconds. Killing!

When we want to debug a machine or a job. We have to use the task attempt id to 
find these information.

It will be much more convenient if  we can just log them together.
This way we can just grep the jobId or hostname to quickly find all the 
relevant error message.

Summary: Include tasktracker name in the task attempt error log  (was: 
Include jobId and hostname in the task attempt error log)

 Include tasktracker name in the task attempt error log
 --

 Key: MAPREDUCE-1265
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1265
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Scott Chen
Assignee: Scott Chen
Priority: Trivial
 Attachments: MAPREDUCE-1265-v2.patch, MAPREDUCE-1265.patch


 When a task attempt receives an error, TaskInProgress logs the task attempt 
 id and the diagnosis string in the JobTracker log.
 Ex:
 2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: 
 Java heap space
 2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
 failed to report status for 601 seconds. Killing!
 When we want to debug a machine, for example a blacklisted node, we have to 
 use the task attempt id to find this information. This is not very 
 convenient. 
 It would be nice if we could also log the tasktracker which causes this 
 error. This way we can just grep the hostname to quickly find all the 
 relevant error messages.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1265) Include tasktracker name in the task attempt error log

2009-12-04 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786066#action_12786066
 ] 

Scott Chen commented on MAPREDUCE-1265:
---

I just realized that the job id is part of the task attempt id, so we can 
easily obtain it. We only need to log the tasktracker name here.

Here is the log after the change:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1 *on tracker_m01.aaa.com*: Error: 
java.lang.OutOfMemoryError: Java heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0 *on tracker_m02.aaa.com*: Task 
attempt_2009__m_000478_0 failed to report status for 601 seconds. 
Killing!
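
The change sketched by those log lines is just a richer message prefix. A hypothetical helper showing the format (not the actual TaskInProgress code; the method name is illustrative):

```java
public class AttemptErrorLog {
    // Build the log message with the tasktracker name included, so the
    // JobTracker log can be grepped by hostname.
    static String format(String attemptId, String trackerName, String diagnostic) {
        return String.format("Error from %s on %s: %s",
                             attemptId, trackerName, diagnostic);
    }

    public static void main(String[] args) {
        System.out.println(format("attempt_2009__r_09_1",
                                  "tracker_m01.aaa.com",
                                  "Error: java.lang.OutOfMemoryError: Java heap space"));
    }
}
```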

 Include tasktracker name in the task attempt error log
 --

 Key: MAPREDUCE-1265
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1265
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.22.0
Reporter: Scott Chen
Assignee: Scott Chen
Priority: Trivial
 Fix For: 0.22.0

 Attachments: MAPREDUCE-1265-v2.patch, MAPREDUCE-1265.patch


 When a task attempt receives an error, TaskInProgress logs the task attempt 
 id and the diagnosis string in the JobTracker log.
 Ex:
 2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: 
 Java heap space
 2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
 failed to report status for 601 seconds. Killing!
 When we want to debug a machine, for example a blacklisted node, we have to 
 use the task attempt id to find this information. This is not very 
 convenient. 
 It would be nice if we could also log the tasktracker which causes this 
 error. This way we can just grep the hostname to quickly find all the 
 relevant error messages.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1265) Include tasktracker name in the task attempt error log

2009-12-04 Thread Scott Chen (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Chen updated MAPREDUCE-1265:
--

Fix Version/s: 0.22.0
Affects Version/s: 0.22.0
   Status: Patch Available  (was: Open)

 Include tasktracker name in the task attempt error log
 --

 Key: MAPREDUCE-1265
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1265
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.22.0
Reporter: Scott Chen
Assignee: Scott Chen
Priority: Trivial
 Fix For: 0.22.0

 Attachments: MAPREDUCE-1265-v2.patch, MAPREDUCE-1265.patch


 When a task attempt receives an error, TaskInProgress logs the task attempt 
 id and the diagnosis string in the JobTracker log.
 Ex:
 2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: 
 Java heap space
 2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
 failed to report status for 601 seconds. Killing!
 When we want to debug a machine, for example a blacklisted node, we have to 
 use the task attempt id to find this information. This is not very 
 convenient. 
 It would be nice if we could also log the tasktracker which causes this 
 error. This way we can just grep the hostname to quickly find all the 
 relevant error messages.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1265) Include tasktracker name in the task attempt error log

2009-12-04 Thread Scott Chen (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Chen updated MAPREDUCE-1265:
--

Description: 
When a task attempt receives an error, TaskInProgress logs the task attempt id 
and the diagnosis string in the JobTracker log.
Ex:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: Java 
heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
failed to report status for 601 seconds. Killing!

When we want to debug a machine, for example a blacklisted node, we have to 
use the task attempt id to find the TT. This is not very convenient.

It would be nice if we could also log the tasktracker which causes this error. 
This way we can just grep the hostname to quickly find all the relevant error 
messages.

  was:
When task attempt receive an error, TaskInProgress will log the task attempt id 
and diagnosis string in the JobTracker log.
Ex:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: Java 
heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
failed to report status for 601 seconds. Killing!

When we want to debug a machine for example, a blacklisted node.
We have to use the task attempt id to find these information. This is not very 
convenient. 

It will be nice if  we can also log the tasktracker which cauces this error.
This way we can just grep the hostname to quickly find all the relevant error 
message.


 Include tasktracker name in the task attempt error log
 --

 Key: MAPREDUCE-1265
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1265
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.22.0
Reporter: Scott Chen
Assignee: Scott Chen
Priority: Trivial
 Fix For: 0.22.0

 Attachments: MAPREDUCE-1265-v2.patch, MAPREDUCE-1265.patch


 When a task attempt receives an error, TaskInProgress logs the task attempt 
 id and the diagnosis string in the JobTracker log.
 Ex:
 2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: 
 Java heap space
 2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
 failed to report status for 601 seconds. Killing!
 When we want to debug a machine, for example a blacklisted node, we have to 
 use the task attempt id to find the TT. This is not very convenient. 
 It would be nice if we could also log the tasktracker which causes this 
 error. This way we can just grep the hostname to quickly find all the 
 relevant error messages.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1265) Include tasktracker name in the task attempt error log

2009-12-04 Thread Scott Chen (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Chen updated MAPREDUCE-1265:
--

Description: 
When a task attempt receives an error, TaskInProgress will log the task attempt 
id and diagnosis string in the JobTracker log.
Ex:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: Java 
heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
failed to report status for 601 seconds. Killing!

When we want to debug a machine (for example, a node that has been blacklisted in 
the past few days), we have to use the task attempt id to find the TT. This is 
not very convenient. 

It would be nice if we could also log the tasktracker that causes this error.
That way we can just grep for the hostname to quickly find all the relevant error 
messages.

  was:
When task attempt receive an error, TaskInProgress will log the task attempt id 
and diagnosis string in the JobTracker log.
Ex:
2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: Java 
heap space
2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
failed to report status for 601 seconds. Killing!

When we want to debug a machine for example, a blacklisted node.
We have to use the task attempt id to find the TT. This is not very convenient. 

It will be nice if  we can also log the tasktracker which cauces this error.
This way we can just grep the hostname to quickly find all the relevant error 
message.


 Include tasktracker name in the task attempt error log
 --

 Key: MAPREDUCE-1265
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1265
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.22.0
Reporter: Scott Chen
Assignee: Scott Chen
Priority: Trivial
 Fix For: 0.22.0

 Attachments: MAPREDUCE-1265-v2.patch, MAPREDUCE-1265.patch


 When a task attempt receives an error, TaskInProgress will log the task 
 attempt id and diagnosis string in the JobTracker log.
 Ex:
 2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: 
 Java heap space
 2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
 failed to report status for 601 seconds. Killing!
 When we want to debug a machine (for example, a node that has been blacklisted 
 in the past few days), we have to use the task attempt id to find the TT. This 
 is not very convenient. 
 It would be nice if we could also log the tasktracker that causes this error.
 That way we can just grep for the hostname to quickly find all the relevant 
 error messages.




[jira] Commented: (MAPREDUCE-967) TaskTracker does not need to fully unjar job jars

2009-12-04 Thread Tom White (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786077#action_12786077
 ] 

Tom White commented on MAPREDUCE-967:
-

+1 This looks good to me.

bq. One question for reviewer: the constant for the new configuration key is in 
JobContext, whereas the default is in JobConf. I was following some other 
examples from the code, but it seems a little bit messy here. Where are the 
right places to add new configuration parameters that work in both APIs?

The key should certainly go in JobContext, but where the default is located is 
less clear. Defaults tend to be defined in the class in which they are used, 
which is JobConf in this case. However, JobConf is deprecated and will disappear, 
although it may still be used by the implementation (i.e. not be part of the 
public API), in which case what you have done is fine.
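The convention being discussed can be sketched minimally. The key name, default pattern, and class names below are invented for illustration (they are not the actual MAPREDUCE-967 constants): the key string sits with the public API constants, while the default is defined next to the code that reads it:

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of the "key in JobContext, default in the consumer" convention.
// UNPACK_PATTERN_KEY and its default value are hypothetical examples.
public class ConfPlacementSketch {
    // Public API side (JobContext in MapReduce): the configuration key name.
    public static final String UNPACK_PATTERN_KEY =
            "mapreduce.job.jar.unpack.pattern";
    // Consumer side (JobConf here): the default, defined where it is used.
    static final String UNPACK_PATTERN_DEFAULT = "(?:classes/|lib/).*";

    static String unpackPattern(Map<String, String> conf) {
        return conf.getOrDefault(UNPACK_PATTERN_KEY, UNPACK_PATTERN_DEFAULT);
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        System.out.println(unpackPattern(conf)); // falls back to the default
        conf.put(UNPACK_PATTERN_KEY, "lib/.*");
        System.out.println(unpackPattern(conf)); // explicit setting wins
    }
}
```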



 TaskTracker does not need to fully unjar job jars
 -

 Key: MAPREDUCE-967
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-967
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: tasktracker
Affects Versions: 0.21.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
 Attachments: mapreduce-967-branch-0.20.txt, mapreduce-967.txt, 
 mapreduce-967.txt, mapreduce-967.txt


 In practice we have seen some users submitting job jars that consist of 
 10,000+ classes. Unpacking these jars into mapred.local.dir and then cleaning 
 up after them has a significant cost (both in wall clock and in unnecessary 
 heavy disk utilization). This cost can be easily avoided.




[jira] Commented: (MAPREDUCE-181) Secure job submission

2009-12-04 Thread Devaraj Das (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786085#action_12786085
 ] 

Devaraj Das commented on MAPREDUCE-181:
---

On the failing tests: the failure of TestGridmixSubmission is a known issue. The 
other two tests don't fail on my local machine.

 Secure job submission 
 --

 Key: MAPREDUCE-181
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-181
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amar Kamat
Assignee: Devaraj Das
 Fix For: 0.22.0

 Attachments: 181-1.patch, 181-2.patch, 181-3.patch, 181-3.patch, 
 181-4.patch, hadoop-3578-branch-20-example-2.patch, 
 hadoop-3578-branch-20-example.patch, HADOOP-3578-v2.6.patch, 
 HADOOP-3578-v2.7.patch, MAPRED-181-v3.32.patch, MAPRED-181-v3.8.patch


 Currently the jobclient accesses the {{mapred.system.dir}} to add job 
 details. Hence the {{mapred.system.dir}} has the permissions of 
 {{rwx-wx-wx}}. This could be a security loophole where the job files might 
 get overwritten/tampered after the job submission. 




[jira] Commented: (MAPREDUCE-177) Hadoop performance degrades significantly as more and more jobs complete

2009-12-04 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786104#action_12786104
 ] 

Allen Wittenauer commented on MAPREDUCE-177:


What is the latest status of this patch?  It doesn't appear to be committed or, 
heck, even resolved as to how the fix is going to be applied.

 Hadoop performance degrades significantly as more and more jobs complete
 

 Key: MAPREDUCE-177
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-177
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Runping Qi
Assignee: Ioannis Koltsidas
Priority: Critical
 Attachments: HADOOP-4766-v1.patch, HADOOP-4766-v2.10.patch, 
 HADOOP-4766-v2.4.patch, HADOOP-4766-v2.6.patch, HADOOP-4766-v2.7-0.18.patch, 
 HADOOP-4766-v2.7-0.19.patch, HADOOP-4766-v2.7.patch, 
 HADOOP-4766-v2.8-0.18.patch, HADOOP-4766-v2.8-0.19.patch, 
 HADOOP-4766-v2.8.patch, HADOOP-4766-v3.4-0.19.patch, map_scheduling_rate.txt


 When I ran the gridmix 2 benchmark load on a fresh cluster of 500 nodes with 
 hadoop trunk, 
 the gridmix load, consisting of 202 map/reduce jobs of various sizes, 
 completed in 32 minutes. 
 Then I ran the same set of jobs on the same cluster; they completed in 43 
 minutes.
 When I ran them a third time, it took (almost) forever --- the job tracker 
 became non-responsive.
 The job tracker's heap size was set to 2GB. 
 The cluster is configured to keep up to 500 jobs in memory.
 The job tracker kept one CPU busy all the time. It looks like this was due to GC.
 I believe releases 0.18 and 0.19 have similar behavior.




[jira] Updated: (MAPREDUCE-1241) JobTracker should not crash when mapred-queues.xml does not exist

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated MAPREDUCE-1241:
---

Attachment: mapreduce-1241.txt

Adds license to mapred-queues-default.xml. Since we're now treating them as 
separate files, I also got rid of all of the documentation-y comments from 
-default.

 JobTracker should not crash when mapred-queues.xml does not exist
 -

 Key: MAPREDUCE-1241
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1241
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Todd Lipcon
Priority: Blocker
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1241.txt, mapreduce-1241.txt


 Currently, if you bring up the JobTracker on an old configuration directory, 
 it gets a NullPointerException looking for the mapred-queues.xml file. It 
 should just assume a default queue and continue.




[jira] Commented: (MAPREDUCE-1230) Vertica streaming adapter doesn't handle nulls in all cases

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786163#action_12786163
 ] 

Hadoop QA commented on MAPREDUCE-1230:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12425750/MAPREDUCE-1230.patch
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 9 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/165/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/165/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/165/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/165/console

This message is automatically generated.

 Vertica streaming adapter doesn't handle nulls in all cases
 ---

 Key: MAPREDUCE-1230
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1230
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.0+
Reporter: Omer Trajman
Assignee: Omer Trajman
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1230.patch


 A test user reported that the Vertica adapter throws an NPE when retrieving 
 null values for certain types (binary and numeric both reported).  There is no 
 special-case handling when serializing nulls.




[jira] Updated: (MAPREDUCE-1174) Sqoop improperly handles table/column names which are reserved sql words

2009-12-04 Thread Aaron Kimball (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-1174:
-

Attachment: MAPREDUCE-1174.2.patch

Freshly cut patch.

 Sqoop improperly handles table/column names which are reserved sql words
 

 Key: MAPREDUCE-1174
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1174
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: Aaron Kimball
 Attachments: MAPREDUCE-1174.2.patch, MAPREDUCE-1174.patch


 In some databases it is legal to name tables and columns with terms that 
 overlap SQL reserved keywords (e.g., {{CREATE}}, {{table}}, etc.). In such 
 cases, the database allows you to escape the table and column names. We 
 should always escape table and column names when possible.




[jira] Updated: (MAPREDUCE-1174) Sqoop improperly handles table/column names which are reserved sql words

2009-12-04 Thread Aaron Kimball (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-1174:
-

Assignee: zhiyong zhang  (was: Aaron Kimball)
  Status: Patch Available  (was: Open)

 Sqoop improperly handles table/column names which are reserved sql words
 

 Key: MAPREDUCE-1174
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1174
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: zhiyong zhang
 Attachments: MAPREDUCE-1174.2.patch, MAPREDUCE-1174.patch


 In some databases it is legal to name tables and columns with terms that 
 overlap SQL reserved keywords (e.g., {{CREATE}}, {{table}}, etc.). In such 
 cases, the database allows you to escape the table and column names. We 
 should always escape table and column names when possible.




[jira] Reopened: (MAPREDUCE-1244) eclipse-plugin fails with missing dependencies

2009-12-04 Thread Owen O'Malley (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley reopened MAPREDUCE-1244:
--


We need to apply this to 0.21 also.

 eclipse-plugin fails with missing dependencies
 --

 Key: MAPREDUCE-1244
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1244
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Affects Versions: 0.22.0
Reporter: Giridharan Kesavan
Assignee: Giridharan Kesavan
 Fix For: 0.21.0, 0.22.0

 Attachments: mapred-1244.patch







[jira] Resolved: (MAPREDUCE-1244) eclipse-plugin fails with missing dependencies

2009-12-04 Thread Owen O'Malley (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley resolved MAPREDUCE-1244.
--

   Resolution: Fixed
Fix Version/s: 0.21.0
 Hadoop Flags: [Reviewed]

We need to apply this fix to 0.21 also.

 eclipse-plugin fails with missing dependencies
 --

 Key: MAPREDUCE-1244
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1244
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Affects Versions: 0.22.0
Reporter: Giridharan Kesavan
Assignee: Giridharan Kesavan
 Fix For: 0.21.0, 0.22.0

 Attachments: mapred-1244.patch







[jira] Commented: (MAPREDUCE-1265) Include tasktracker name in the task attempt error log

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786201#action_12786201
 ] 

Hadoop QA commented on MAPREDUCE-1265:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12426933/MAPREDUCE-1265-v2.patch
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/291/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/291/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/291/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/291/console

This message is automatically generated.

 Include tasktracker name in the task attempt error log
 --

 Key: MAPREDUCE-1265
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1265
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.22.0
Reporter: Scott Chen
Assignee: Scott Chen
Priority: Trivial
 Fix For: 0.22.0

 Attachments: MAPREDUCE-1265-v2.patch, MAPREDUCE-1265.patch


 When a task attempt receives an error, TaskInProgress will log the task 
 attempt id and diagnosis string in the JobTracker log.
 Ex:
 2009-xx-xx 23:50:45,994 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__r_09_1: Error: java.lang.OutOfMemoryError: 
 Java heap space
 2009-xx-xx 22:53:53,146 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_2009__m_000478_0: Task attempt_2009__m_000478_0 
 failed to report status for 601 seconds. Killing!
 When we want to debug a machine (for example, a node that has been blacklisted 
 in the past few days), we have to use the task attempt id to find the TT. This 
 is not very convenient. 
 It would be nice if we could also log the tasktracker that causes this error.
 That way we can just grep for the hostname to quickly find all the relevant 
 error messages.




[jira] Created: (MAPREDUCE-1266) Allow heartbeat interval smaller than 3 seconds for tiny clusters

2009-12-04 Thread Todd Lipcon (JIRA)
Allow heartbeat interval smaller than 3 seconds for tiny clusters
-

 Key: MAPREDUCE-1266
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1266
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker, task, tasktracker
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Priority: Minor


For small clusters, the heartbeat interval has a large effect on job latency. 
This is especially true on pseudo-distributed or other tiny (<5 nodes) 
clusters. It's not a big deal for production, but new users would have a 
happier first experience if Hadoop seemed snappier.

I'd like to change the minimum heartbeat interval from 3.0 seconds to perhaps 
0.5 seconds (but have it governed by an undocumented config parameter in case 
people don't like this change). The cluster size-based ramp up of interval will 
maintain the current scalable behavior for large clusters with no negative 
effect.




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786211#action_12786211
 ] 

Todd Lipcon commented on MAPREDUCE-1114:


bq. I don't think the 15 second payoff justifies the maintenance cost of a 
custom caching layer for ivy.

Comparing the 15 second payoff to the full build time isn't particularly 
important to me. For me, the ability to quickly iterate on code while 
recompiling and rerunning unit tests is the big payoff - so I look at this as a 
60% speedup in my development cycle rather than a few % speedup in the full 
build.

I may be in the minority, though, as I don't use Eclipse or any other 
fancy IDE that does incremental compilation.

Anyone else care to chime in?

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Commented: (MAPREDUCE-1266) Allow heartbeat interval smaller than 3 seconds for tiny clusters

2009-12-04 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786213#action_12786213
 ] 

Allen Wittenauer commented on MAPREDUCE-1266:
-

I'm probably being forgetful, but... we have:

a) heartbeat interval
b) minimum heartbeat interval

such that

a >= b, always.

If someone doesn't like b, does it matter?  Wouldn't they just tune a?  I guess 
I'm asking: why make b configurable at all?

 Allow heartbeat interval smaller than 3 seconds for tiny clusters
 -

 Key: MAPREDUCE-1266
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1266
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker, task, tasktracker
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Priority: Minor

 For small clusters, the heartbeat interval has a large effect on job latency. 
 This is especially true on pseudo-distributed or other tiny (<5 nodes) 
 clusters. It's not a big deal for production, but new users would have a 
 happier first experience if Hadoop seemed snappier.
 I'd like to change the minimum heartbeat interval from 3.0 seconds to perhaps 
 0.5 seconds (but have it governed by an undocumented config parameter in case 
 people don't like this change). The cluster size-based ramp up of interval 
 will maintain the current scalable behavior for large clusters with no 
 negative effect.




[jira] Commented: (MAPREDUCE-1257) Ability to grab the number of spills

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786216#action_12786216
 ] 

Todd Lipcon commented on MAPREDUCE-1257:


Chris: I don't feel strongly about this. I like it for the exact reason you 
mentioned - it makes it easier to tune io.sort.record.percent (or at least to 
see at a glance whether such tuning could help). My plan was to backport it into 
our distribution for 20, where a backport of MAPREDUCE-64 is pretty unlikely 
since that change is much riskier.

If no one else wants this, happy to resolve as wontfix. Would be interested to 
hear from the original reporter, though, before doing so.

 Ability to grab the number of spills
 

 Key: MAPREDUCE-1257
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1257
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Affects Versions: 0.22.0
Reporter: Sriranjan Manjunath
Assignee: Todd Lipcon
 Fix For: 0.22.0

 Attachments: mapreduce-1257.txt


 The counters should have information about the number of spills in addition 
 to the number of spill records.




[jira] Updated: (MAPREDUCE-1097) Changes/fixes to support Vertica 3.5

2009-12-04 Thread Omer Trajman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Omer Trajman updated MAPREDUCE-1097:


Status: Open  (was: Patch Available)

wrong target

 Changes/fixes to support Vertica 3.5
 

 Key: MAPREDUCE-1097
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1097
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.5
Reporter: Omer Trajman
Assignee: Omer Trajman
Priority: Minor
 Attachments: MAPREDUCE-1097.patch


 Vertica 3.5 includes three changes that the formatters should handle:
 1) deploy_design function that handles much of the logic in the optimize 
 method.  This improvement uses deploy_design if the server version supports 
 it instead of orchestrating in the formatter function.
 2) truncate table instead of recreating the table
 3) numeric, decimal, money, number types (all the same path)




[jira] Updated: (MAPREDUCE-1097) Changes/fixes to support Vertica 3.5

2009-12-04 Thread Omer Trajman (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Omer Trajman updated MAPREDUCE-1097:


Fix Version/s: 0.21.0
   Status: Patch Available  (was: Open)

 Changes/fixes to support Vertica 3.5
 

 Key: MAPREDUCE-1097
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1097
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.5
Reporter: Omer Trajman
Assignee: Omer Trajman
Priority: Minor
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1097.patch


 Vertica 3.5 includes three changes that the formatters should handle:
 1) deploy_design function that handles much of the logic in the optimize 
 method.  This improvement uses deploy_design if the server version supports 
 it instead of orchestrating in the formatter function.
 2) truncate table instead of recreating the table
 3) numeric, decimal, money, number types (all the same path)




[jira] Commented: (MAPREDUCE-1266) Allow heartbeat interval smaller than 3 seconds for tiny clusters

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786219#action_12786219
 ] 

Todd Lipcon commented on MAPREDUCE-1266:


Well, actually, in trunk there's mapreduce.jobtracker.heartbeats.in.second, 
which tunes the individual trackers so that that many heartbeats arrive every 
second. The default is 100, which would be a 10ms interval for a 
pseudo-distributed cluster, which is silly. So there's a minimum as well, 
hardcoded. Here's the relevant code:
{code}
int heartbeatInterval = Math.max(
    (int)(1000 * HEARTBEATS_SCALING_FACTOR *
          Math.ceil((double)clusterSize / NUM_HEARTBEATS_IN_SECOND)),
    HEARTBEAT_INTERVAL_MIN);
{code}

HEARTBEAT_INTERVAL_MIN is hardcoded to 3 seconds in MRConstants.java.
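The formula can be tried standalone. In this hedged sketch, HEARTBEATS_SCALING_FACTOR is assumed to be 1.0 purely for illustration (the real constant lives in the JobTracker code); only the 3-second floor and 100 heartbeats/second come from the discussion above:

```java
// Standalone sketch of the quoted heartbeat-interval formula.
// HEARTBEATS_SCALING_FACTOR = 1.0 is an assumed value for illustration.
public class HeartbeatIntervalSketch {
    static final int HEARTBEAT_INTERVAL_MIN = 3000;      // 3 s floor (MRConstants)
    static final int NUM_HEARTBEATS_IN_SECOND = 100;     // trunk default
    static final double HEARTBEATS_SCALING_FACTOR = 1.0; // assumed value

    // Heartbeat interval in milliseconds for a cluster of clusterSize trackers.
    static int heartbeatInterval(int clusterSize) {
        return Math.max(
            (int) (1000 * HEARTBEATS_SCALING_FACTOR *
                   Math.ceil((double) clusterSize / NUM_HEARTBEATS_IN_SECOND)),
            HEARTBEAT_INTERVAL_MIN);
    }

    public static void main(String[] args) {
        // 1 tracker (pseudo-distributed): the ceil term gives 1000 ms,
        // but the result is clamped up to the 3000 ms floor.
        System.out.println(heartbeatInterval(1));
        // 500 trackers: ceil(500/100) = 5 s, above the floor.
        System.out.println(heartbeatInterval(500));
    }
}
```

With a 0.5 s floor instead of 3 s, the one-tracker case would heartbeat every 1000 ms under these assumptions, which is what would make tiny clusters feel snappier while leaving large clusters governed by the ramp-up term.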

Maybe I'm misunderstanding your question - are you in support of lowering the 
minimum and just asking why make it undocumented-configurable instead of 
hardcoded? I was offering the undocumented configuration option just in case 
someone had an argument against this change. If everyone's for it, happy to 
just change the constant.

 Allow heartbeat interval smaller than 3 seconds for tiny clusters
 -

 Key: MAPREDUCE-1266
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1266
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker, task, tasktracker
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Priority: Minor

 For small clusters, the heartbeat interval has a large effect on job latency. 
 This is especially true on pseudo-distributed or other tiny (<5 nodes) 
 clusters. It's not a big deal for production, but new users would have a 
 happier first experience if Hadoop seemed snappier.
 I'd like to change the minimum heartbeat interval from 3.0 seconds to perhaps 
 0.5 seconds (but have it governed by an undocumented config parameter in case 
 people don't like this change). The cluster size-based ramp up of interval 
 will maintain the current scalable behavior for large clusters with no 
 negative effect.




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Doug Cutting (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786225#action_12786225
 ] 

Doug Cutting commented on MAPREDUCE-1114:
-

 I look at this as a 60% speedup in my development cycle rather than a few % 
 speedup in the full build.

I agree with this logic.  My most common development cycle is to run a single 
unit test.  For Avro this takes just a few seconds, and I'm willing to wait 
without finding a new task to work on.  With Hadoop it takes long enough that 
I switch to doing something else, lose my context, etc.  Improving this will 
significantly improve many developers' productivity.

I wonder if we can simply check whether build/ivy/lib/Hadoop-Hdfs/{common,test} 
exist and, if they do, assume they're up-to-date, and only run Ivy 
otherwise.  Might that be simpler?
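That check can be sketched as follows. The directory layout is taken from the comment above; the real build would more likely express this as an Ant condition than as Java, so this is illustration only:

```java
import java.io.File;

// Sketch of the proposed shortcut: if the resolved-library directories
// already exist, assume they are up-to-date and skip ivy:resolve.
public class IvySkipSketch {
    static boolean needsResolve(File buildDir) {
        File libs = new File(buildDir, "ivy/lib/Hadoop-Hdfs");
        return !(new File(libs, "common").isDirectory()
                && new File(libs, "test").isDirectory());
    }

    public static void main(String[] args) {
        // On a clean tree the directories are absent, so resolve must run.
        System.out.println(needsResolve(new File("build")));
    }
}
```

The trade-off is the one implied above: a stale cache is never detected, so a "clean-cache" target (or deleting build/) would be needed whenever dependencies actually change.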


 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Updated: (MAPREDUCE-1267) Fix typo in mapred-default.xml

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated MAPREDUCE-1267:
---

Attachment: mapreduce-1267.txt

Should be committed to both 0.21 and trunk

 Fix typo in mapred-default.xml
 --

 Key: MAPREDUCE-1267
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1267
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0, 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1267.txt


 There's a typo of mapreduce.client.progerssmonitor.pollinterval instead of 
 mapreduce.client.progressmonitor.pollinterval in mapred-default. Trivial 
 patch to fix.




[jira] Created: (MAPREDUCE-1267) Fix typo in mapred-default.xml

2009-12-04 Thread Todd Lipcon (JIRA)
Fix typo in mapred-default.xml
--

 Key: MAPREDUCE-1267
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1267
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0, 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Fix For: 0.21.0, 0.22.0
 Attachments: mapreduce-1267.txt

There's a typo of mapreduce.client.progerssmonitor.pollinterval instead of 
mapreduce.client.progressmonitor.pollinterval in mapred-default. Trivial patch 
to fix.




[jira] Updated: (MAPREDUCE-1267) Fix typo in mapred-default.xml

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated MAPREDUCE-1267:
---

Status: Patch Available  (was: Open)

 Fix typo in mapred-default.xml
 --

 Key: MAPREDUCE-1267
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1267
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0, 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1267.txt


 There's a typo of mapreduce.client.progerssmonitor.pollinterval instead of 
 mapreduce.client.progressmonitor.pollinterval in mapred-default. Trivial 
 patch to fix.




[jira] Commented: (MAPREDUCE-1155) Streaming tests swallow exceptions

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786232#action_12786232
 ] 

Todd Lipcon commented on MAPREDUCE-1155:


Chris: mind if we do that in a separate JIRA? I opened MAPREDUCE-1268. We may 
as well fix the broken tests now when there's a patch that applies and passes, 
and worry about style separately.

 Streaming tests swallow exceptions
 --

 Key: MAPREDUCE-1155
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1155
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/streaming
Affects Versions: 0.20.1, 0.21.0, 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1155.txt


 Many of the streaming tests (including TestMultipleArchiveFiles) catch 
 exceptions and print their stack trace rather than failing the job. This 
 means that tests do not fail even when the job fails.




[jira] Created: (MAPREDUCE-1268) Update streaming tests to JUnit 4 style

2009-12-04 Thread Todd Lipcon (JIRA)
Update streaming tests to JUnit 4 style
---

 Key: MAPREDUCE-1268
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1268
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/streaming, test
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon


Suggested by Chris in MAPREDUCE-1155




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786234#action_12786234
 ] 

Todd Lipcon commented on MAPREDUCE-1114:


Doug: the slowness is actually in the resolve task which generates the various 
classpath properties in ant. Without caching those properties to disk, there's 
no way to get around running ivy that I can think of. This patch essentially 
persists them to disk between runs, since the majority of the time they don't 
change.

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Assigned: (MAPREDUCE-576) writing to status reporter before consuming standard input causes task failure.

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon reassigned MAPREDUCE-576:
-

Assignee: Todd Lipcon

 writing to status reporter before consuming standard input causes task 
 failure.
 ---

 Key: MAPREDUCE-576
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-576
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/streaming
Affects Versions: 0.20.1
 Environment: amazon ec2 instance created with the given scripts 
 (fedora, small)
Reporter: Karl Anderson
Assignee: Todd Lipcon

 A Hadoop Streaming task which writes a status reporter line before consuming 
 input causes the task to fail.  Writing after consuming input does not fail.
 I caused this failure using a Python reducer and writing a 
 reporter:status:foo\n line to stderr.  Didn't try writing anything else.
 The reducer script which fails:
   #!/usr/bin/env python
   import sys
   if __name__ == "__main__":
       sys.stderr.write('reporter:status:foo\n')
       sys.stderr.flush()
       for line in sys.stdin:
           print line
 The reducer script which succeeds:
   #!/usr/bin/env python
   import sys
   if __name__ == "__main__":
       for line in sys.stdin:
           sys.stderr.write('reporter:status:foo\n')
           sys.stderr.flush()
           print line
 The hadoop invocation which I used:
 hadoop jar 
 /usr/local/hadoop-0.18.1/contrib/streaming/hadoop-0.18.1-streaming.jar 
 -mapper cat -reducer ./reducer_foo.py -input vectors -output clusters_1 
 -jobconf mapred.map.tasks=512 -jobconf mapred.reduce.tasks=512 -file 
 ./reducer_foo.py
 This is on a 64 node hadoop-ec2 cluster.
 One of the errors listed on the failures page (they all appear to be the 
 same):
 java.io.IOException: subprocess exited successfully
 R/W/S=1/0/0 in:0=1/41 [rec/s] out:0=0/41 [rec/s]
 minRecWrittenToEnableSkip_=9223372036854775807 LOGNAME=null
 HOST=null
 USER=root
 HADOOP_USER=null
 last Hadoop input: |null|
 last tool output: |null|
 Date: Mon Oct 20 19:13:38 EDT 2008
 MROutput/MRErrThread failed:java.lang.NullPointerException
   at 
 org.apache.hadoop.streaming.PipeMapRed$MRErrorThread.setStatus(PipeMapRed.java:497)
   at 
 org.apache.hadoop.streaming.PipeMapRed$MRErrorThread.run(PipeMapRed.java:429)
   at org.apache.hadoop.streaming.PipeReducer.reduce(PipeReducer.java:103)
   at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:318)
   at 
 org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2207)
 The stderr log for a failed task:
 Exception in thread Timer thread for monitoring mapred 
 java.lang.NullPointerException
   at 
 org.apache.hadoop.metrics.ganglia.GangliaContext.xdr_string(GangliaContext.java:195)
   at 
 org.apache.hadoop.metrics.ganglia.GangliaContext.emitMetric(GangliaContext.java:138)
   at 
 org.apache.hadoop.metrics.ganglia.GangliaContext.emitRecord(GangliaContext.java:123)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext.emitRecords(AbstractMetricsContext.java:304)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext.timerEvent(AbstractMetricsContext.java:290)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext.access$000(AbstractMetricsContext.java:50)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext$1.run(AbstractMetricsContext.java:249)
   at java.util.TimerThread.mainLoop(Timer.java:512)
   at java.util.TimerThread.run(Timer.java:462)




[jira] Commented: (MAPREDUCE-1241) JobTracker should not crash when mapred-queues.xml does not exist

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786249#action_12786249
 ] 

Hadoop QA commented on MAPREDUCE-1241:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12426955/mapreduce-1241.txt
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/166/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/166/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/166/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/166/console

This message is automatically generated.

 JobTracker should not crash when mapred-queues.xml does not exist
 -

 Key: MAPREDUCE-1241
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1241
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Todd Lipcon
Priority: Blocker
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1241.txt, mapreduce-1241.txt


 Currently, if you bring up the JobTracker on an old configuration directory, 
 it gets a NullPointerException looking for the mapred-queues.xml file. It 
 should just assume a default queue and continue.
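The suggested behavior amounts to a missing-file fallback. A hedged Python sketch (the file name is real; the helper and its line-based parsing are stand-ins for illustration, not Hadoop's XML loader):

```python
import os

def load_queue_names(path):
    # Fall back to a single default queue instead of crashing with an
    # NPE when the config file is absent (e.g. an old conf directory).
    if not os.path.exists(path):
        return ["default"]
    with open(path) as f:                      # stand-in for the XML parser
        return [line.strip() for line in f if line.strip()]

print(load_queue_names("/old-conf/mapred-queues.xml"))  # ['default']
```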




[jira] Resolved: (MAPREDUCE-576) writing to status reporter before consuming standard input causes task failure.

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon resolved MAPREDUCE-576.
---

Resolution: Duplicate

 writing to status reporter before consuming standard input causes task 
 failure.
 ---

 Key: MAPREDUCE-576
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-576
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/streaming
Affects Versions: 0.20.1
 Environment: amazon ec2 instance created with the given scripts 
 (fedora, small)
Reporter: Karl Anderson
Assignee: Todd Lipcon

 A Hadoop Streaming task which writes a status reporter line before consuming 
 input causes the task to fail.  Writing after consuming input does not fail.
 I caused this failure using a Python reducer and writing a 
 reporter:status:foo\n line to stderr.  Didn't try writing anything else.
 The reducer script which fails:
   #!/usr/bin/env python
   import sys
   if __name__ == "__main__":
       sys.stderr.write('reporter:status:foo\n')
       sys.stderr.flush()
       for line in sys.stdin:
           print line
 The reducer script which succeeds:
   #!/usr/bin/env python
   import sys
   if __name__ == "__main__":
       for line in sys.stdin:
           sys.stderr.write('reporter:status:foo\n')
           sys.stderr.flush()
           print line
 The hadoop invocation which I used:
 hadoop jar 
 /usr/local/hadoop-0.18.1/contrib/streaming/hadoop-0.18.1-streaming.jar 
 -mapper cat -reducer ./reducer_foo.py -input vectors -output clusters_1 
 -jobconf mapred.map.tasks=512 -jobconf mapred.reduce.tasks=512 -file 
 ./reducer_foo.py
 This is on a 64 node hadoop-ec2 cluster.
 One of the errors listed on the failures page (they all appear to be the 
 same):
 java.io.IOException: subprocess exited successfully
 R/W/S=1/0/0 in:0=1/41 [rec/s] out:0=0/41 [rec/s]
 minRecWrittenToEnableSkip_=9223372036854775807 LOGNAME=null
 HOST=null
 USER=root
 HADOOP_USER=null
 last Hadoop input: |null|
 last tool output: |null|
 Date: Mon Oct 20 19:13:38 EDT 2008
 MROutput/MRErrThread failed:java.lang.NullPointerException
   at 
 org.apache.hadoop.streaming.PipeMapRed$MRErrorThread.setStatus(PipeMapRed.java:497)
   at 
 org.apache.hadoop.streaming.PipeMapRed$MRErrorThread.run(PipeMapRed.java:429)
   at org.apache.hadoop.streaming.PipeReducer.reduce(PipeReducer.java:103)
   at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:318)
   at 
 org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2207)
 The stderr log for a failed task:
 Exception in thread Timer thread for monitoring mapred 
 java.lang.NullPointerException
   at 
 org.apache.hadoop.metrics.ganglia.GangliaContext.xdr_string(GangliaContext.java:195)
   at 
 org.apache.hadoop.metrics.ganglia.GangliaContext.emitMetric(GangliaContext.java:138)
   at 
 org.apache.hadoop.metrics.ganglia.GangliaContext.emitRecord(GangliaContext.java:123)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext.emitRecords(AbstractMetricsContext.java:304)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext.timerEvent(AbstractMetricsContext.java:290)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext.access$000(AbstractMetricsContext.java:50)
   at 
 org.apache.hadoop.metrics.spi.AbstractMetricsContext$1.run(AbstractMetricsContext.java:249)
   at java.util.TimerThread.mainLoop(Timer.java:512)
   at java.util.TimerThread.run(Timer.java:462)




[jira] Commented: (MAPREDUCE-1241) JobTracker should not crash when mapred-queues.xml does not exist

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786256#action_12786256
 ] 

Todd Lipcon commented on MAPREDUCE-1241:


bq. -1 core tests. The patch failed core unit tests.

Failed org.apache.hadoop.mapred.TestMiniMRWithDFS.testWithDFSWithDefaultPort, 
which is different from the failure in the last build, and entirely unrelated.

 JobTracker should not crash when mapred-queues.xml does not exist
 -

 Key: MAPREDUCE-1241
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1241
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Owen O'Malley
Assignee: Todd Lipcon
Priority: Blocker
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1241.txt, mapreduce-1241.txt


 Currently, if you bring up the JobTracker on an old configuration directory, 
 it gets a NullPointerException looking for the mapred-queues.xml file. It 
 should just assume a default queue and continue.




[jira] Commented: (MAPREDUCE-1254) job.xml should add crc check in tasktracker and sub jvm.

2009-12-04 Thread Zheng Shao (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786259#action_12786259
 ] 

Zheng Shao commented on MAPREDUCE-1254:
---

Got it. It seems a good idea to read and check the checksum.
Will you upload a patch including a simple test case?
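The proposed check is easy to sketch. This uses Python's zlib.crc32 purely for illustration (Hadoop's ChecksumFileSystem records CRCs in its own .crc sidecar format): recompute the checksum of the bytes actually read and compare it against the recorded value, so a disk error surfaces as an IOException instead of a silently empty or corrupt job.xml.

```python
import zlib

def read_with_crc(data, expected_crc):
    # Reject the payload when its CRC disagrees with the recorded one.
    actual = zlib.crc32(data) & 0xFFFFFFFF
    if actual != expected_crc:
        raise IOError("job.xml checksum mismatch: corrupt or truncated file")
    return data

payload = b"<configuration>...</configuration>"
recorded = zlib.crc32(payload) & 0xFFFFFFFF
assert read_with_crc(payload, recorded) == payload   # intact file passes
try:
    read_with_crc(b"", recorded)                     # disk error: empty file
except IOError as e:
    print(e)
```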


 job.xml should add crc check in tasktracker and sub jvm.
 

 Key: MAPREDUCE-1254
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1254
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: task, tasktracker
Affects Versions: 0.22.0
Reporter: ZhuGuanyin

 Currently job.xml in the tasktracker and sub-jvm is written to local disk 
 through ChecksumFileSystem, so crc checksum information already exists, but 
 the job.xml file is loaded without a crc check. A disk error could therefore 
 let a mapred job finish successfully but produce wrong data.  Example: the 
 tasktracker and sub-task jvm fall back to the default configuration if they 
 fail to load job.xml, which may silently replace the mapper with 
 IdentityMapper. 




[jira] Commented: (MAPREDUCE-1254) job.xml should add crc check in tasktracker and sub jvm.

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786261#action_12786261
 ] 

Todd Lipcon commented on MAPREDUCE-1254:


Curious why the XML reading doesn't fail for an empty file. Emptiness is not 
valid XML, right?
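Todd's intuition can be checked with any conforming XML parser (Python's xml.etree here as a stand-in; whether Hadoop's Configuration loader behaves the same is the open question): an empty document has no root element and is not well-formed.

```python
import xml.etree.ElementTree as ET

def parses(text):
    # A well-formed document needs exactly one root element;
    # the empty string has none, so parsing must fail.
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False

print(parses("<configuration/>"))  # True
print(parses(""))                  # False: "no element found"
```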

 job.xml should add crc check in tasktracker and sub jvm.
 

 Key: MAPREDUCE-1254
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1254
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: task, tasktracker
Affects Versions: 0.22.0
Reporter: ZhuGuanyin

 Currently job.xml in the tasktracker and sub-jvm is written to local disk 
 through ChecksumFileSystem, so crc checksum information already exists, but 
 the job.xml file is loaded without a crc check. A disk error could therefore 
 let a mapred job finish successfully but produce wrong data.  Example: the 
 tasktracker and sub-task jvm fall back to the default configuration if they 
 fail to load job.xml, which may silently replace the mapper with 
 IdentityMapper. 




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Chris Douglas (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786265#action_12786265
 ] 

Chris Douglas commented on MAPREDUCE-1114:
--

bq. Comparing the 15 second payoff to the full build time isn't particularly 
important to me. For me, the ability to quickly iterate on code while 
recompiling and rerunning unit tests is the big payoff

As a vi user, I got that. I haven't argued that the long build times are 
unimportant, but that a hack introducing a custom caching layer for classpaths 
is not, in my mind, a justifiable tradeoff in complexity. Maintaining black 
magic in the build is tedious and avoidable.

bq. the slowness is actually in the resolve task which generates the various 
classpath properties in ant

Aren't the classpaths named? Would there be a way to short-circuit the 
resolution if it created/checked for a file mapped to that path?

bq. My most common development cycle is to run a single unit test. For Avro 
this takes just a few seconds, and I'm willing to wait without finding a new 
task to work on.

As a workaround: depending on how often I'm running it, adding a {{main}} to 
the unit test is sometimes worthwhile.

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Assigned: (MAPREDUCE-1209) Move common specific part of the test TestReflectionUtils out of mapred into common

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon reassigned MAPREDUCE-1209:
--

Assignee: Todd Lipcon

 Move common specific part of the test TestReflectionUtils out of mapred into 
 common
 ---

 Key: MAPREDUCE-1209
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1209
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: test
Reporter: Vinod K V
Assignee: Todd Lipcon
Priority: Blocker
 Fix For: 0.21.0


 As commented by Tom here 
 (https://issues.apache.org/jira/browse/HADOOP-6230?focusedCommentId=12751058page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12751058),
  TestReflectionUtils has a single test testSetConf() to test backward 
 compatibility of ReflectionUtils for JobConfigurable objects. 
 TestReflectionUtils can be split into two tests - one on common and one in 
 mapred - this single test may reside in mapred till the mapred package is 
 removed.




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786268#action_12786268
 ] 

Todd Lipcon commented on MAPREDUCE-1114:


bq. Aren't the classpaths named? Would there be a way to short-circuit the 
resolution if it created/checked for a file mapped to that path?

That is exactly what this patch does...

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Updated: (MAPREDUCE-1209) Move common specific part of the test TestReflectionUtils out of mapred into common

2009-12-04 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated MAPREDUCE-1209:
---

Fix Version/s: 0.22.0
Affects Version/s: 0.22.0
   0.21.0
   Status: Patch Available  (was: Open)

 Move common specific part of the test TestReflectionUtils out of mapred into 
 common
 ---

 Key: MAPREDUCE-1209
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1209
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: test
Affects Versions: 0.21.0, 0.22.0
Reporter: Vinod K V
Assignee: Todd Lipcon
Priority: Blocker
 Fix For: 0.21.0, 0.22.0

 Attachments: mapreduce-1209.txt


 As commented by Tom here 
 (https://issues.apache.org/jira/browse/HADOOP-6230?focusedCommentId=12751058page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12751058),
  TestReflectionUtils has a single test testSetConf() to test backward 
 compatibility of ReflectionUtils for JobConfigurable objects. 
 TestReflectionUtils can be split into two tests - one on common and one in 
 mapred - this single test may reside in mapred till the mapred package is 
 removed.




[jira] Commented: (MAPREDUCE-1050) Introduce a mock object testing framework

2009-12-04 Thread Konstantin Boudnik (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786273#action_12786273
 ] 

Konstantin Boudnik commented on MAPREDUCE-1050:
---

Well, still fails on my BSD machine. The message is 
{noformat}
TestLostTaskTracker
Tests run: 1, Failures: 0, Errors: 1, Time elapsed: 0.791 sec
- Standard Output ---
2009-12-04 16:43:13,581 INFO  mapred.JobTracker (JobTracker.java:init(1334)) 
- Starting jobtracker with owner as cos and supergroup as supergroup
2009-12-04 16:43:13,587 INFO  mapred.JobTracker 
(JobTracker.java:initializeTaskMemoryRelatedConfig(4086)) - Scheduler 
configured with (memSizeForMapSlotOnJT, memSizeForReduceSlotOnJT, 
limitMaxMemForMapTasks, limitMaxMemForReduceTasks) (-1, -1, -1, -1)
2009-12-04 16:43:13,590 INFO  util.HostsFileReader 
(HostsFileReader.java:refresh(81)) - Refreshing hosts (include/exclude) list
2009-12-04 16:43:13,607 INFO  mapred.QueueConfigurationParser 
(QueueConfigurationParser.java:parseResource(170)) - Bad conf file: top-level 
element not queues
-  ---

Testcase: testLostTaskTrackerCalledAfterExpiryTime took 0.763 sec
Caused an ERROR
No queues defined 
java.lang.RuntimeException: No queues defined 
at 
org.apache.hadoop.mapred.QueueConfigurationParser.parseResource(QueueConfigurationParser.java:171)
at 
org.apache.hadoop.mapred.QueueConfigurationParser.loadResource(QueueConfigurationParser.java:163)
at 
org.apache.hadoop.mapred.QueueConfigurationParser.init(QueueConfigurationParser.java:92)
at 
org.apache.hadoop.mapred.QueueManager.getQueueConfigurationParser(QueueManager.java:126)
at org.apache.hadoop.mapred.QueueManager.init(QueueManager.java:146)
at org.apache.hadoop.mapred.JobTracker.init(JobTracker.java:1376)
at org.apache.hadoop.mapred.JobTracker.init(JobTracker.java:1325)
at 
org.apache.hadoop.mapred.TestLostTaskTracker.setUp(TestLostTaskTracker.java:58)
{noformat}

The other problem: TestLostTaskTracker is a JUnit v3 test (it extends TestCase, 
etc.). Please convert it to JUnit v4 (like the other two tests).

 Introduce a mock object testing framework
 -

 Key: MAPREDUCE-1050
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1050
 Project: Hadoop Map/Reduce
  Issue Type: Test
  Components: test
Reporter: Tom White
Assignee: Tom White
 Attachments: MAPREDUCE-1050.patch, MAPREDUCE-1050.patch, 
 MAPREDUCE-1050.patch, MAPREDUCE-1050.patch, MAPREDUCE-1050.patch


 Using mock objects in unit tests can improve code quality (see e.g. 
 http://www.mockobjects.com/). Hadoop would benefit from having a mock object 
 framework for developers to write unit tests with. Doing so will allow a 
 wider range of failure conditions to be tested and the tests will run faster.




[jira] Updated: (MAPREDUCE-744) Support in DistributedCache to share cache files with other users after HADOOP-4493

2009-12-04 Thread Devaraj Das (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated MAPREDUCE-744:
--

Attachment: 744-early.patch

Attaching a preliminary patch for review. The cache files are checked at the 
client side for public/private access, and that information (booleans - true 
for public, false for private) is passed in the configuration. The 
TaskTrackers look at the configuration for each file during localization and, 
if the file is public, localize it to a common space; otherwise the file is 
localized to the user's private directory.
A testcase is not there yet.

 Support in DistributedCache to share cache files with other users after 
 HADOOP-4493
 ---

 Key: MAPREDUCE-744
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-744
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: tasktracker
Reporter: Vinod K V
 Attachments: 744-early.patch


 HADOOP-4493 aims to completely privatize the files distributed to TT via 
 DistributedCache. This jira issue focuses on sharing some/all of these files 
 with all other users.




[jira] Commented: (MAPREDUCE-744) Support in DistributedCache to share cache files with other users after HADOOP-4493

2009-12-04 Thread Devaraj Das (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786279#action_12786279
 ] 

Devaraj Das commented on MAPREDUCE-744:
---

I should add that public means world-readable. The entire hierarchy of the 
cache file path is checked for that (starting from the leaf filename to /).
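A sketch of that rule in Python (illustrative only, not the actual Hadoop code): the file itself must carry the world-read bit, and every ancestor directory up to / must be world-readable and world-executable so that other users can traverse to it.

```python
import os
import stat
import tempfile

def is_publicly_readable(path):
    # The leaf file must carry the world-read bit...
    if not os.stat(path).st_mode & stat.S_IROTH:
        return False
    # ...and every ancestor directory up to / must be world-traversable.
    d = os.path.dirname(os.path.abspath(path))
    while True:
        mode = os.stat(d).st_mode
        if not (mode & stat.S_IROTH and mode & stat.S_IXOTH):
            return False
        parent = os.path.dirname(d)
        if parent == d:            # reached the filesystem root
            return True
        d = parent

# Demo: mkdtemp creates a 0700 directory, so even a 0644 file inside
# it is private -- exactly the "check the whole hierarchy" point.
private_dir = tempfile.mkdtemp()
cache_file = os.path.join(private_dir, "part-00000")
open(cache_file, "w").close()
os.chmod(cache_file, 0o644)
print(is_publicly_readable(cache_file))  # False: parent dir is 0700
```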

 Support in DistributedCache to share cache files with other users after 
 HADOOP-4493
 ---

 Key: MAPREDUCE-744
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-744
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: tasktracker
Reporter: Vinod K V
 Attachments: 744-early.patch


 HADOOP-4493 aims to completely privatize the files distributed to TT via 
 DistributedCache. This jira issues focuses on sharing some/all of these files 
 with all other users.




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Chris Douglas (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786305#action_12786305
 ] 

Chris Douglas commented on MAPREDUCE-1114:
--

Then I'm missing something. What is being cached?

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.




[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786309#action_12786309
 ] 

Todd Lipcon commented on MAPREDUCE-1114:


When the classpath is resolved, it's written out to a text file named for that 
variable. Then when it needs to be resolved again, if that file exists, it's 
loaded rather than re-resolving.
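A minimal sketch of that scheme (class, file name, and the stand-in resolver are illustrative; the actual patch does this with ant macros, not Java):

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Sketch of the caching idea described above: persist the resolved classpath
// to a text file, and on later runs load that file instead of re-resolving.
public class ClasspathCache {
    static final Path CACHE = Paths.get(System.getProperty("java.io.tmpdir"),
                                        "resolved-classpath.txt");

    // Stand-in for the expensive ivy:resolve + XML-parsing step.
    static String resolveClasspath() {
        return "lib/a.jar:lib/b.jar";
    }

    static String getClasspath() throws IOException {
        if (Files.exists(CACHE)) {
            return Files.readString(CACHE).trim(); // cache hit: skip resolving
        }
        String cp = resolveClasspath();            // cache miss: resolve once...
        Files.writeString(CACHE, cp);              // ...and persist for next run
        return cp;
    }

    public static void main(String[] args) throws IOException {
        Files.deleteIfExists(CACHE);
        String first = getClasspath();   // resolves and writes the cache file
        String second = getClasspath();  // served from the cache file
        System.out.println(first.equals(second)); // prints "true"
    }
}
```

The trade-off Chris raises below still applies: a stale cache file must be invalidated whenever the ivy configuration changes, which is extra build machinery to maintain.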

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1262) Eclipse Plugin does not build for Hadoop 0.20.1

2009-12-04 Thread Chris Douglas (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-1262:
-

Status: Open  (was: Patch Available)

The patch causes the build to 
[fail|http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/292/console],
 specifically:
{noformat}
 [exec] compile:
 [exec]  [echo] contrib: eclipse-plugin 
 [exec] [javac] Compiling 45 source files to 
/grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h6.grid.sp2.yahoo.net/trunk\
/build/contrib/eclipse-plugin/classes
 [exec] [javac] 
/grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h6.grid.sp2.yahoo.net/trunk/src/contrib/eclipse-plugin\

/src/java/org/apache/hadoop/eclipse/launch/HadoopApplicationLaunchShortcut.java:35:
 cannot find symbol
 [exec] [javac] symbol  : class JavaApplicationLaunchShortcut
 [exec] [javac] location: package 
org.eclipse.jdt.debug.ui.launchConfigurations
 [exec] [javac] import 
org.eclipse.jdt.debug.ui.launchConfigurations.JavaApplicationLaunchShortcut;
{noformat}

 Eclipse Plugin does not build for Hadoop 0.20.1
 ---

 Key: MAPREDUCE-1262
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1262
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1, 0.20.2, 0.21.0, 0.22.0
 Environment: SLES 10, Mac OS/X 10.5.8
Reporter: Stephen Watt
 Fix For: 0.20.2, 0.21.0, 0.22.0, 0.20.1

 Attachments: hadoop-0.20.1-eclipse-plugin.jar, HADOOP-6360.patch


 When trying to run the build script for the Eclipse Plugin in 
 src/contrib/eclipse-plugin there are several errors a user receives. The 
 first error is that the eclipse.home is not set. This is easily remedied by 
 adding a value for eclipse.home in the build.properties file in the 
 eclipse-plugin directory.
 The script then states it cannot compile 
 org.apache.hadoop.eclipse.launch.HadoopApplicationLaunchShortcut because it 
 cannot resolve JavaApplicationLaunchShortcut on line 35:
   import 
 org.eclipse.jdt.internal.debug.ui.launcher.JavaApplicationLaunchShortcut;
 and then fails.
 I believe this is because there is no jar in the eclipse.home/plugins that 
 has this class in that package. I did however find it in 
 org.eclipse.jdt.debug.ui.launchConfigurations.JavaApplicationLaunchShortcut 
 which was inside in org.eclipse.jdt.debug.ui_3.4.1.v20090811_r351.jar in the 
 plugins dir of Eclipse 3.5
 Changing the import in the class in the source to the latter allows the build 
 to complete successfully. The M/R Perspective opens and works on my SLES 10 
 Linux environment but not on my Macbook Pro. Both are running Eclipse 3.5.
 To users wanting to do the same, I built this inside Eclipse. To do that I 
 added org.eclipse.jdt.debug.ui_3.4.1.v20090811_r351.jar and 
 hadoop-0.20.1-core.jar to the ant runtime configuration classpath. I also had 
 to set the version value=0.20.1 in the build.properties. You will also need 
 to copy hadoop-0.20.1-core.jar to hadoop.home/build and commons-cli-1.2.jar 
 to hadoop.home/build/ivy/lib/Hadoop/common.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Chris Douglas (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786322#action_12786322
 ] 

Chris Douglas commented on MAPREDUCE-1114:
--

I thought the bulk of the problem was re-resolving these properties during the 
same run. Is that mistaken? The current proposal also works across runs, which 
could be helpful, but again: maintaining the build is already a pain. Adding a 
cache to a bad idea is a well established software engineering practice, but 
I'd favor either fixing our use of ivy or replacing it if middling performance 
requires this.

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1174) Sqoop improperly handles table/column names which are reserved sql words

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786324#action_12786324
 ] 

Hadoop QA commented on MAPREDUCE-1174:
--

+1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12426970/MAPREDUCE-1174.2.patch
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/167/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/167/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/167/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/167/console

This message is automatically generated.

 Sqoop improperly handles table/column names which are reserved sql words
 

 Key: MAPREDUCE-1174
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1174
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: zhiyong zhang
 Attachments: MAPREDUCE-1174.2.patch, MAPREDUCE-1174.patch


 In some databases it is legal to name tables and columns with terms that 
 overlap SQL reserved keywords (e.g., {{CREATE}}, {{table}}, etc.). In such 
 cases, the database allows you to escape the table and column names. We 
 should always escape table and column names when possible.
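One way to do this, sketched below, is ANSI-SQL double-quote escaping (class and method names are illustrative, not Sqoop's actual API; real databases differ, e.g. MySQL uses backticks):

```java
// Illustrative sketch of escaping identifiers that may collide with SQL
// reserved words: wrap the identifier in ANSI double quotes, doubling any
// embedded quote characters.
public class IdentifierEscaper {
    static String escape(String identifier) {
        return "\"" + identifier.replace("\"", "\"\"") + "\"";
    }

    public static void main(String[] args) {
        // Reserved words become legal identifiers once quoted.
        System.out.println("SELECT " + escape("CREATE")
            + " FROM " + escape("table"));
        // prints: SELECT "CREATE" FROM "table"
    }
}
```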

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1114) Speed up ivy resolution in builds with clever caching

2009-12-04 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786342#action_12786342
 ] 

Todd Lipcon commented on MAPREDUCE-1114:


Ivy already caches the resolves done in the same run, in theory, but there are 
a lot of different resolves, I think? The gain here *is* from caching between 
runs as you surmised.

 Speed up ivy resolution in builds with clever caching
 -

 Key: MAPREDUCE-1114
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1114
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.22.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor
 Attachments: mapreduce-1114.txt, mapreduce-1114.txt, 
 mapreduce-1114.txt


 An awful lot of time is spent in the ivy:resolve parts of the build, even 
 when all of the dependencies have been fetched and cached. Profiling showed 
 this was in XML parsing. I have a sort-of-ugly hack which speeds up 
 incremental compiles (and more importantly ant test) significantly using 
 some ant macros to cache the resolved classpaths.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1097) Changes/fixes to support Vertica 3.5

2009-12-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786344#action_12786344
 ] 

Hadoop QA commented on MAPREDUCE-1097:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12425767/MAPREDUCE-1097.patch
  against trunk revision 887135.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 9 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/293/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/293/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/293/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/293/console

This message is automatically generated.

 Changes/fixes to support Vertica 3.5
 

 Key: MAPREDUCE-1097
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1097
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.21.0
 Environment: Hadoop 0.21.0 pre-release and Vertica 3.5
Reporter: Omer Trajman
Assignee: Omer Trajman
Priority: Minor
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1097.patch


 Vertica 3.5 includes three changes that the formatters should handle:
 1) deploy_design function that handles much of the logic in the optimize 
 method.  This improvement uses deploy_design if the server version supports 
 it instead of orchestrating in the formatter function.
 2) truncate table instead of recreating the table
 3) numeric, decimal, money, number types (all the same path)
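The version gate in point 1 amounts to something like the following sketch (class, method, and statement text are hypothetical, not Vertica's or the patch's actual API):

```java
// Illustrative sketch of choosing deploy_design only when the server is new
// enough to support it, falling back to formatter-side orchestration otherwise.
public class VerticaOptimize {
    static boolean supportsDeployDesign(int major, int minor) {
        // deploy_design is described as new in Vertica 3.5
        return major > 3 || (major == 3 && minor >= 5);
    }

    static String optimizeMode(int major, int minor) {
        return supportsDeployDesign(major, minor)
            ? "server-side deploy_design"   // let the server orchestrate
            : "formatter orchestration";    // pre-3.5 fallback
    }

    public static void main(String[] args) {
        System.out.println(optimizeMode(3, 5)); // prints "server-side deploy_design"
        System.out.println(optimizeMode(3, 0)); // prints "formatter orchestration"
    }
}
```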

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.