[jira] Updated: (MAPREDUCE-2023) TestDFSIO read test may not read specified bytes.

2010-09-03 Thread Hong Tang (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hong Tang updated MAPREDUCE-2023:
-

Attachment: mr-2023-yahoo-hadoop-20.1xx.patch

patch for yahoo hadoop 20.1xx branch. Not to be committed.

 TestDFSIO read test may not read specified bytes.
 -

 Key: MAPREDUCE-2023
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2023
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: benchmarks
Reporter: Hong Tang
Assignee: Hong Tang
 Fix For: 0.22.0

 Attachments: mr-2023-20100826.patch, mr-2023-20100902.patch, 
 mr-2023-yahoo-hadoop-20.1xx.patch, TestFsRead.java


 TestDFSIO's read test may read less bytes than specified when reading large 
 files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2024) 5th JobTracker constructor (for simulator) uses outdated configs, duplicates code

2010-09-03 Thread Hong Tang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12905829#action_12905829
 ] 

Hong Tang commented on MAPREDUCE-2024:
--

Is this a duplicate of MAPREDUCE-1001? If so, we probably should keep only one.

 5th JobTracker constructor (for simulator) uses outdated configs, duplicates 
 code
 -

 Key: MAPREDUCE-2024
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2024
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Affects Versions: 0.22.0
Reporter: Greg Roelofs
Priority: Minor

 The fifth JobTracker constructor (JobTracker(final JobConf conf, Clock clock, 
 boolean ignoredForSimulation)) still uses the old mapred.* config settings 
 and appears to duplicate much of the code in the fourth (main) ctor.  It 
 should be modernized and, ideally, share as much code as possible with the 
 main one in order to minimize simulation drift and the potential for config 
 errors.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1582) JobClient#monitorAndPrintJob() should use logger for logging taskDiagnostics

2010-09-03 Thread Johannes Zillmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Johannes Zillmann updated MAPREDUCE-1582:
-

Attachment: MAPREDUCE-1582.patch

Attached a patch which simply replaces System.err with LOG.error on that one 
location.
This should be more consistent with the rest of the method as well because the 
method uses a lot of LOG.info and no sys.out.

 JobClient#monitorAndPrintJob() should use logger for logging taskDiagnostics
 

 Key: MAPREDUCE-1582
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1582
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: job submission
Reporter: Johannes Zillmann
Priority: Minor
 Attachments: MAPREDUCE-1582.patch


 In JobClient#monitorAndPrintJob() (around line 1324 in version 0.20.1) the 
 exception of tasktrackers are printed via system.err.
 Using LOG.error() instead would make capturing the logs easier !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1582) JobClient#monitorAndPrintJob() should use logger for logging taskDiagnostics

2010-09-03 Thread Johannes Zillmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Johannes Zillmann updated MAPREDUCE-1582:
-

Status: Patch Available  (was: Open)

 JobClient#monitorAndPrintJob() should use logger for logging taskDiagnostics
 

 Key: MAPREDUCE-1582
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1582
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: job submission
Reporter: Johannes Zillmann
Priority: Minor
 Attachments: MAPREDUCE-1582.patch


 In JobClient#monitorAndPrintJob() (around line 1324 in version 0.20.1) the 
 exception of tasktrackers are printed via system.err.
 Using LOG.error() instead would make capturing the logs easier !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1582) JobClient#monitorAndPrintJob() should use logger for logging taskDiagnostics

2010-09-03 Thread Johannes Zillmann (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12905852#action_12905852
 ] 

Johannes Zillmann commented on MAPREDUCE-1582:
--

Its the 0.20 version which is patched, not sure if it differs ffrom 0.21, 
0.22...

 JobClient#monitorAndPrintJob() should use logger for logging taskDiagnostics
 

 Key: MAPREDUCE-1582
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1582
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: job submission
Reporter: Johannes Zillmann
Priority: Minor
 Attachments: MAPREDUCE-1582.patch


 In JobClient#monitorAndPrintJob() (around line 1324 in version 0.20.1) the 
 exception of tasktrackers are printed via system.err.
 Using LOG.error() instead would make capturing the logs easier !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1918) Add documentation to Rumen

2010-09-03 Thread Amar Kamat (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-1918:
--

Attachment: mapreduce-1918-v1.7.patch

Attaching a patch that adds user and API documentation to Rumen. test-patch 
passed

 Add documentation to Rumen
 --

 Key: MAPREDUCE-1918
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1918
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: tools/rumen
Affects Versions: 0.22.0
Reporter: Amar Kamat
Assignee: Amar Kamat
 Fix For: 0.22.0

 Attachments: mapreduce-1918-v1.3.patch, mapreduce-1918-v1.4.patch, 
 mapreduce-1918-v1.7.patch, rumen.pdf, rumen.pdf


 Add forrest documentation to Rumen tool.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Assigned: (MAPREDUCE-2050) TestMRCLI fails on trunk

2010-09-03 Thread Boris Shkolnik (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boris Shkolnik reassigned MAPREDUCE-2050:
-

Assignee: Boris Shkolnik

 TestMRCLI fails on trunk
 

 Key: MAPREDUCE-2050
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2050
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: security, test
Affects Versions: 0.22.0
Reporter: Amareshwari Sriramadasu
Assignee: Boris Shkolnik
 Fix For: 0.22.0


 TestMRCLI fails with following error:
 {noformat}
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(150)) - Test ID: [1]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(151)) -Test Description: 
 [refreshServiceAcl: refreshing security authorization policy for jobtracker]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(152)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(156)) -   Test Commands: [-jt 
 JOBTRACKER -refreshServiceAcl ]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(160)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(167)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(171)) -  Comparator: 
 [ExactComparator]
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(173)) -  Comparision result:   
 [fail]
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(175)) - Expected output:   []
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(177)) -   Actual output:   
 [refreshServiceAcl: Kerberos service principal name isn't configured properly 
 (should have 3 parts): 
 {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2050) TestMRCLI fails on trunk

2010-09-03 Thread Boris Shkolnik (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12905981#action_12905981
 ] 

Boris Shkolnik commented on MAPREDUCE-2050:
---

I think it is related to Client callinb ConnectionId.getRemotePrincipal()  for 
non-secure cluster.
This call should be conditioned by UGI.securityEnabled() constrain.

So the actual fix will be in HADOOP-COMMON.

 TestMRCLI fails on trunk
 

 Key: MAPREDUCE-2050
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2050
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: security, test
Affects Versions: 0.22.0
Reporter: Amareshwari Sriramadasu
Assignee: Boris Shkolnik
 Fix For: 0.22.0


 TestMRCLI fails with following error:
 {noformat}
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(150)) - Test ID: [1]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(151)) -Test Description: 
 [refreshServiceAcl: refreshing security authorization policy for jobtracker]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(152)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(156)) -   Test Commands: [-jt 
 JOBTRACKER -refreshServiceAcl ]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(160)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(167)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(171)) -  Comparator: 
 [ExactComparator]
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(173)) -  Comparision result:   
 [fail]
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(175)) - Expected output:   []
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(177)) -   Actual output:   
 [refreshServiceAcl: Kerberos service principal name isn't configured properly 
 (should have 3 parts): 
 {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2050) TestMRCLI fails on trunk

2010-09-03 Thread Kan Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12905986#action_12905986
 ] 

Kan Zhang commented on MAPREDUCE-2050:
--

I ran TestMRCLI with the latest patch in HADOOP-6938, it passed. 

 TestMRCLI fails on trunk
 

 Key: MAPREDUCE-2050
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2050
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: security, test
Affects Versions: 0.22.0
Reporter: Amareshwari Sriramadasu
Assignee: Boris Shkolnik
 Fix For: 0.22.0


 TestMRCLI fails with following error:
 {noformat}
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(150)) - Test ID: [1]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(151)) -Test Description: 
 [refreshServiceAcl: refreshing security authorization policy for jobtracker]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(152)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(156)) -   Test Commands: [-jt 
 JOBTRACKER -refreshServiceAcl ]
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(160)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(167)) - 
 2010-09-02 14:23:25,974 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(171)) -  Comparator: 
 [ExactComparator]
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(173)) -  Comparision result:   
 [fail]
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(175)) - Expected output:   []
 2010-09-02 14:23:25,975 INFO  cli.CLITestHelper 
 (CLITestHelper.java:displayResults(177)) -   Actual output:   
 [refreshServiceAcl: Kerberos service principal name isn't configured properly 
 (should have 3 parts): 
 {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1918) Add documentation to Rumen

2010-09-03 Thread Hong Tang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12905998#action_12905998
 ] 

Hong Tang commented on MAPREDUCE-1918:
--

A few minor nits:
* Incase = in case
* For TraceBuilder, does it descend recursively into the input foloder, or do 
we need to specify the immediate parent directory that contains the files? 
* Can we add a bit more details on demuxer? How about the following?
bq. Demuxer decides how the input file maps to jobhistory file(s). [insert]Job 
history logs and job conf files are typically small files, and can be more 
effectively stored if we embed them in some container file format like 
SequenceFile or TFile. To support such usage cases, one can specify a 
customized Demuxer class that can extract individual job history logs and job 
conf files from source files. [/insert]
* There is no need to do canParse() check if you know which parser to use 
(hence no need to use ris). The parser will (or should) simply abort if the 
source is not of the expected version.
* VersionDetector seems rather internal, getParser() is probably what users 
should care about.



 Add documentation to Rumen
 --

 Key: MAPREDUCE-1918
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1918
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: tools/rumen
Affects Versions: 0.22.0
Reporter: Amar Kamat
Assignee: Amar Kamat
 Fix For: 0.22.0

 Attachments: mapreduce-1918-v1.3.patch, mapreduce-1918-v1.4.patch, 
 mapreduce-1918-v1.7.patch, rumen.pdf, rumen.pdf


 Add forrest documentation to Rumen tool.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (MAPREDUCE-2052) Fix URL encoding of job history logfiles

2010-09-03 Thread Patrick Angeles (JIRA)
Fix URL encoding of job history logfiles


 Key: MAPREDUCE-2052
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2052
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.2, 0.20.1
Reporter: Patrick Angeles


MAPREDUCE-1378 attempts to address this issue but sometimes results in 
doubly-URL-encoded logFile paths.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-2052) Fix URL encoding of job history logfiles

2010-09-03 Thread Patrick Angeles (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Angeles updated MAPREDUCE-2052:
---

Status: Patch Available  (was: Open)

This patch pre-empts MAPREDUCE-1378. It should apply cleanly to branch-20 as 
the patch for 1378 has not been applied. A slightly different patch would have 
to be ported to 0.21. However, the jobhistory.jsp code has been completely 
overhauled as of trunk.




 Fix URL encoding of job history logfiles
 

 Key: MAPREDUCE-2052
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2052
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.2, 0.20.1
Reporter: Patrick Angeles

 MAPREDUCE-1378 attempts to address this issue but sometimes results in 
 doubly-URL-encoded logFile paths.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-2052) Fix URL encoding of job history logfiles

2010-09-03 Thread Patrick Angeles (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Angeles updated MAPREDUCE-2052:
---

Attachment: mapred-2052-0.20-patch.txt

 Fix URL encoding of job history logfiles
 

 Key: MAPREDUCE-2052
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2052
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1, 0.20.2
Reporter: Patrick Angeles
 Attachments: mapred-2052-0.20-patch.txt


 MAPREDUCE-1378 attempts to address this issue but sometimes results in 
 doubly-URL-encoded logFile paths.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2024) 5th JobTracker constructor (for simulator) uses outdated configs, duplicates code

2010-09-03 Thread Greg Roelofs (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12906035#action_12906035
 ] 

Greg Roelofs commented on MAPREDUCE-2024:
-

bq. Is this a duplicate of MAPREDUCE-1001? If so, we probably should keep only 
one.

It certainly appears to be.  Thanks, I'll close it as a dup.

 5th JobTracker constructor (for simulator) uses outdated configs, duplicates 
 code
 -

 Key: MAPREDUCE-2024
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2024
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Affects Versions: 0.22.0
Reporter: Greg Roelofs
Priority: Minor

 The fifth JobTracker constructor (JobTracker(final JobConf conf, Clock clock, 
 boolean ignoredForSimulation)) still uses the old mapred.* config settings 
 and appears to duplicate much of the code in the fourth (main) ctor.  It 
 should be modernized and, ideally, share as much code as possible with the 
 main one in order to minimize simulation drift and the potential for config 
 errors.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Resolved: (MAPREDUCE-2024) 5th JobTracker constructor (for simulator) uses outdated configs, duplicates code

2010-09-03 Thread Greg Roelofs (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Greg Roelofs resolved MAPREDUCE-2024.
-

Resolution: Duplicate

 5th JobTracker constructor (for simulator) uses outdated configs, duplicates 
 code
 -

 Key: MAPREDUCE-2024
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2024
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Affects Versions: 0.22.0
Reporter: Greg Roelofs
Priority: Minor

 The fifth JobTracker constructor (JobTracker(final JobConf conf, Clock clock, 
 boolean ignoredForSimulation)) still uses the old mapred.* config settings 
 and appears to duplicate much of the code in the fourth (main) ctor.  It 
 should be modernized and, ideally, share as much code as possible with the 
 main one in order to minimize simulation drift and the potential for config 
 errors.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1001) Reducing code duplication in Mumak

2010-09-03 Thread Greg Roelofs (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12906039#action_12906039
 ] 

Greg Roelofs commented on MAPREDUCE-1001:
-

Hong's maintenance concerns have been borne out; here's my comment from 
(duplicate) MAPREDUCE-2024:

bq. The fifth JobTracker constructor (JobTracker(final JobConf conf, Clock 
clock, boolean ignoredForSimulation)) still uses the old mapred.* config 
settings and appears to duplicate much of the code in the fourth (main) ctor. 
It should be modernized and, ideally, share as much code as possible with the 
main one in order to minimize simulation drift and the potential for config 
errors.

(In case it's not clear, the other constructor has already been modified to use 
the newer mapreduce.* config settings.  Thus the config settings that work 
for one will have no effect on the other unless they're manually duplicated.)

 Reducing code duplication in Mumak
 --

 Key: MAPREDUCE-1001
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1001
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Hong Tang

 The first release of Mumak (MAPREDUCE-728) introduces some coupling between 
 the core mapred code with Mumak code. Specifically, New constructors are 
 added to JobTracker and JobInProgress to allow simulator to subclass and 
 alter JT/JIP behavior. This could be a code maintenance overhead when new 
 changes have to be ported to either the added constructors or the simulation 
 subclasses.
 It would be nice to refactor the constructors of JobTracker and JobInProgress 
 to avoid as much code duplication as possible.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-1001) Reducing code duplication in Mumak

2010-09-03 Thread Hong Tang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12906079#action_12906079
 ] 

Hong Tang commented on MAPREDUCE-1001:
--

+1 for fixing it!

 Reducing code duplication in Mumak
 --

 Key: MAPREDUCE-1001
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1001
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Hong Tang

 The first release of Mumak (MAPREDUCE-728) introduces some coupling between 
 the core mapred code with Mumak code. Specifically, New constructors are 
 added to JobTracker and JobInProgress to allow simulator to subclass and 
 alter JT/JIP behavior. This could be a code maintenance overhead when new 
 changes have to be ported to either the added constructors or the simulation 
 subclasses.
 It would be nice to refactor the constructors of JobTracker and JobInProgress 
 to avoid as much code duplication as possible.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Assigned: (MAPREDUCE-1233) Incorrect Waiting maps/reduces in Jobtracker metrics

2010-09-03 Thread Luke Lu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Luke Lu reassigned MAPREDUCE-1233:
--

Assignee: Luke Lu

 Incorrect Waiting maps/reduces in Jobtracker metrics 
 -

 Key: MAPREDUCE-1233
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1233
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Reporter: V.Karthikeyan
Assignee: Luke Lu

 Waiting Maps/Reduces are incorrect in Jobtracker metrics when a job fails. 
 when a map/reduce fails(during job failure), waiting maps/reduce got 
 incremented and doesn't get decremented even after job cleanup.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1233) Incorrect Waiting maps/reduces in Jobtracker metrics

2010-09-03 Thread Luke Lu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Luke Lu updated MAPREDUCE-1233:
---

Attachment: mr-1233-y20s-v1.patch

Patch for y20.200 branch. Trunk doesn't seem to need the patch due to 
MAPREDUCE-1152, where killed tasks don't cause waiting tasks to be incremented.

 Incorrect Waiting maps/reduces in Jobtracker metrics 
 -

 Key: MAPREDUCE-1233
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1233
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Reporter: V.Karthikeyan
Assignee: Luke Lu
 Attachments: mr-1233-y20s-v1.patch


 Waiting Maps/Reduces are incorrect in Jobtracker metrics when a job fails. 
 when a map/reduce fails(during job failure), waiting maps/reduce got 
 incremented and doesn't get decremented even after job cleanup.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-1938) Ability for having user's classes take precedence over the system classes for tasks' classpath

2010-09-03 Thread Krishna Ramachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krishna Ramachandran updated MAPREDUCE-1938:


Attachment: mapred-1938-3.patch

revised per previous comment 
+ Luke's test case review - vector is API requirement - can't modify

Yes there will be a separate jira for bin/hadoop client command line changes

 Ability for having user's classes take precedence over the system classes for 
 tasks' classpath
 --

 Key: MAPREDUCE-1938
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1938
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: job submission, task, tasktracker
Reporter: Devaraj Das
Assignee: Krishna Ramachandran
 Fix For: 0.22.0

 Attachments: mapred-1938-1, mapred-1938-2.patch, mapred-1938-3.patch, 
 mr-1938-bp20.1.patch, mr-1938-bp20.patch


 It would be nice to have the ability in MapReduce to allow users to specify 
 for their jobs alternate implementations of classes that are already defined 
 in the MapReduce libraries. For example, an alternate implementation for 
 CombineFileInputFormat. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.