[jira] [Commented] (MAPREDUCE-6674) configure parallel tests for mapreduce-client-jobclient

2017-08-02 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16112081#comment-16112081
 ] 

Hadoop QA commented on MAPREDUCE-6674:
--

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue}  0m 
36s{color} | {color:blue} Docker mode activated. {color} |
|| || || || {color:brown} Prechecks {color} ||
| {color:green}+1{color} | {color:green} @author {color} | {color:green}  0m  
0s{color} | {color:green} The patch does not contain any @author tags. {color} |
| {color:red}-1{color} | {color:red} test4tests {color} | {color:red}  0m  
0s{color} | {color:red} The patch doesn't appear to include any new or modified 
tests. Please justify why no new tests are needed for this patch. Also please 
list what manual steps were performed to verify this patch. {color} |
|| || || || {color:brown} trunk Compile Tests {color} ||
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 16m 
11s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  0m 
27s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green}  0m 
31s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green}  0m 
13s{color} | {color:green} trunk passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green}  0m 
25s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  0m 
25s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green}  0m 
25s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green}  0m 
26s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green}  0m 
 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} xml {color} | {color:green}  0m  
1s{color} | {color:green} The patch has no ill-formed XML file. {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green}  0m 
11s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:red}-1{color} | {color:red} unit {color} | {color:red} 27m 17s{color} 
| {color:red} hadoop-mapreduce-client-jobclient in the patch failed. {color} |
| {color:red}-1{color} | {color:red} asflicense {color} | {color:red}  0m 
22s{color} | {color:red} The patch generated 6 ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 47m 40s{color} | 
{color:black} {color} |
\\
\\
|| Reason || Tests ||
| Failed junit tests | hadoop.fs.slive.TestSlive |
|   | hadoop.mapreduce.TestMapReduce |
|   | hadoop.mapred.TestUserDefinedCounters |
|   | hadoop.mapred.TestReporter |
|   | hadoop.mapreduce.v2.TestMRJobs |
|   | hadoop.fs.TestFileSystem |
|   | hadoop.mapred.lib.TestChainMapReduce |
|   | hadoop.mapreduce.TestMapCollection |
|   | hadoop.mapreduce.lib.chain.TestSingleElementChain |
|   | hadoop.mapreduce.lib.input.TestLineRecordReaderJobs |
|   | hadoop.mapred.lib.TestKeyFieldBasedComparator |
|   | hadoop.mapred.lib.TestMultithreadedMapRunner |
|   | hadoop.mapred.TestMROpportunisticMaps |
|   | hadoop.mapreduce.TestLargeSort |
|   | hadoop.mapreduce.lib.jobcontrol.TestMapReduceJobControl |
|   | hadoop.mapred.TestCollect |
|   | hadoop.fs.TestDFSIO |
|   | hadoop.mapreduce.TestMapperReducerCleanup |
|   | hadoop.mapreduce.lib.fieldsel.TestMRFieldSelection |
|   | hadoop.mapreduce.v2.TestMRJobsWithProfiler |
|   | hadoop.mapred.TestSpecialCharactersInOutputPath |
|   | hadoop.conf.TestNoDefaultsJobConf |
|   | hadoop.mapred.TestJobCounters |
|   | hadoop.mapred.TestJobCleanup |
|   | hadoop.mapreduce.lib.output.TestMRMultipleOutputs |
|   | hadoop.mapreduce.TestMROutputFormat |
|   | hadoop.mapreduce.TestMRJobClient |
|   | hadoop.mapred.lib.aggregate.TestAggregates |
|   | hadoop.mapred.TestComparators |
|   | hadoop.mapreduce.v2.TestSpeculativeExecution |
|   | hadoop.mapreduce.lib.input.TestMultipleInputs |
|   | hadoop.mapred.TestMapRed |
|   | hadoop.mapred.TestFileOutputFormat |
|   | hadoop.mapreduce.TestValueIterReset |
|   | hadoop.mapred.pipes.TestPipeApplication |
|   | hadoop.mapred.TestLineRecordReaderJobs |
|   | hadoop.mapred.TestOldCombinerGrouping |
|   | hadoop.mapreduce.TestNewCombinerGrouping |
|   | hadoop.mapreduce.lib.chain.TestChainErrors |
|   | hadoop.hdfs.TestNNBench |
|   | hadoop.mapreduce.lib.aggregate.TestMapReduceAggregates |
|   | hadoop.mapreduce

[jira] [Updated] (MAPREDUCE-6674) configure parallel tests for mapreduce-client-jobclient

2017-08-02 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-6674:

Attachment: MAPREDUCE-6674.01.patch

-01:
* rebase to see where we are at

> configure parallel tests for mapreduce-client-jobclient
> ---
>
> Key: MAPREDUCE-6674
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6674
> Project: Hadoop Map/Reduce
>  Issue Type: Test
>  Components: test
>Affects Versions: 3.0.0-alpha1
>Reporter: Allen Wittenauer
>Priority: Critical
> Attachments: MAPREDUCE-6674.00.patch, MAPREDUCE-6674.01.patch
>
>
> mapreduce-client-jobclient takes almost an hour and a half.  Configuring 
> parallel-tests would greatly reduce this run time.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6674) configure parallel tests for mapreduce-client-jobclient

2017-08-02 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111996#comment-16111996
 ] 

Hadoop QA commented on MAPREDUCE-6674:
--

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue}  0m  
0s{color} | {color:blue} Docker mode activated. {color} |
| {color:red}-1{color} | {color:red} patch {color} | {color:red}  0m  5s{color} 
| {color:red} MAPREDUCE-6674 does not apply to trunk. Rebase required? Wrong 
Branch? See https://wiki.apache.org/hadoop/HowToContribute for help. {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | MAPREDUCE-6674 |
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/12798342/MAPREDUCE-6674.00.patch
 |
| Console output | 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7040/console |
| Powered by | Apache Yetus 0.5.0   http://yetus.apache.org |


This message was automatically generated.



> configure parallel tests for mapreduce-client-jobclient
> ---
>
> Key: MAPREDUCE-6674
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6674
> Project: Hadoop Map/Reduce
>  Issue Type: Test
>  Components: test
>Affects Versions: 3.0.0-alpha1
>Reporter: Allen Wittenauer
>Priority: Critical
> Attachments: MAPREDUCE-6674.00.patch
>
>
> mapreduce-client-jobclient takes almost an hour and a half.  Configuring 
> parallel-tests would greatly reduce this run time.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6931) Fix TestDFSIO "Total Throughput" calculation

2017-08-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111987#comment-16111987
 ] 

ASF GitHub Bot commented on MAPREDUCE-6931:
---

GitHub user dennishuo opened a pull request:

https://github.com/apache/hadoop/pull/259

MAPREDUCE-6931. Fix TestDFSIO "Total Throughput" calculation.

Previously it failed to convert ms to seconds and thus reports aggregate
throughput as 1/1000x actual numbers. Also, make all the bytes-to-mb
and milliseconds-to-seconds conversions consistent in the reporting
messages to help avoid this type of error in the future.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/dennishuo/hadoop trunk

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hadoop/pull/259.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #259


commit 23a374d6e818004daa0b1cf16c4fedb439ab4943
Author: Dennis Huo 
Date:   2017-08-02T21:30:51Z

MAPREDUCE-6931. Fix TestDFSIO "Total Throughput" calculation.

Previously it failed to convert ms to seconds and thus reports aggregate
throughput as 1/1000x actual numbers. Also, make all the bytes-to-mb
and milliseconds-to-seconds conversions consistent in the reporting
messages to help avoid this type of error in the future.




> Fix TestDFSIO "Total Throughput" calculation
> 
>
> Key: MAPREDUCE-6931
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6931
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: benchmarks, test
>Affects Versions: 2.8.0
>Reporter: Dennis Huo
>Priority: Trivial
>
> The new "Total Throughput" line added in 
> https://issues.apache.org/jira/browse/HDFS-9153 is currently calculated as 
> {{toMB(size) / ((float)execTime)}} and claims to be in units of "MB/s", but 
> {{execTime}} is in milliseconds; thus, the reported number is 1/1000x the 
> actual value:
> {code:java}
> String resultLines[] = {
> "- TestDFSIO - : " + testType,
> "Date & time: " + new Date(System.currentTimeMillis()),
> "Number of files: " + tasks,
> " Total MBytes processed: " + df.format(toMB(size)),
> "  Throughput mb/sec: " + df.format(size * 1000.0 / (time * 
> MEGA)),
> "Total Throughput mb/sec: " + df.format(toMB(size) / 
> ((float)execTime)),
> " Average IO rate mb/sec: " + df.format(med),
> "  IO rate std deviation: " + df.format(stdDev),
> " Test exec time sec: " + df.format((float)execTime / 1000),
> "" };
> {code}
> The different calculated fields can also use toMB and a shared 
> milliseconds-to-seconds conversion to make it easier to keep units consistent.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Created] (MAPREDUCE-6931) Fix TestDFSIO "Total Throughput" calculation

2017-08-02 Thread Dennis Huo (JIRA)
Dennis Huo created MAPREDUCE-6931:
-

 Summary: Fix TestDFSIO "Total Throughput" calculation
 Key: MAPREDUCE-6931
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6931
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: benchmarks, test
Affects Versions: 2.8.0
Reporter: Dennis Huo
Priority: Trivial


The new "Total Throughput" line added in 
https://issues.apache.org/jira/browse/HDFS-9153 is currently calculated as 
{{toMB(size) / ((float)execTime)}} and claims to be in units of "MB/s", but 
{{execTime}} is in milliseconds; thus, the reported number is 1/1000x the 
actual value:

{code:java}
String resultLines[] = {
"- TestDFSIO - : " + testType,
"Date & time: " + new Date(System.currentTimeMillis()),
"Number of files: " + tasks,
" Total MBytes processed: " + df.format(toMB(size)),
"  Throughput mb/sec: " + df.format(size * 1000.0 / (time * MEGA)),
"Total Throughput mb/sec: " + df.format(toMB(size) / ((float)execTime)),
" Average IO rate mb/sec: " + df.format(med),
"  IO rate std deviation: " + df.format(stdDev),
" Test exec time sec: " + df.format((float)execTime / 1000),
"" };
{code}

The different calculated fields can also use toMB and a shared 
milliseconds-to-seconds conversion to make it easier to keep units consistent.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Created] (MAPREDUCE-6930) mapreduce.map.cpu.vcores and mapreduce.reduce.cpu.vcores are both present twice in mapred-default.xml

2017-08-02 Thread Daniel Templeton (JIRA)
Daniel Templeton created MAPREDUCE-6930:
---

 Summary: mapreduce.map.cpu.vcores and mapreduce.reduce.cpu.vcores 
are both present twice in mapred-default.xml
 Key: MAPREDUCE-6930
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6930
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 3.0.0-alpha4
Reporter: Daniel Templeton


The second set should be deleted.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Updated] (MAPREDUCE-6929) TimelineV2Client hangs MR AM

2017-08-02 Thread Vrushali C (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vrushali C updated MAPREDUCE-6929:
--
Summary: TimelineV2Client hangs MR AM  (was: TimelineClient hangs MR AM)

> TimelineV2Client hangs MR AM
> 
>
> Key: MAPREDUCE-6929
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6929
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mr-am
>Reporter: Haibo Chen
>
> I happened to misconfigure ATSv2 settings, that is, I enabled emitting to 
> ATSv2 in MR and did not start YARN with ATSv2. The job was stuck after it 
> finished all its work. 
> Noticed that in MRAppMaster, TimelineClient is never added as a service, 
> which I think is why the AM was hanging. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6929) TimelineClient hangs MR AM

2017-08-02 Thread Haibo Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111769#comment-16111769
 ] 

Haibo Chen commented on MAPREDUCE-6929:
---

Yeah, that what I expected also. I did not collect the logs, unfortunately. I 
did remember seeing TimelineClient related failures in MRAM log. 
I can upload the logs later if I manage to reproduce it. In any case, we should 
add TimelineClient as a service under the AM service, right? That way, when AM 
receives a signal to exit, TimelineClient can also be stopped in serviceStop()

> TimelineClient hangs MR AM
> --
>
> Key: MAPREDUCE-6929
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6929
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mr-am
>Reporter: Haibo Chen
>
> I happened to misconfigure ATSv2 settings, that is, I enabled emitting to 
> ATSv2 in MR and did not start YARN with ATSv2. The job was stuck after it 
> finished all its work. 
> Noticed that in MRAppMaster, TimelineClient is never added as a service, 
> which I think is why the AM was hanging. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Assigned] (MAPREDUCE-6923) YARN Shuffle I/O for small partitions

2017-08-02 Thread Ravi Prakash (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Prakash reassigned MAPREDUCE-6923:
---

Assignee: Robert Schmidtke

> YARN Shuffle I/O for small partitions
> -
>
> Key: MAPREDUCE-6923
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6923
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
> Environment: Observed in Hadoop 2.7.3 and above (judging from the 
> source code of future versions), and Ubuntu 16.04.
>Reporter: Robert Schmidtke
>Assignee: Robert Schmidtke
>
> When a job configuration results in small partitions read by each reducer 
> from each mapper (e.g. 65 kilobytes as in my setup: a 
> [TeraSort|https://github.com/apache/hadoop/blob/branch-2.7.3/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/terasort/TeraSort.java]
>  of 256 gigabytes using 2048 mappers and reducers each), and setting
> {code:xml}
> 
>   mapreduce.shuffle.transferTo.allowed
>   false
> 
> {code}
> then the default setting of
> {code:xml}
> 
>   mapreduce.shuffle.transfer.buffer.size
>   131072
> 
> {code}
> results in almost 100% overhead in reads during shuffle in YARN, because for 
> each 65K needed, 128K are read.
> I propose a fix in 
> [FadvisedFileRegion.java|https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/FadvisedFileRegion.java#L114]
>  as follows:
> {code:java}
> ByteBuffer byteBuffer = ByteBuffer.allocate(Math.min(this.shuffleBufferSize, 
> trans > Integer.MAX_VALUE ? Integer.MAX_VALUE : (int) trans));
> {code}
> e.g. 
> [here|https://github.com/apache/hadoop/compare/branch-2.7.3...robert-schmidtke:adaptive-shuffle-buffer].
>  This sets the shuffle buffer size to the minimum value of the shuffle buffer 
> size specified in the configuration (128K by default), and the actual 
> partition size (65K on average in my setup). In my benchmarks this reduced 
> the read overhead in YARN from about 100% (255 additional gigabytes as 
> described above) down to about 18% (an additional 45 gigabytes). The runtime 
> of the job remained the same in my setup.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6923) YARN Shuffle I/O for small partitions

2017-08-02 Thread Ravi Prakash (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111759#comment-16111759
 ] 

Ravi Prakash commented on MAPREDUCE-6923:
-

Hi Robert! Thanks for filing the JIRA and your contribution! I'm adding you as 
a contributor and assigning the JIRA to you. Could you please post a patch file 
to this JIRA? You can name the patch file MAPREDUCE\-6923.00.patch . One minor 
nit is that we limit lines to 80 characters. Could you please fix that in the 
patch file?
Also since {{trans}} is [guaranteed to be 
positive|https://github.com/apache/hadoop/blob/12e44e7bdaf53d3720a89d32f0cc2717241bd6b2/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/FadvisedFileRegion.java#L103]
 and {{shuffleBufferSize}} is an integer, maybe we don't really need the 
ternary operator condition? Up to you to keep it though. 

I'm not surprised that there isn't an improvement in job performance but the 
read overhead improvement is great. Do you know where the 18% overhead is going?

The patch sounds reasonable to me. [~jlowe], [~nikola.vujic], [~cnauroth] do 
you have any comments? The diff he's proposing is in the link to the word "e.g. 
[here|https://github.com/apache/hadoop/compare/branch-2.7.3...robert-schmidtke:adaptive-shuffle-buffer]";
 .

> YARN Shuffle I/O for small partitions
> -
>
> Key: MAPREDUCE-6923
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6923
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
> Environment: Observed in Hadoop 2.7.3 and above (judging from the 
> source code of future versions), and Ubuntu 16.04.
>Reporter: Robert Schmidtke
>
> When a job configuration results in small partitions read by each reducer 
> from each mapper (e.g. 65 kilobytes as in my setup: a 
> [TeraSort|https://github.com/apache/hadoop/blob/branch-2.7.3/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/terasort/TeraSort.java]
>  of 256 gigabytes using 2048 mappers and reducers each), and setting
> {code:xml}
> 
>   mapreduce.shuffle.transferTo.allowed
>   false
> 
> {code}
> then the default setting of
> {code:xml}
> 
>   mapreduce.shuffle.transfer.buffer.size
>   131072
> 
> {code}
> results in almost 100% overhead in reads during shuffle in YARN, because for 
> each 65K needed, 128K are read.
> I propose a fix in 
> [FadvisedFileRegion.java|https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/FadvisedFileRegion.java#L114]
>  as follows:
> {code:java}
> ByteBuffer byteBuffer = ByteBuffer.allocate(Math.min(this.shuffleBufferSize, 
> trans > Integer.MAX_VALUE ? Integer.MAX_VALUE : (int) trans));
> {code}
> e.g. 
> [here|https://github.com/apache/hadoop/compare/branch-2.7.3...robert-schmidtke:adaptive-shuffle-buffer].
>  This sets the shuffle buffer size to the minimum value of the shuffle buffer 
> size specified in the configuration (128K by default), and the actual 
> partition size (65K on average in my setup). In my benchmarks this reduced 
> the read overhead in YARN from about 100% (255 additional gigabytes as 
> described above) down to about 18% (an additional 45 gigabytes). The runtime 
> of the job remained the same in my setup.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6927) MR job should only set tracking url if history was successfully written

2017-08-02 Thread Jason Lowe (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111744#comment-16111744
 ] 

Jason Lowe commented on MAPREDUCE-6927:
---

Thanks for the report and patch!  It would be helpful if you elaborated a bit 
more in the JIRA description about the problem this is fixing and how it 
manifests in practice.

I think we should set the tracking URL as soon as the .jhist file is 
successfully put in the done folder which is a bit earlier than where it is now 
in the patch.  Even if we fail to get the conf to done, there should be some 
MapReduce-specific history once the .jhist file is there.  Also as written in 
the patch, there could be a case where we "successfully" complete 
processDoneFiles but nothing was actually placed in the done directory (i.e.: 
for whatever reason mi.getHistoryFile() == null).

It would be nice if the unit test verified the job history URL was correctly 
rather than just any string at all.  I think the test should also verify that 
even if the job history event handler receives a job finished event but is 
unable to complete moving the history to the done directory that it does not 
set the tracking URL.


> MR job should only set tracking url if history was successfully written
> ---
>
> Key: MAPREDUCE-6927
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6927
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Eric Badger
>Assignee: Eric Badger
> Attachments: MAPREDUCE-6927.001.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6929) TimelineClient hangs MR AM

2017-08-02 Thread Vrushali C (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111724#comment-16111724
 ] 

Vrushali C commented on MAPREDUCE-6929:
---

Hmm, so the NMs did not have timeline collector as an auxiliary service but the 
AM was trying to connect to timeline service? It should retry for a few times 
(30 I think by default) and then just "complete". Do you have any logs? 

> TimelineClient hangs MR AM
> --
>
> Key: MAPREDUCE-6929
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6929
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mr-am
>Reporter: Haibo Chen
>
> I happened to misconfigure ATSv2 settings, that is, I enabled emitting to 
> ATSv2 in MR and did not start YARN with ATSv2. The job was stuck after it 
> finished all its work. 
> Noticed that in MRAppMaster, TimelineClient is never added as a service, 
> which I think is why the AM was hanging. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Created] (MAPREDUCE-6929) TimelineClient hangs MR AM

2017-08-02 Thread Haibo Chen (JIRA)
Haibo Chen created MAPREDUCE-6929:
-

 Summary: TimelineClient hangs MR AM
 Key: MAPREDUCE-6929
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6929
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mr-am
Reporter: Haibo Chen


I happened to misconfigure ATSv2 settings, that is, I enabled emitting to ATSv2 
in MR and did not start YARN with ATSv2. The job was stuck after it finished 
all its work. 

Noticed that in MRAppMaster, TimelineClient is never added as a service, which 
I think is why the AM was hanging. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6914) Tests use assertTrue(....equals(...)) instead of assertEquals()

2017-08-02 Thread Maya Wexler (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111690#comment-16111690
 ] 

Maya Wexler commented on MAPREDUCE-6914:


+1 (non-binding)

I verified that all of the test clauses are still valid. Code looks cleaner 
with assertEquals().

> Tests use assertTrue(equals(...)) instead of assertEquals()
> ---
>
> Key: MAPREDUCE-6914
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6914
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: test
>Affects Versions: 2.8.1, 3.0.0-alpha4
>Reporter: Daniel Templeton
>Assignee: Daniel Templeton
>Priority: Minor
> Attachments: MAPREDUCE-6914.001.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6892) Issues with the count of failed/killed tasks in the jhist file

2017-08-02 Thread Haibo Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111628#comment-16111628
 ] 

Haibo Chen commented on MAPREDUCE-6892:
---

bq. 3. Looking at the toTimelineEvent() methods, those are a bit messed up too. 
As I mentioned above, it has prop. keys like NUM_MAPS and FINISHED_MAPS, but 
both contain the same value. So either we rename NUM_MAPS to SUCCEEDED_MAPS and 
delete FINISHED_MAPS or NUM_MAPS should be succeeded+failed+killed and rename 
FINISHED_MAPS to SUCCEEDED_MAPS.
TimelineServiceV2 related changes (new fields and renames) are fine as long as 
we do it before 3.0 beta1, from which we'll need to consider compatibility. So 
let's also include the changes needed here for ATSv2.

bq. 1. I can add new fields to the Avro schema and keep the old ones, but it 
adds to the complexity, because we have to support both pairs
bq. 2. We probably also have to check whether "finished" or "succeeded" values 
are defined in the jhist file, but not both
we can keep the old names and only add new fields, so we continue to use 
'finished' as succeeded. In this case, there won't be a new field, succeeded.

A few more minor comments on the 2rd patch:
1) In JobHistoryEventHandler when we handles Job_KILL event, can we also add 
killedMappers + killedReducers in the summary as well?
2) there is one line above JobImpl.unsuccessfulFinish() which is unnecessary.
3) In JobImpl.InternalTerminationTransition, can we include all killed/failed 
counters instead of zeroes?
4) JobImpl#L1959 is more than 80 characters
5) In Job20LineHistoryEventEmitter, can we also parse the killed + failed 
counters instead of passing -1s?
6) CompletedJob: instead of returning 0s for the new counters, we can return 
jobInfo.get*() that we get from parsing .jhist files.
7) In TestJobHistoryParser.testHistoryParsingForKilledAndFailedAttempts(), 
verification of noOffailedAttempts seems unnecessary.
In fact, the total # depends on job configuration. Can we remove the related 
code?


> Issues with the count of failed/killed tasks in the jhist file
> --
>
> Key: MAPREDUCE-6892
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6892
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: client, jobhistoryserver
>Reporter: Peter Bacsko
>Assignee: Peter Bacsko
> Attachments: MAPREDUCE-6892-001.patch, MAPREDUCE-6892-002.PATCH
>
>
> Recently we encountered some issues with the value of failed tasks. After 
> parsing the jhist file, {{JobInfo.getFailedMaps()}} returned 0, but actually 
> there were failures. 
> Another minor thing is that you cannot get the number of killed tasks 
> (although this can be calculated).
> The root cause is that {{JobUnsuccessfulCompletionEvent}} contains only the 
> successful map/reduce task counts. Number of failed (or killed) tasks are not 
> stored.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Created] (MAPREDUCE-6928) BackupStore Should Checksum Files Persisted To Disk

2017-08-02 Thread BELUGA BEHR (JIRA)
BELUGA BEHR created MAPREDUCE-6928:
--

 Summary: BackupStore Should Checksum Files Persisted To Disk
 Key: MAPREDUCE-6928
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6928
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 3.0.0-alpha4, 2.8.1
Reporter: BELUGA BEHR
Priority: Minor


The class {{org.apache.hadoop.mapred.BackupStore}} persists data to memory or 
disk.  Like the {{IFile}} format that this class works in tandem with, the data 
in {{BackupStore}} should be under a check sum to prevent otherwise good data 
from becoming corrupt.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6870) Add configuration for MR job to finish when all reducers are complete (even with unfinished mappers)

2017-08-02 Thread Haibo Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111487#comment-16111487
 ] 

Haibo Chen commented on MAPREDUCE-6870:
---

bq. 1. Variable names (is preemptMappersOnReduceFinish good?)
I thingk we can rename JobImplpreemptMappersOnReduceFinish,  
MRJobConfig.PREEMPT_MAPPERS_ON_REDUCE_FINISH and 
MRJobConfig.DEFAULT_PREEMPT_MAPPERS_ON_REDUCE_FINISH to convey more clearly 
that our intention is to finish the job from the users' perspective (because 
the configuration is user-facing). Also the description of 
mapreduce.job.finish-when-all-reducers-done in mapred-default.xml.

bq. 2. Added a new method to MapTaskImpl with locking, which is probably not 
necessary but I felt it's better to have it anyway
What I meant previously is that we are creating duplicated TA_KILL events in 
JobImpl.checkReadyForCompletionWhenAllReducersDone(). I am not sure adding 
MapTaskImp.preemptedDueToReducerFinish is going to help in this case, because 
we will still be sending duplicated TA_KILL events. I was proposing something 
like the following
{code}
private void checkReadyForCompletionWhenAllReducersDone(JobImpl job) {
  if (job.preemptRestartedMappersOnReduceFinish) {
int totalReduces = job.getTotalReduces();
int completedReduces = job.getCompletedReduces();
// only if all reducers have finished and we have not sent TA_KILL 
events to running
// map tasks before
if (totalReduces > 0 && totalReduces == completedReduces && 
!inCompletingJob) {
  for (TaskId mapTaskId : job.mapTasks) {
MapTaskImpl task = (MapTaskImpl) job.tasks.get(mapTaskId);
if (!task.isFinished() && !task.isPreemptedDueToReducerFinish()) {
  LOG.info("Killing map task " + task.getID());
  task.setPreemptedDueToReducerFinish(true);
  job.eventHandler.handle(
  new TaskEvent(task.getID(), TaskEventType.T_KILL));
}
  }
  inCompletingJob = true;
}
  }
{code}

In TestJobImpl, can you explain to me what this code is doing
{code}
// finish mappers - if the mapper is preempted, its state will be
// KILLED, set by KillWaitAttemptKilledTransition
for (TaskId taskId: job.tasks.keySet()) {
  TaskState state = killMappers ? TaskState.KILLED : TaskState.SUCCEEDED;
  if (taskId.getTaskType() == TaskType.MAP) {
job.handle(new JobTaskEvent(taskId, state));
  }
}
{code}
Also, do we expect the job the succeed even when killMappers is set to false? 
I'd expect if killMappers is false,
The mappers will stay in RUNNING state, and thus hanging the job. In other 
words, the job should stay in
RUNNING state (after we drain events in the dispatcher event queue). No?


> Add configuration for MR job to finish when all reducers are complete (even 
> with unfinished mappers)
> 
>
> Key: MAPREDUCE-6870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6870
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Affects Versions: 2.6.1
>Reporter: Zhe Zhang
>Assignee: Peter Bacsko
> Attachments: MAPREDUCE-6870-001.patch, MAPREDUCE-6870-002.patch, 
> MAPREDUCE-6870-003.patch
>
>
> Even with MAPREDUCE-5817, there could still be cases where mappers get 
> scheduled before all reducers are complete, but those mappers run for long 
> time, even after all reducers are complete. This could hurt the performance 
> of large MR jobs.
> In some cases, mappers don't have any materialize-able outcome other than 
> providing intermediate data to reducers. In that case, the job owner should 
> have the config option to finish the job once all reducers are complete.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6927) MR job should only set tracking url if history was successfully written

2017-08-02 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111262#comment-16111262
 ] 

Hadoop QA commented on MAPREDUCE-6927:
--

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue}  0m 
20s{color} | {color:blue} Docker mode activated. {color} |
| {color:green}+1{color} | {color:green} @author {color} | {color:green}  0m  
0s{color} | {color:green} The patch does not contain any @author tags. {color} |
| {color:green}+1{color} | {color:green} test4tests {color} | {color:green}  0m 
 0s{color} | {color:green} The patch appears to include 3 new or modified test 
files. {color} |
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue}  0m  
8s{color} | {color:blue} Maven dependency ordering for branch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 14m 
34s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  1m 
47s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green}  0m 
29s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green}  0m 
50s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green}  0m 
36s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green}  1m  
8s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green}  0m 
30s{color} | {color:green} trunk passed {color} |
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue}  0m  
7s{color} | {color:blue} Maven dependency ordering for patch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green}  0m 
44s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  1m 
40s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green}  1m 
40s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green}  0m 
27s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green}  0m 
45s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green}  0m 
31s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green}  0m 
 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green}  1m 
17s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green}  0m 
26s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green}  9m  
9s{color} | {color:green} hadoop-mapreduce-client-app in the patch passed. 
{color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green}  3m 
14s{color} | {color:green} hadoop-mapreduce-client-hs in the patch passed. 
{color} |
| {color:green}+1{color} | {color:green} asflicense {color} | {color:green}  0m 
15s{color} | {color:green} The patch does not generate ASF License warnings. 
{color} |
| {color:black}{color} | {color:black} {color} | {color:black} 39m 51s{color} | 
{color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Docker |  Image:yetus/hadoop:14b5c93 |
| JIRA Issue | MAPREDUCE-6927 |
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/12880066/MAPREDUCE-6927.001.patch
 |
| Optional Tests |  asflicense  compile  javac  javadoc  mvninstall  mvnsite  
unit  findbugs  checkstyle  |
| uname | Linux ff869078caaa 3.13.0-117-generic #164-Ubuntu SMP Fri Apr 7 
11:05:26 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh 
|
| git revision | trunk / 4889913 |
| Default Java | 1.8.0_131 |
| findbugs | v3.1.0-RC1 |
|  Test Results | 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7039/testReport/ |
| modules | C: 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs U: 
hadoop-mapreduce-project/hadoop-mapreduce-client |
| Console output | 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7039/console |
| Powered by | Apache Yetus 0.4.0   http://yetus.apache.org |


This message was automatically generated.



[jira] [Updated] (MAPREDUCE-6927) MR job should only set tracking url if history was successfully written

2017-08-02 Thread Eric Badger (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Badger updated MAPREDUCE-6927:
---
Attachment: MAPREDUCE-6927.001.patch

> MR job should only set tracking url if history was successfully written
> ---
>
> Key: MAPREDUCE-6927
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6927
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Eric Badger
>Assignee: Eric Badger
> Attachments: MAPREDUCE-6927.001.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Updated] (MAPREDUCE-6927) MR job should only set tracking url if history was successfully written

2017-08-02 Thread Eric Badger (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Badger updated MAPREDUCE-6927:
---
Status: Patch Available  (was: Open)

> MR job should only set tracking url if history was successfully written
> ---
>
> Key: MAPREDUCE-6927
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6927
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Reporter: Eric Badger
>Assignee: Eric Badger
> Attachments: MAPREDUCE-6927.001.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Comment Edited] (MAPREDUCE-6892) Issues with the count of failed/killed tasks in the jhist file

2017-08-02 Thread Peter Bacsko (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16110785#comment-16110785
 ] 

Peter Bacsko edited comment on MAPREDUCE-6892 at 8/2/17 12:28 PM:
--

[~haibochen] I have been thinking about this renaming and IMO we're better off 
solve it in a separate JIRA.

Some problems:
1. I can add new fields to the Avro schema and keep the old ones, but it adds 
to the complexity, because we have to support both pairs
2. We probably also have to check whether "finished" or "succeeded" values are 
defined in the jhist file, but not both
3. Looking at the {{toTimelineEvent()}} methods, those are a bit messed up too. 
As I mentioned above, it has prop. keys like {{NUM_MAPS}} and 
{{FINISHED_MAPS}}, but both contain the same value. So either we rename 
{{NUM_MAPS}} to {{SUCCEEDED_MAPS}} and delete {{FINISHED_MAPS}} or {{NUM_MAPS}} 
should be succeeded+failed+killed and rename {{FINISHED_MAPS}} to 
{{SUCCEEDED_MAPS}}.

So it's a bit more complicated. It looks more beneficial to us if we create a 
separate JIRA where we try to consolidate the naming inconsistencies (and 
possibly involve other upstream devs) and leave these fields alone in this one. 
What do you think?




was (Author: pbacsko):
[~haibochen] I have been thinking about this renaming and IMO we're better off 
solve it in a separate JIRA.

Some problems:
1. I can add new fields to the Avro schema and keep the old ones, but it adds 
to the complexity, because we have to support both pairs
2. We probably also have to check whether "finished" or "succeeded" values are 
defined in the jhist file, but not both
3. Looking at the {{toTimelineEvent()}} methods, those are a bit messed up too. 
As I mentioned above, it has prop. keys like {{NUM_MAPS}} and 
{{FINISHED_MAPS}}, but both contain the same value. So either we rename 
{{NUM_MAPS}} to {{SUCCEEDED_MAPS}} and delete {{FINISHED_MAPS}} or {{NUM_MAPS}} 
should be succeeded+failed+killed and rename {{FINISHED_MAPS}} to 
{{SUCCEEDED_MAPS}}.

So it's a bit more complicated. It looks more beneficial to us if we create a 
separate JIRA where we try to consolidate the naming inconsistencies (and 
possibly involve other upstream devs) and leave them alone in this one. What do 
you think?



> Issues with the count of failed/killed tasks in the jhist file
> --
>
> Key: MAPREDUCE-6892
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6892
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: client, jobhistoryserver
>Reporter: Peter Bacsko
>Assignee: Peter Bacsko
> Attachments: MAPREDUCE-6892-001.patch, MAPREDUCE-6892-002.PATCH
>
>
> Recently we encountered some issues with the value of failed tasks. After 
> parsing the jhist file, {{JobInfo.getFailedMaps()}} returned 0, but actually 
> there were failures. 
> Another minor thing is that you cannot get the number of killed tasks 
> (although this can be calculated).
> The root cause is that {{JobUnsuccessfulCompletionEvent}} contains only the 
> successful map/reduce task counts. Number of failed (or killed) tasks are not 
> stored.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6892) Issues with the count of failed/killed tasks in the jhist file

2017-08-02 Thread Peter Bacsko (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16110785#comment-16110785
 ] 

Peter Bacsko commented on MAPREDUCE-6892:
-

[~haibochen] I have been thinking about this renaming and IMO we're better off 
solve it in a separate JIRA.

Some problems:
1. I can add new fields to the Avro schema and keep the old ones, but it adds 
to the complexity, because we have to support both pairs
2. We probably also have to check whether "finished" or "succeeded" values are 
defined in the jhist file, but not both
3. Looking at the {{toTimelineEvent()}} methods, those are a bit messed up too. 
As I mentioned above, it has prop. keys like {{NUM_MAPS}} and 
{{FINISHED_MAPS}}, but both contain the same value. So either we rename 
{{NUM_MAPS}} to {{SUCCEEDED_MAPS}} and delete {{FINISHED_MAPS}} or {{NUM_MAPS}} 
should be succeeded+failed+killed and rename {{FINISHED_MAPS}} to 
{{SUCCEEDED_MAPS}}.

So it's a bit more complicated. It looks more beneficial to us if we create a 
separate JIRA where we try to consolidate the naming inconsistencies (and 
possibly involve other upstream devs) and leave them alone in this one. What do 
you think?



> Issues with the count of failed/killed tasks in the jhist file
> --
>
> Key: MAPREDUCE-6892
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6892
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: client, jobhistoryserver
>Reporter: Peter Bacsko
>Assignee: Peter Bacsko
> Attachments: MAPREDUCE-6892-001.patch, MAPREDUCE-6892-002.PATCH
>
>
> Recently we encountered some issues with the value of failed tasks. After 
> parsing the jhist file, {{JobInfo.getFailedMaps()}} returned 0, but actually 
> there were failures. 
> Another minor thing is that you cannot get the number of killed tasks 
> (although this can be calculated).
> The root cause is that {{JobUnsuccessfulCompletionEvent}} contains only the 
> successful map/reduce task counts. Number of failed (or killed) tasks are not 
> stored.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Commented] (MAPREDUCE-6870) Add configuration for MR job to finish when all reducers are complete (even with unfinished mappers)

2017-08-02 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16110695#comment-16110695
 ] 

Hadoop QA commented on MAPREDUCE-6870:
--

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue}  0m 
21s{color} | {color:blue} Docker mode activated. {color} |
| {color:green}+1{color} | {color:green} @author {color} | {color:green}  0m  
0s{color} | {color:green} The patch does not contain any @author tags. {color} |
| {color:green}+1{color} | {color:green} test4tests {color} | {color:green}  0m 
 0s{color} | {color:green} The patch appears to include 1 new or modified test 
files. {color} |
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue}  0m 
54s{color} | {color:blue} Maven dependency ordering for branch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 14m 
36s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  1m 
50s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green}  0m 
31s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green}  0m 
57s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green}  0m 
37s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green}  1m 
31s{color} | {color:green} trunk passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green}  0m 
40s{color} | {color:green} trunk passed {color} |
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue}  0m  
8s{color} | {color:blue} Maven dependency ordering for patch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green}  0m 
50s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  1m 
45s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green}  1m 
45s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green}  0m 
27s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} mvnsite {color} | {color:green}  0m 
54s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green}  0m 
30s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green}  0m 
 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} xml {color} | {color:green}  0m  
2s{color} | {color:green} The patch has no ill-formed XML file. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green}  1m 
42s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green}  0m 
35s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green}  2m 
45s{color} | {color:green} hadoop-mapreduce-client-core in the patch passed. 
{color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green}  9m  
6s{color} | {color:green} hadoop-mapreduce-client-app in the patch passed. 
{color} |
| {color:green}+1{color} | {color:green} asflicense {color} | {color:green}  0m 
17s{color} | {color:green} The patch does not generate ASF License warnings. 
{color} |
| {color:black}{color} | {color:black} {color} | {color:black} 41m 55s{color} | 
{color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Docker |  Image:yetus/hadoop:14b5c93 |
| JIRA Issue | MAPREDUCE-6870 |
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/12880006/MAPREDUCE-6870-003.patch
 |
| Optional Tests |  asflicense  compile  javac  javadoc  mvninstall  mvnsite  
unit  findbugs  checkstyle  xml  |
| uname | Linux 9d9cb5dc9e02 3.13.0-117-generic #164-Ubuntu SMP Fri Apr 7 
11:05:26 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh 
|
| git revision | trunk / f9139ac |
| Default Java | 1.8.0_131 |
| findbugs | v3.1.0-RC1 |
|  Test Results | 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7038/testReport/ |
| modules | C: 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app U: 
hadoop-mapreduce-project/hadoop-mapreduce-client |
| Console output | 
https://builds.apache

[jira] [Commented] (MAPREDUCE-6870) Add configuration for MR job to finish when all reducers are complete (even with unfinished mappers)

2017-08-02 Thread Peter Bacsko (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16110667#comment-16110667
 ] 

Peter Bacsko commented on MAPREDUCE-6870:
-

Thanks [~haibochen], I addressed your comments.

Things to check / consider:
1. Variable names (is {{preemptMappersOnReduceFinish}} good?)
2. Added a new method to {{MapTaskImpl}} with locking, which is probably not 
necessary but I felt it's better to have it anyway


> Add configuration for MR job to finish when all reducers are complete (even 
> with unfinished mappers)
> 
>
> Key: MAPREDUCE-6870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6870
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Affects Versions: 2.6.1
>Reporter: Zhe Zhang
>Assignee: Peter Bacsko
> Attachments: MAPREDUCE-6870-001.patch, MAPREDUCE-6870-002.patch, 
> MAPREDUCE-6870-003.patch
>
>
> Even with MAPREDUCE-5817, there could still be cases where mappers get 
> scheduled before all reducers are complete, but those mappers run for long 
> time, even after all reducers are complete. This could hurt the performance 
> of large MR jobs.
> In some cases, mappers don't have any materialize-able outcome other than 
> providing intermediate data to reducers. In that case, the job owner should 
> have the config option to finish the job once all reducers are complete.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org



[jira] [Updated] (MAPREDUCE-6870) Add configuration for MR job to finish when all reducers are complete (even with unfinished mappers)

2017-08-02 Thread Peter Bacsko (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Peter Bacsko updated MAPREDUCE-6870:

Attachment: MAPREDUCE-6870-003.patch

> Add configuration for MR job to finish when all reducers are complete (even 
> with unfinished mappers)
> 
>
> Key: MAPREDUCE-6870
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6870
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>Affects Versions: 2.6.1
>Reporter: Zhe Zhang
>Assignee: Peter Bacsko
> Attachments: MAPREDUCE-6870-001.patch, MAPREDUCE-6870-002.patch, 
> MAPREDUCE-6870-003.patch
>
>
> Even with MAPREDUCE-5817, there could still be cases where mappers get 
> scheduled before all reducers are complete, but those mappers run for long 
> time, even after all reducers are complete. This could hurt the performance 
> of large MR jobs.
> In some cases, mappers don't have any materialize-able outcome other than 
> providing intermediate data to reducers. In that case, the job owner should 
> have the config option to finish the job once all reducers are complete.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org