[jira] [Comment Edited] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-02 Thread Muhammad Samir Khan (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16111794#comment-16111794
 ] 

Muhammad Samir Khan edited comment on TEZ-3809 at 8/2/17 10:07 PM:
---

Took a heap dump on ordered word count before final merge. In the after case, 
one of the outputs was written to disk instead of kept in memory and that is 
why it has 37 entries. 

Before:

Class Name  
   | Shallow Heap | Retained Heap | Percentage
---
java.lang.Thread @ 0x5d2c473f8  ShuffleAndMergeRunner {Tokenizer} Thread
   |  120 | 2,229,207,992 | 96.48%
|- java.util.ArrayList @ 0x73f978f10
   |   24 | 2,229,206,760 | 96.48%
|  '- java.lang.Object[38] @ 0x73f979130
   |  168 | 2,229,206,736 | 96.48%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88898|   32 |68,078,192 |  2.95%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b260|   32 |67,839,520 |  2.94%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a888b8|   32 |67,700,608 |  2.93%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73f9db168|   32 |67,500,816 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab36218|   32 |67,408,704 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deed28|   32 |67,367,424 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x743b86ee0|   32 |67,337,936 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af3a698|   32 |67,300,896 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c5b8|   32 |67,282,464 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab33140|   32 |67,264,304 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88878|   32 |67,127,368 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b218|   32 |67,098,216 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c6c8|   32 |67,064,504 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d239a6c8|   32 |67,003,776 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d23b7e10|   32 |66,965,296 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631def2b8|   32 |66,928,032 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab351d0|   32 |66,916,896 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x74805dfb8|   32 |66,886,272 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39598|   32 |66,718,800 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73fb0fb78|   32 |66,688,296 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c4b0|   32 |66,656,312 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39578|   32 |66,629,936 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deec30|   32 |66,584,576 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c680|   

[jira] [Comment Edited] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-02 Thread Muhammad Samir Khan (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16111794#comment-16111794
 ] 

Muhammad Samir Khan edited comment on TEZ-3809 at 8/2/17 10:04 PM:
---

Took a heap dump on ordered word count before final merge. In the after case, 
one of the outputs was written to disk instead of kept in memory and that is 
why it has 37 entries. 

Before:

Class Name  
   | Shallow Heap | Retained Heap | Percentage
---
java.lang.Thread @ 0x5d2c473f8  ShuffleAndMergeRunner {Tokenizer} Thread
   |  120 | 2,229,207,992 | 96.48%
|- java.util.ArrayList @ 0x73f978f10
   |   24 | 2,229,206,760 | 96.48%
|  '- java.lang.Object[38] @ 0x73f979130
   |  168 | 2,229,206,736 | 96.48%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88898|   32 |68,078,192 |  2.95%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b260|   32 |67,839,520 |  2.94%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a888b8|   32 |67,700,608 |  2.93%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73f9db168|   32 |67,500,816 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab36218|   32 |67,408,704 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deed28|   32 |67,367,424 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x743b86ee0|   32 |67,337,936 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af3a698|   32 |67,300,896 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c5b8|   32 |67,282,464 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab33140|   32 |67,264,304 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88878|   32 |67,127,368 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b218|   32 |67,098,216 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c6c8|   32 |67,064,504 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d239a6c8|   32 |67,003,776 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d23b7e10|   32 |66,965,296 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631def2b8|   32 |66,928,032 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab351d0|   32 |66,916,896 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x74805dfb8|   32 |66,886,272 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39598|   32 |66,718,800 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73fb0fb78|   32 |66,688,296 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c4b0|   32 |66,656,312 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39578|   32 |66,629,936 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deec30|   32 |66,584,576 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c680|   

[jira] [Comment Edited] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-02 Thread Muhammad Samir Khan (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16111794#comment-16111794
 ] 

Muhammad Samir Khan edited comment on TEZ-3809 at 8/2/17 10:04 PM:
---

Took a heap dump on ordered word count before final merge. In the after case, 
one of the outputs was written to disk instead of kept in memory and that is 
why it has 37 entries. 

Before:

Class Name  
   | Shallow Heap | Retained Heap | Percentage
---
java.lang.Thread @ 0x5d2c473f8  ShuffleAndMergeRunner {Tokenizer} Thread
   |  120 | 2,229,207,992 | 96.48%
|- java.util.ArrayList @ 0x73f978f10
   |   24 | 2,229,206,760 | 96.48%
|  '- java.lang.Object[38] @ 0x73f979130
   |  168 | 2,229,206,736 | 96.48%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88898|   32 |68,078,192 |  2.95%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b260|   32 |67,839,520 |  2.94%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a888b8|   32 |67,700,608 |  2.93%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73f9db168|   32 |67,500,816 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab36218|   32 |67,408,704 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deed28|   32 |67,367,424 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x743b86ee0|   32 |67,337,936 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af3a698|   32 |67,300,896 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c5b8|   32 |67,282,464 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab33140|   32 |67,264,304 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88878|   32 |67,127,368 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b218|   32 |67,098,216 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c6c8|   32 |67,064,504 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d239a6c8|   32 |67,003,776 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d23b7e10|   32 |66,965,296 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631def2b8|   32 |66,928,032 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab351d0|   32 |66,916,896 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x74805dfb8|   32 |66,886,272 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39598|   32 |66,718,800 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73fb0fb78|   32 |66,688,296 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c4b0|   32 |66,656,312 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39578|   32 |66,629,936 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deec30|   32 |66,584,576 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c680|   

[jira] [Comment Edited] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-02 Thread Muhammad Samir Khan (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16111794#comment-16111794
 ] 

Muhammad Samir Khan edited comment on TEZ-3809 at 8/2/17 10:03 PM:
---

Took a heap dump on ordered word count before final merge. In the after case, 
one of the outputs was written to disk instead of kept in memory and that is 
why it has 37 entries. 

Before:
Class Name  
   | Shallow Heap | Retained Heap | Percentage
---
java.lang.Thread @ 0x5d2c473f8  ShuffleAndMergeRunner {Tokenizer} Thread
   |  120 | 2,229,207,992 | 96.48%
|- java.util.ArrayList @ 0x73f978f10
   |   24 | 2,229,206,760 | 96.48%
|  '- java.lang.Object[38] @ 0x73f979130
   |  168 | 2,229,206,736 | 96.48%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88898|   32 |68,078,192 |  2.95%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b260|   32 |67,839,520 |  2.94%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a888b8|   32 |67,700,608 |  2.93%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73f9db168|   32 |67,500,816 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab36218|   32 |67,408,704 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deed28|   32 |67,367,424 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x743b86ee0|   32 |67,337,936 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af3a698|   32 |67,300,896 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c5b8|   32 |67,282,464 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab33140|   32 |67,264,304 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88878|   32 |67,127,368 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b218|   32 |67,098,216 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c6c8|   32 |67,064,504 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d239a6c8|   32 |67,003,776 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d23b7e10|   32 |66,965,296 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631def2b8|   32 |66,928,032 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab351d0|   32 |66,916,896 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x74805dfb8|   32 |66,886,272 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39598|   32 |66,718,800 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73fb0fb78|   32 |66,688,296 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c4b0|   32 |66,656,312 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39578|   32 |66,629,936 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deec30|   32 |66,584,576 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c680|   

[jira] [Commented] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-02 Thread Muhammad Samir Khan (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16111794#comment-16111794
 ] 

Muhammad Samir Khan commented on TEZ-3809:
--

Took a heap dump on ordered word count before final merge. In the after case, 
one of the outputs was written to disk instead of kept in memory and that is 
why it has 37 entries. 

Before:
Class Name  
   | Shallow Heap | Retained Heap | Percentage
---
java.lang.Thread @ 0x5d2c473f8  ShuffleAndMergeRunner {Tokenizer} Thread
   |  120 | 2,229,207,992 | 96.48%
|- java.util.ArrayList @ 0x73f978f10
   |   24 | 2,229,206,760 | 96.48%
|  '- java.lang.Object[38] @ 0x73f979130
   |  168 | 2,229,206,736 | 96.48%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88898|   32 |68,078,192 |  2.95%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b260|   32 |67,839,520 |  2.94%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a888b8|   32 |67,700,608 |  2.93%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73f9db168|   32 |67,500,816 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab36218|   32 |67,408,704 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deed28|   32 |67,367,424 |  2.92%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x743b86ee0|   32 |67,337,936 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af3a698|   32 |67,300,896 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c5b8|   32 |67,282,464 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab33140|   32 |67,264,304 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5e4a88878|   32 |67,127,368 |  2.91%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0b218|   32 |67,098,216 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c6c8|   32 |67,064,504 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d239a6c8|   32 |67,003,776 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x5d23b7e10|   32 |66,965,296 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631def2b8|   32 |66,928,032 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60ab351d0|   32 |66,916,896 |  2.90%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x74805dfb8|   32 |66,886,272 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39598|   32 |66,718,800 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x73fb0fb78|   32 |66,688,296 |  2.89%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c4b0|   32 |66,656,312 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x60af39578|   32 |66,629,936 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631deec30|   32 |66,584,576 |  2.88%
| |- 
org.apache.tez.runtime.library.common.shuffle.orderedgrouped.MapOutput$InMemoryMapOutput
 @ 0x631e0c680|   32 |66,537,624 |  2.88%
| |- 

Success: TEZ-3810 PreCommit Build #2597

2017-08-02 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-3810
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/2597/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 339.59 KB...]
[INFO] Tez  SUCCESS [  0.025 s]
[INFO] 
[INFO] BUILD SUCCESS
[INFO] 
[INFO] Total time: 55:29 min
[INFO] Finished at: 2017-08-02T17:13:37Z
[INFO] Final Memory: 89M/1585M
[INFO] 




{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880063/TEZ-3810-001.patch
  against master revision 614937c.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2597//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2597//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
ded15c2f828ef98c8d997d6f2e88582eacb7f80d logged out


==
==
Finished build.
==
==


Archiving artifacts
[description-setter] Description set: TEZ-3810
Recording test results
Email was triggered for: Success
Sending email for trigger: Success



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-3810) TezCounter for idle time in shuffle phase

2017-08-02 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16111340#comment-16111340
 ] 

TezQA commented on TEZ-3810:


{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880063/TEZ-3810-001.patch
  against master revision 614937c.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2597//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2597//console

This message is automatically generated.

> TezCounter for idle time in shuffle phase
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
> Attachments: TEZ-3810-001.patch
>
>
>  A task attempt counter that tracks how much time was spent waiting for 
> inputs in the shuffle phase. We can use this to quickly identify jobs that 
> are wasting a lot of time on the grid with idle reducer tasks instead of 
> shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (TEZ-3810) TezCounter for idle time in shuffle phase

2017-08-02 Thread Ashwin Ramesh (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1661#comment-1661
 ] 

Ashwin Ramesh commented on TEZ-3810:


[~kshukla] Thanks for the review, will look at that immediately.


> TezCounter for idle time in shuffle phase
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
> Attachments: TEZ-3810-001.patch
>
>
>  A task attempt counter that tracks how much time was spent waiting for 
> inputs in the shuffle phase. We can use this to quickly identify jobs that 
> are wasting a lot of time on the grid with idle reducer tasks instead of 
> shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (TEZ-3810) TezCounter for idle time in shuffle phase

2017-08-02 Thread Kuhu Shukla (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1642#comment-1642
 ] 

Kuhu Shukla commented on TEZ-3810:
--

Thanks [~aramesh2] for the patch!
I haven't fully reviewed this patch, but I think we will need an equivalent 
change in ShuffleScheduler as well (which is the shuffle-r for the ordered 
case).

> TezCounter for idle time in shuffle phase
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
> Attachments: TEZ-3810-001.patch
>
>
>  A task attempt counter that tracks how much time was spent waiting for 
> inputs in the shuffle phase. We can use this to quickly identify jobs that 
> are wasting a lot of time on the grid with idle reducer tasks instead of 
> shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Updated] (TEZ-3810) TezCounter for idle time in shuffle phase

2017-08-02 Thread Ashwin Ramesh (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashwin Ramesh updated TEZ-3810:
---
Attachment: TEZ-3810-001.patch

> TezCounter for idle time in shuffle phase
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
> Attachments: TEZ-3810-001.patch
>
>
>  A task attempt counter that tracks how much time was spent waiting for 
> inputs in the shuffle phase. We can use this to quickly identify jobs that 
> are wasting a lot of time on the grid with idle reducer tasks instead of 
> shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Updated] (TEZ-3810) TezCounter for idle time in shuffle phase

2017-08-02 Thread Ashwin Ramesh (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashwin Ramesh updated TEZ-3810:
---
Description: 
 A task attempt counter that tracks how much time was spent waiting for inputs 
in the shuffle phase. We can use this to quickly identify jobs that are wasting 
a lot of time on the grid with idle reducer tasks instead of shuffling/merging.


  was:
We can use this to quickly identify jobs that are wasting a lot of time on the 
grid with idle reducer tasks instead of shuffling/merging.



> TezCounter for idle time in shuffle phase
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
>
>  A task attempt counter that tracks how much time was spent waiting for 
> inputs in the shuffle phase. We can use this to quickly identify jobs that 
> are wasting a lot of time on the grid with idle reducer tasks instead of 
> shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Updated] (TEZ-3810) TezCounter for idle time in shuffle phase

2017-08-02 Thread Ashwin Ramesh (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashwin Ramesh updated TEZ-3810:
---
Summary: TezCounter for idle time in shuffle phase  (was:  A task attempt 
counter that tracks how much time was spent waiting for inputs in the shuffle 
phase. )

> TezCounter for idle time in shuffle phase
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
>
> We can use this to quickly identify jobs that are wasting a lot of time on 
> the grid with idle reducer tasks instead of shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Updated] (TEZ-3810) A task attempt counter that tracks how much time was spent waiting for inputs in the shuffle phase.

2017-08-02 Thread Ashwin Ramesh (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashwin Ramesh updated TEZ-3810:
---
Priority: Major  (was: Minor)

>  A task attempt counter that tracks how much time was spent waiting for 
> inputs in the shuffle phase. 
> -
>
> Key: TEZ-3810
> URL: https://issues.apache.org/jira/browse/TEZ-3810
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Ashwin Ramesh
>
> We can use this to quickly identify jobs that are wasting a lot of time on 
> the grid with idle reducer tasks instead of shuffling/merging.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (TEZ-3810) A task attempt counter that tracks how much time was spent waiting for inputs in the shuffle phase.

2017-08-02 Thread Ashwin Ramesh (JIRA)
Ashwin Ramesh created TEZ-3810:
--

 Summary:  A task attempt counter that tracks how much time was 
spent waiting for inputs in the shuffle phase. 
 Key: TEZ-3810
 URL: https://issues.apache.org/jira/browse/TEZ-3810
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Ashwin Ramesh
Priority: Minor


We can use this to quickly identify jobs that are wasting a lot of time on the 
grid with idle reducer tasks instead of shuffling/merging.




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)