[jira] [Comment Edited] (TEZ-3814) Inserts into a bucketed table fail randomly with Hive on Tez

2017-08-07 Thread Anant Mittal (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117544#comment-16117544
 ] 

Anant Mittal edited comment on TEZ-3814 at 8/7/17 11:24 PM:


[~gopalv] Thanks for the response. This is a functionality which is not working 
as expected. Please tell me if anything further can be done so that it 
satisfies being a bug.
The JIRA pointed out was something new. I gave the suggestion a try but even 
that failed to fix the issue and we still see failures with the same error. I 
used the value 4000 for net.core.somaxconn.
Please note that this is not a large load. The table used for insert has only 
62 rows with a total size of 26948 bytes.


was (Author: infinitymittal):
[~gopalv] Thanks for the response. This is a functionality which is not working 
as expected. Please tell me if I anything further can be done so that it 
satisfies being a bug.
The JIRA pointed out was something I had not yet tried then. I gave it a try 
but even that failed to fix the issue and we still see failures with the same 
error. I used the value 4000 for net.core.somaxconn.
Please note that this is not a large load. The table used for insert has only 
62 rows with a total size of 26948 bytes.

> Inserts into a bucketed table fail randomly with Hive on Tez
> 
>
> Key: TEZ-3814
> URL: https://issues.apache.org/jira/browse/TEZ-3814
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.7.0
>Reporter: Anant Mittal
>  Labels: Bucketing, Hive, Tez
>
> The MAP phase for Inserts into a bucketed table randomly fails with the error 
> "Vertex  [Map 1] failed as task  failed after vertex 
> succeeded.]DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 
> killedVertices:0".
> The task fails because it fails for all attempts with " being 
> failed for too many output errors. failureFraction=0.2, 
> MAX_ALLOWED_OUTPUT_FAILURES_FRACTION=0.1, uniquefailedOutputReports=1, 
> MAX_ALLOWED_OUTPUT_FAILURES=10, MAX_ALLOWED_TIME_FOR_TASK_READ_ERROR_SEC=300, 
> readErrorTimespan=0"
> This happens more often if the table is ACID enabled and a delete operation 
> is performed before the inserts.
> I have tried the following:
> Changed tez.am.launch.cmd-opts, tez.task.launch.cmd-opts and 
> hive.tez.java.opts to use parallel GC.
> tez.runtime.shuffle.max.allowed.failed.fetch.fraction = 0.95
> tez.runtime.shuffle.failed.check.since-last.completion=false
> tez.runtime.shuffle.fetch.buffer.percent = 0.1
> tez.runtime.shuffle.memory.limit.percent = 0.25
> tez.runtime.shuffle.ssl.enable=false
> Deleted ".../usercache//filecache" and ".../usercache//appcache"
> I am using HDP 2.6 dsitribution.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Reopened] (TEZ-3814) Inserts into a bucketed table fail randomly with Hive on Tez

2017-08-07 Thread Anant Mittal (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anant Mittal reopened TEZ-3814:
---

> Inserts into a bucketed table fail randomly with Hive on Tez
> 
>
> Key: TEZ-3814
> URL: https://issues.apache.org/jira/browse/TEZ-3814
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.7.0
>Reporter: Anant Mittal
>  Labels: Bucketing, Hive, Tez
>
> The MAP phase for Inserts into a bucketed table randomly fails with the error 
> "Vertex  [Map 1] failed as task  failed after vertex 
> succeeded.]DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 
> killedVertices:0".
> The task fails because it fails for all attempts with " being 
> failed for too many output errors. failureFraction=0.2, 
> MAX_ALLOWED_OUTPUT_FAILURES_FRACTION=0.1, uniquefailedOutputReports=1, 
> MAX_ALLOWED_OUTPUT_FAILURES=10, MAX_ALLOWED_TIME_FOR_TASK_READ_ERROR_SEC=300, 
> readErrorTimespan=0"
> This happens more often if the table is ACID enabled and a delete operation 
> is performed before the inserts.
> I have tried the following:
> Changed tez.am.launch.cmd-opts, tez.task.launch.cmd-opts and 
> hive.tez.java.opts to use parallel GC.
> tez.runtime.shuffle.max.allowed.failed.fetch.fraction = 0.95
> tez.runtime.shuffle.failed.check.since-last.completion=false
> tez.runtime.shuffle.fetch.buffer.percent = 0.1
> tez.runtime.shuffle.memory.limit.percent = 0.25
> tez.runtime.shuffle.ssl.enable=false
> Deleted ".../usercache//filecache" and ".../usercache//appcache"
> I am using HDP 2.6 dsitribution.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (TEZ-3814) Inserts into a bucketed table fail randomly with Hive on Tez

2017-08-07 Thread Anant Mittal (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117544#comment-16117544
 ] 

Anant Mittal commented on TEZ-3814:
---

[~gopalv] Thanks for the response. This is a functionality which is not working 
as expected. Please tell me if I anything further can be done so that it 
satisfies being a bug.
The JIRA pointed out was something I had not yet tried then. I gave it a try 
but even that failed to fix the issue and we still see failures with the same 
error. I used the value 4000 for net.core.somaxconn.
Please note that this is not a large load. The table used for insert has only 
62 rows with a total size of 26948 bytes.

> Inserts into a bucketed table fail randomly with Hive on Tez
> 
>
> Key: TEZ-3814
> URL: https://issues.apache.org/jira/browse/TEZ-3814
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.7.0
>Reporter: Anant Mittal
>  Labels: Bucketing, Hive, Tez
>
> The MAP phase for Inserts into a bucketed table randomly fails with the error 
> "Vertex  [Map 1] failed as task  failed after vertex 
> succeeded.]DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 
> killedVertices:0".
> The task fails because it fails for all attempts with " being 
> failed for too many output errors. failureFraction=0.2, 
> MAX_ALLOWED_OUTPUT_FAILURES_FRACTION=0.1, uniquefailedOutputReports=1, 
> MAX_ALLOWED_OUTPUT_FAILURES=10, MAX_ALLOWED_TIME_FOR_TASK_READ_ERROR_SEC=300, 
> readErrorTimespan=0"
> This happens more often if the table is ACID enabled and a delete operation 
> is performed before the inserts.
> I have tried the following:
> Changed tez.am.launch.cmd-opts, tez.task.launch.cmd-opts and 
> hive.tez.java.opts to use parallel GC.
> tez.runtime.shuffle.max.allowed.failed.fetch.fraction = 0.95
> tez.runtime.shuffle.failed.check.since-last.completion=false
> tez.runtime.shuffle.fetch.buffer.percent = 0.1
> tez.runtime.shuffle.memory.limit.percent = 0.25
> tez.runtime.shuffle.ssl.enable=false
> Deleted ".../usercache//filecache" and ".../usercache//appcache"
> I am using HDP 2.6 dsitribution.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (TEZ-3813) Reduce Object size of MemoryFetchedInput for large jobs

2017-08-07 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117457#comment-16117457
 ] 

TezQA commented on TEZ-3813:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880711/TEZ-3813.003.patch
  against master revision 8dcf8a1.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2603//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2603//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2603//console

This message is automatically generated.

> Reduce Object size of MemoryFetchedInput for large jobs
> ---
>
> Key: TEZ-3813
> URL: https://issues.apache.org/jira/browse/TEZ-3813
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Muhammad Samir Khan
>Assignee: Muhammad Samir Khan
> Attachments: TEZ-3813.001.patch, TEZ-3813.002.patch, 
> TEZ-3813.003.patch
>
>
> Same as TEZ-3752 for the unordered case. MemoryFetchedInput has a 
> BoundedByteArrayOutputStream that is not used (only the underlying byte[] is 
> used).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Failed: TEZ-3813 PreCommit Build #2603

2017-08-07 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-3813
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/2603/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 340.48 KB...]
[INFO] 
[INFO] Total time: 55:38 min
[INFO] Finished at: 2017-08-07T22:34:14Z
[INFO] Final Memory: 89M/1389M
[INFO] 




{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880711/TEZ-3813.003.patch
  against master revision 8dcf8a1.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2603//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2603//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2603//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
8fe01799cae2e517338c6d547aec916db4bded6d logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
[Fast Archiver] Compressed 3.51 MB of artifacts by 32.9% relative to #2601
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure - Any
Sending email for trigger: Failure - Any



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-3813) Reduce Object size of MemoryFetchedInput for large jobs

2017-08-07 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117354#comment-16117354
 ] 

TezQA commented on TEZ-3813:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880696/TEZ-3813.002.patch
  against master revision 8dcf8a1.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2602//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2602//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2602//console

This message is automatically generated.

> Reduce Object size of MemoryFetchedInput for large jobs
> ---
>
> Key: TEZ-3813
> URL: https://issues.apache.org/jira/browse/TEZ-3813
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Muhammad Samir Khan
>Assignee: Muhammad Samir Khan
> Attachments: TEZ-3813.001.patch, TEZ-3813.002.patch, 
> TEZ-3813.003.patch
>
>
> Same as TEZ-3752 for the unordered case. MemoryFetchedInput has a 
> BoundedByteArrayOutputStream that is not used (only the underlying byte[] is 
> used).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Failed: TEZ-3813 PreCommit Build #2602

2017-08-07 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-3813
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/2602/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 340.27 KB...]
[INFO] BUILD SUCCESS
[INFO] 
[INFO] Total time: 01:00 h
[INFO] Finished at: 2017-08-07T21:36:34Z
[INFO] Final Memory: 81M/1411M
[INFO] 




{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880696/TEZ-3813.002.patch
  against master revision 8dcf8a1.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2602//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2602//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2602//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
13491b4683aab50fd83a9a58b8123590a7e8a6b5 logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure - Any
Sending email for trigger: Failure - Any



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Updated] (TEZ-3813) Reduce Object size of MemoryFetchedInput for large jobs

2017-08-07 Thread Muhammad Samir Khan (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Muhammad Samir Khan updated TEZ-3813:
-
Attachment: TEZ-3813.003.patch

Fixed a NPE when getSize() is called after a free().
Removed actualSize and compressedSize from FetchedInput. Now child classes are 
responsible for maintaining size (compressed or actual).

*JOL Dump:*
+After:+
Internals:
{code}
# Running 64-bit HotSpot VM.
# Using compressed oop with 3-bit shift.
# Using compressed klass with 3-bit shift.
# Objects are 8 bytes aligned.
# Field sizes by type: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]
# Array element sizes: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]

Instantiated the sample instance via public 
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput(long,org.apache.tez.runtime.library.common.InputAttemptIdentifier,org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback)

org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput object 
internals:
 OFFSET  SIZE 
TYPE DESCRIPTION   VALUE
  0 4   
   (object header)   01 00 00 00 (0001  
 ) (1)
  4 4   
   (object header)   00 00 00 00 (  
 ) (0)
  8 4   
   (object header)   7d 12 01 f8 (0101 00010010 
0001 1000) (-134147459)
 12 4  
int FetchedInput.id   0
 16 1 
byte FetchedInput.state0
 17 3   
   (alignment/padding gap)  
 20 4 
org.apache.tez.runtime.library.common.InputAttemptIdentifier 
FetchedInput.inputAttemptIdentifier   null
 24 4   
org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback 
FetchedInput.callback null
 28 4  
int MemoryFetchedInput.size   0
 32 4   
byte[] MemoryFetchedInput.byteArray  []
 36 4   
   (loss due to the next object alignment)
Instance size: 40 bytes
Space losses: 3 bytes internal + 4 bytes external = 7 bytes total
{code}

Footprint:
{code}
# Running 64-bit HotSpot VM.
# Using compressed oop with 3-bit shift.
# Using compressed klass with 3-bit shift.
# Objects are 8 bytes aligned.
# Field sizes by type: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]
# Array element sizes: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]

Instantiated the sample instance via public 
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput(long,org.apache.tez.runtime.library.common.InputAttemptIdentifier,org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback)

org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput@215be6bbd 
footprint:
 COUNT   AVG   SUM   DESCRIPTION
 11616   [B
 14040   
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput
 2  56   (total)
{code}

> Reduce Object size of MemoryFetchedInput for large jobs
> ---
>
> Key: TEZ-3813
> URL: https://issues.apache.org/jira/browse/TEZ-3813
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Muhammad Samir Khan
>Assignee: Muhammad Samir Khan
> Attachments: TEZ-3813.001.patch, TEZ-3813.002.patch, 
> TEZ-3813.003.patch
>
>
> Same as TEZ-3752 for the unordered case. MemoryFetchedInput has a 
> BoundedByteArrayOutputStream that is not used (only the underlying byte[] is 
> used).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Updated] (TEZ-3813) Reduce Object size of MemoryFetchedInput for large jobs

2017-08-07 Thread Muhammad Samir Khan (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Muhammad Samir Khan updated TEZ-3813:
-
Attachment: TEZ-3813.002.patch

Some more optimizations related to enums, similar to TEZ-3732.

*JOL Dump:*
+Before:+
Internals:
{code}
# Running 64-bit HotSpot VM.
# Using compressed oop with 3-bit shift.
# Using compressed klass with 3-bit shift.
# Objects are 8 bytes aligned.
# Field sizes by type: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]
# Array element sizes: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]

Instantiated the sample instance via public 
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput(long,long,org.apache.tez.runtime.library.common.InputAttemptIdentifier,org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback)

org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput object 
internals:
 OFFSET  SIZE 
TYPE DESCRIPTION   VALUE
  0 4   
   (object header)   01 00 00 00 (0001  
 ) (1)
  4 4   
   (object header)   00 00 00 00 (  
 ) (0)
  8 4   
   (object header)   7a 12 01 f8 (0010 00010010 
0001 1000) (-134147462)
 12 4  
int FetchedInput.id   0
 16 8 
long FetchedInput.actualSize   0
 24 8 
long FetchedInput.compressedSize   0
 32 4 
org.apache.tez.runtime.library.common.InputAttemptIdentifier 
FetchedInput.inputAttemptIdentifier   null
 36 4  
org.apache.tez.runtime.library.common.shuffle.FetchedInput.Type 
FetchedInput.type (object)
 40 4   
org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback 
FetchedInput.callback null
 44 4 
org.apache.tez.runtime.library.common.shuffle.FetchedInput.State 
FetchedInput.state(object)
 48 4
org.apache.hadoop.io.BoundedByteArrayOutputStream MemoryFetchedInput.byteStream 
(object)
 52 4   
   (loss due to the next object alignment)
Instance size: 56 bytes
Space losses: 0 bytes internal + 4 bytes external = 4 bytes total
{code}

Footprint:
{code}
# Running 64-bit HotSpot VM.
# Using compressed oop with 3-bit shift.
# Using compressed klass with 3-bit shift.
# Objects are 8 bytes aligned.
# Field sizes by type: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]
# Array element sizes: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]

Instantiated the sample instance via public 
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput(long,long,org.apache.tez.runtime.library.common.InputAttemptIdentifier,org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback)

org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput@215be6bbd 
footprint:
 COUNT   AVG   SUM   DESCRIPTION
 11616   [B
 23264   [C
 22448   java.lang.String
 13232   
org.apache.hadoop.io.BoundedByteArrayOutputStream
 12424   
org.apache.tez.runtime.library.common.shuffle.FetchedInput$State
 12424   
org.apache.tez.runtime.library.common.shuffle.FetchedInput$Type
 15656   
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput
 9 264   (total)
{code}

+After:+
Internals:
{code}
# Running 64-bit HotSpot VM.
# Using compressed oop with 3-bit shift.
# Using compressed klass with 3-bit shift.
# Objects are 8 bytes aligned.
# Field sizes by type: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]
# Array element sizes: 4, 1, 1, 2, 2, 4, 4, 8, 8 [bytes]

Instantiated the sample instance via public 
org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput(long,long,org.apache.tez.runtime.library.common.InputAttemptIdentifier,org.apache.tez.runtime.library.common.shuffle.FetchedInputCallback)

org.apache.tez.runtime.library.common.shuffle.MemoryFetchedInput object 
internals:
 OFFSET  SIZE 
TYPE DESCRIPTION   VALUE
  0 4   
   (object header)   01 00 00 00 

[jira] [Commented] (TEZ-3431) Add unit tests for container release

2017-08-07 Thread Taklon Stephen Wu (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117082#comment-16117082
 ] 

Taklon Stephen Wu commented on TEZ-3431:


I have written a patch this item, can I be the assignee?

> Add unit tests for container release
> 
>
> Key: TEZ-3431
> URL: https://issues.apache.org/jira/browse/TEZ-3431
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Sushmitha Sreenivasan
>  Labels: newbie
>
> * Add unit tests to verify that scheduler release container after expiry 
> time(HeldContainer.containerExpiryTime).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-07 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117078#comment-16117078
 ] 

TezQA commented on TEZ-3809:


{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880667/TEZ-3809.002.patch
  against master revision 8dcf8a1.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 2 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2601//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2601//console

This message is automatically generated.

> The buffer size allocated for InMemoryMapOutput can be optimized
> 
>
> Key: TEZ-3809
> URL: https://issues.apache.org/jira/browse/TEZ-3809
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Muhammad Samir Khan
>Assignee: Muhammad Samir Khan
> Attachments: TEZ-3809.001.patch, TEZ-3809.002.patch
>
>
> Related jiras: TEZ-3752 and TEZ-3732.
> -When shuffling input to memory, the decompressed length is used to create 
> the InMemoryMapOutput object. However, IFile.Reader's readToMemory reads 4 
> bytes less (the IFile header). These 4 bytes can optimized and, in an extreme 
> case of 10,000,000 fetches, can save ~38 MB (TEZ-3732).
> -Memory-to-memory merge sums up the sizes of input InMemoryMapOutput buffers 
> to allocate the new InMemoryMapOutput. However, each input has two 
> EOF_MARKERs while only two are needed at the end.
> -InMemoryWriter wraps the output BoundedByteArrayOutputStream in 
> IFileOutputStream which will write checksum at close. This creates an 
> inconsistency between the primary input buffers which don't have checksum and 
> the merged buffers which do. IFileOutputStream wrap can be removed to save 4 
> bytes per merged buffers.
> -InMemoryWriter does not account for two EOF_MARKERs written at close() in 
> its accounting so that the getRawLength() method is off by two bytes.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Success: TEZ-3809 PreCommit Build #2601

2017-08-07 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-3809
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/2601/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 340.41 KB...]
[INFO] Tez  SUCCESS [  0.018 s]
[INFO] 
[INFO] BUILD SUCCESS
[INFO] 
[INFO] Total time: 54:40 min
[INFO] Finished at: 2017-08-07T19:07:06Z
[INFO] Final Memory: 93M/1338M
[INFO] 




{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12880667/TEZ-3809.002.patch
  against master revision 8dcf8a1.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 2 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 3.0.1) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-TEZ-Build/2601//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2601//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
2c4486fb33f7b93b0277293295b55d07c28ffe68 logged out


==
==
Finished build.
==
==


Archiving artifacts
[description-setter] Description set: TEZ-3809
Recording test results
Email was triggered for: Success
Sending email for trigger: Success



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Updated] (TEZ-3809) The buffer size allocated for InMemoryMapOutput can be optimized

2017-08-07 Thread Muhammad Samir Khan (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Muhammad Samir Khan updated TEZ-3809:
-
Attachment: TEZ-3809.002.patch

Removed the unused imports and fixed the names for new methods.

> The buffer size allocated for InMemoryMapOutput can be optimized
> 
>
> Key: TEZ-3809
> URL: https://issues.apache.org/jira/browse/TEZ-3809
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Muhammad Samir Khan
>Assignee: Muhammad Samir Khan
> Attachments: TEZ-3809.001.patch, TEZ-3809.002.patch
>
>
> Related jiras: TEZ-3752 and TEZ-3732.
> -When shuffling input to memory, the decompressed length is used to create 
> the InMemoryMapOutput object. However, IFile.Reader's readToMemory reads 4 
> bytes less (the IFile header). These 4 bytes can optimized and, in an extreme 
> case of 10,000,000 fetches, can save ~38 MB (TEZ-3732).
> -Memory-to-memory merge sums up the sizes of input InMemoryMapOutput buffers 
> to allocate the new InMemoryMapOutput. However, each input has two 
> EOF_MARKERs while only two are needed at the end.
> -InMemoryWriter wraps the output BoundedByteArrayOutputStream in 
> IFileOutputStream which will write checksum at close. This creates an 
> inconsistency between the primary input buffers which don't have checksum and 
> the merged buffers which do. IFileOutputStream wrap can be removed to save 4 
> bytes per merged buffers.
> -InMemoryWriter does not account for two EOF_MARKERs written at close() in 
> its accounting so that the getRawLength() method is off by two bytes.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)