[jira] [Created] (TEZ-2457) Improve Documentation to explicitly list all valid Tez configuration variables

2015-05-18 Thread Hari Sekhon (JIRA)
Hari Sekhon created TEZ-2457:


 Summary: Improve Documentation to explicitly list all valid Tez 
configuration variables
 Key: TEZ-2457
 URL: https://issues.apache.org/jira/browse/TEZ-2457
 Project: Apache Tez
  Issue Type: Improvement
Affects Versions: 0.5.2
 Environment: HDP 2.2
Reporter: Hari Sekhon


Request to improve Tez documentation by adding a page showing all valid Tez 
configuration variables with their defaults and description as well as which 
MapReduce variables Tez respects.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Success: TEZ-2244 PreCommit Build #688

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2244
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/688/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 2840 lines...]
[INFO] Final Memory: 71M/932M
[INFO] 




{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733447/TEZ-2244.1.patch
  against master revision cea05e3.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/688//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/688//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
3527164ef466e674c8342bcd28f5b6749ab2dc97 logged out


==
==
Finished build.
==
==


Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #687
Archived 44 artifacts
Archive block size is 32768
Received 4 blocks and 2643370 bytes
Compression is 4.7%
Took 2.1 sec
Description set: TEZ-2244
Recording test results
Email was triggered for: Success
Sending email for trigger: Success



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-2244) PipelinedSorter: Progressive allocation for sort-buffers

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14547577#comment-14547577
 ] 

TezQA commented on TEZ-2244:


{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733447/TEZ-2244.1.patch
  against master revision cea05e3.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/688//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/688//console

This message is automatically generated.

 PipelinedSorter: Progressive allocation for sort-buffers
 

 Key: TEZ-2244
 URL: https://issues.apache.org/jira/browse/TEZ-2244
 Project: Apache Tez
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Gopal V
Assignee: Rajesh Balamohan
 Attachments: TEZ-2244.1.patch, TEZ-2244.WIP.patch


 Currently, the sort buffers are allocated pessimistically for all tasks so 
 that the largest task's spill stays within memory.
 The chained buffer implementation inside PipelinedSorter opens up the 
 possibility of allocating only the first chunk of the sort buffer when the 
 sorter starts up.
 This allows tasks which do not heavily use the sort buffer (like a grouping 
 aggregation) to use the sort-space only when the map-aggregation turns itself 
 off.
 Not reserving memory on startup hurts the worst-case scenario for the 
 pipelined sorter, but improves the average case.
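
 The idea of allocating only the first chunk at startup and growing on demand 
 could be sketched roughly as below. This is a minimal illustration, not the 
 code from the attached patch: the class name ProgressiveSortBuffer, the 
 chunkSize/maxChunks parameters and the write() contract are all assumptions 
 made for the example.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; names and sizes are assumptions, not PipelinedSorter internals.
final class ProgressiveSortBuffer {
    private final int chunkSize;   // bytes per chunk
    private final int maxChunks;   // upper bound, i.e. the old pessimistic allocation
    private final List<ByteBuffer> chunks = new ArrayList<>();

    ProgressiveSortBuffer(int chunkSize, int maxChunks) {
        if (chunkSize <= 0 || maxChunks <= 0) {
            throw new IllegalArgumentException("chunkSize and maxChunks must be positive");
        }
        this.chunkSize = chunkSize;
        this.maxChunks = maxChunks;
        // Only the first chunk is allocated when the sorter starts up.
        chunks.add(ByteBuffer.allocate(chunkSize));
    }

    /** Appends a record; returns false when the budget is used up and the caller should spill. */
    boolean write(byte[] record) {
        if (record.length > chunkSize) {
            throw new IllegalArgumentException("record larger than a chunk");
        }
        ByteBuffer current = chunks.get(chunks.size() - 1);
        if (current.remaining() < record.length) {
            if (chunks.size() == maxChunks) {
                return false; // no more memory may be claimed
            }
            // Claim the next chunk only now that it is actually needed.
            current = ByteBuffer.allocate(chunkSize);
            chunks.add(current);
        }
        current.put(record);
        return true;
    }

    int allocatedBytes() {
        return chunks.size() * chunkSize;
    }
}
```

 A task that writes little never grows past the first chunk, which is the 
 average-case gain; a task that fills every chunk pays for the allocations 
 late, which is the worst-case cost mentioned above.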





[jira] [Commented] (TEZ-2457) Improve Documentation to explicitly list all valid Tez configuration variables

2015-05-18 Thread Hitesh Shah (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14548472#comment-14548472
 ] 

Hitesh Shah commented on TEZ-2457:
--

TEZ-2294 might be a good starting point for you to look at. Its configs were 
generated close to the 0.7 release, but most should be applicable to your 
query.

For all general purposes, Tez does *not* support any MapReduce variables. 
However, there are some helper functions that Hive/Pig use to convert MR 
configs to Tez configs. Good point on the MR configs though - I will look to 
see if there is a way to document this mapping more clearly. 
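
To illustrate what such a helper does, here is a minimal sketch of translating 
MR keys to Tez keys. It is not the actual helper code used by Hive/Pig: the 
class MrToTezKeyTranslator is hypothetical, plain maps stand in for Hadoop 
Configuration objects, and the two key pairs are illustrative and should be 
checked against the generated config list before relying on them.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch; not the helper functions referred to in the comment above.
final class MrToTezKeyTranslator {
    // Illustrative pairs only; verify against the real Tez configuration list.
    private static final Map<String, String> MR_TO_TEZ = new LinkedHashMap<>();
    static {
        MR_TO_TEZ.put("mapreduce.task.io.sort.mb", "tez.runtime.io.sort.mb");
        MR_TO_TEZ.put("mapreduce.task.io.sort.factor", "tez.runtime.io.sort.factor");
    }

    /** Returns a copy of conf with Tez keys filled in from their MR equivalents. */
    static Map<String, String> translate(Map<String, String> conf) {
        Map<String, String> out = new LinkedHashMap<>(conf);
        for (Map.Entry<String, String> pair : MR_TO_TEZ.entrySet()) {
            String mrValue = conf.get(pair.getKey());
            // An explicitly set Tez key wins over the translated MR value.
            if (mrValue != null && !conf.containsKey(pair.getValue())) {
                out.put(pair.getValue(), mrValue);
            }
        }
        return out;
    }
}
```

Keys without an entry in the mapping pass through untouched and have no effect 
on Tez, which matches the statement that Tez itself does not read MR variables.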

 Improve Documentation to explicitly list all valid Tez configuration variables
 --

 Key: TEZ-2457
 URL: https://issues.apache.org/jira/browse/TEZ-2457
 Project: Apache Tez
  Issue Type: Improvement
Affects Versions: 0.5.2
 Environment: HDP 2.2
Reporter: Hari Sekhon

 Request to improve Tez documentation by adding a page showing all valid Tez 
 configuration variables with their defaults and description as well as which 
 MapReduce variables Tez respects.





[jira] [Created] (TEZ-2458) Tez UI: killed vertices display diagnostics 'killed/failed due to:null'

2015-05-18 Thread Prakash Ramachandran (JIRA)
Prakash Ramachandran created TEZ-2458:
-

 Summary: Tez UI: killed vertices display diagnostics 
'killed/failed due to:null'
 Key: TEZ-2458
 URL: https://issues.apache.org/jira/browse/TEZ-2458
 Project: Apache Tez
  Issue Type: Bug
  Components: UI
Reporter: Prakash Ramachandran
Assignee: Prakash Ramachandran


When you run a hive query and kill it in the middle of running it with cmd+c, 
you will see 'Vertex killed due to user-initiated job kill.' in the console.
However, when you go to Tez UI, select that dag name, and choose Map/Reducer 
vertex under View, Diagnostics shows 'killed/failed due to:null' as shown in 
the attached screenshot.

reported by [~taksaito] 





[jira] [Updated] (TEZ-2458) Tez UI: killed vertices display diagnostics 'killed/failed due to:null'

2015-05-18 Thread Prakash Ramachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prakash Ramachandran updated TEZ-2458:
--
Attachment: TEZ-2458.1.patch

 Tez UI: killed vertices display diagnostics 'killed/failed due to:null'
 ---

 Key: TEZ-2458
 URL: https://issues.apache.org/jira/browse/TEZ-2458
 Project: Apache Tez
  Issue Type: Bug
  Components: UI
Reporter: Prakash Ramachandran
Assignee: Prakash Ramachandran
 Attachments: Screen Shot 2015-05-19 at 12.26.01 AM.png, 
 TEZ-2458.1.patch


 When you run a hive query and kill it in the middle of running it with 
 cmd+c, you will see 'Vertex killed due to user-initiated job kill.' in the 
 console.
 However, when you go to Tez UI, select that dag name, and choose Map/Reducer 
 vertex under View, Diagnostics shows 'killed/failed due to:null' as shown in 
 the attached screenshot.
 reported by [~taksaito] 





Failed: TEZ-2458 PreCommit Build #689

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2458
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/689/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 2867 lines...]



{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733600/TEZ-2458.1.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/689//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/689//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
106b5773b84bcc6b3736c920ddb76d4818cc1c0a logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #688
Archived 44 artifacts
Archive block size is 32768
Received 4 blocks and 2677547 bytes
Compression is 4.7%
Took 1.3 sec
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-05-18 Thread Hitesh Shah (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549387#comment-14549387
 ] 

Hitesh Shah commented on TEZ-2076:
--

Minor comments: 

{code}
String help = LINE_SEPARATOR
    + "java -jar tez-history-parser-x.y.z-jar-with-dependencies.jar"
    + LINE_SEPARATOR
    + "OR"
    + LINE_SEPARATOR
    + "java -cp tez-history-parser-x.y.z-jar-with-dependencies.jar org.apache.tez.history.ATSImportTool"
    + LINE_SEPARATOR
    + "OR"
    + LINE_SEPARATOR
    + "HADOOP_CLASSPATH=$TEZ_HOME/*:$TEZ_HOME/lib/*:$HADOOP_CLASSPATH hadoop jar "
    + "tez-history-parser-x.y.z.jar " + ATSImportTool.class.getName()
    + LINE_SEPARATOR;
formatter.printHelp(240, help, "Options",
{code}
  - Not sure why there are repetitive options ( with/without class name ) - can 
this be reduced to 2 i.e. one via java -cp and the other via hadoop jar? 

Also, [~gopalv] raised this point offline too. There needs to be a way to warn 
the user if the import tool is used on an in-progress dag - maybe this tool 
should just error out for an in-progress dag and only work if there is a 
special --allow-incomplete-data or similar flag set. Could be done in a 
follow-up jira. \cc [~pramachandran] as the same is applicable to the UI 
download tool. 
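
A rough sketch of the suggested guard, assuming the tool can already tell 
whether the dag is still running. The class InProgressDagGuard and its check() 
method are hypothetical; the flag name is only the one floated above and is not 
an existing option.

```java
import java.util.List;

// Hypothetical sketch; neither the class nor the flag exists in the import tool.
final class InProgressDagGuard {
    static final String ALLOW_FLAG = "--allow-incomplete-data";

    /** Throws if the dag is still in progress and the user has not opted in to partial data. */
    static void check(boolean dagInProgress, List<String> args) {
        if (dagInProgress && !args.contains(ALLOW_FLAG)) {
            throw new IllegalStateException(
                "DAG is still in progress; pass " + ALLOW_FLAG + " to import partial data");
        }
    }
}
```

Erroring out by default keeps an incomplete download from being mistaken for a 
full one, while the flag leaves a way to inspect a running dag on purpose.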





 Tez framework to extract/analyze data stored in ATS for specific dag
 

 Key: TEZ-2076
 URL: https://issues.apache.org/jira/browse/TEZ-2076
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
 Attachments: TEZ-2076.1.patch, TEZ-2076.10.patch, TEZ-2076.11.patch, 
 TEZ-2076.12.patch, TEZ-2076.13.patch, TEZ-2076.14.patch, TEZ-2076.15.patch, 
 TEZ-2076.2.patch, TEZ-2076.3.patch, TEZ-2076.4.patch, TEZ-2076.5.patch, 
 TEZ-2076.6.patch, TEZ-2076.7.patch, TEZ-2076.8.patch, TEZ-2076.9.patch, 
 TEZ-2076.WIP.2.patch, TEZ-2076.WIP.3.patch, TEZ-2076.WIP.patch


 - Users should be able to download ATS data pertaining to a DAG from Tez-UI 
 (more like a zip file containing DAG/Vertex/Task/TaskAttempt info).
 - This can be plugged into an analyzer which parses the data, adds semantics 
 and provides an in-memory representation for further analysis.
 - This will make it possible to write different analyzer rules, which can be 
 run on top of this in-memory representation to come up with an analysis of 
 the DAG.
 - Results of these analyzer rules can be rendered in a UI (standalone webapp) 
 at a later point in time.





[jira] [Commented] (TEZ-2409) Allow different edges to have different routing plugins

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549273#comment-14549273
 ] 

TezQA commented on TEZ-2409:


{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733617/TEZ-2409.1.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 5 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/691//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/691//console

This message is automatically generated.

 Allow different edges to have different routing plugins
 ---

 Key: TEZ-2409
 URL: https://issues.apache.org/jira/browse/TEZ-2409
 Project: Apache Tez
  Issue Type: Task
Reporter: Bikas Saha
Assignee: Bikas Saha
Priority: Critical
 Attachments: TEZ-2409.1.patch


 It may be useful to allow different edge manager plugin types based on 
 different requirements. In order to support this, we would need to support 
 different plugins per edge for routing the events on that edge. A motivating 
 scenario is when a custom plugin from an older release of a downstream 
 project is using older APIs while the latest release of that project has 
 moved on to newer APIs. This would allow both old and new releases to work 
 with the latest Tez framework as optimally as possible.
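
 One way to picture per-edge routing plugins is a registry keyed by edge, with 
 an adapter for plugins written against an older API. The sketch below is 
 only an illustration: EdgeRouter, LegacyEdgePlugin and PerEdgeRouting are 
 hypothetical names, and the method signatures are not the Tez edge manager 
 plugin API.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical newer-style routing API: one destination per call.
interface EdgeRouter {
    int destinationFor(int sourceTask, int numDestinations);
}

// Hypothetical older-style API: returns all destinations in one array.
interface LegacyEdgePlugin {
    int[] routeAll(int sourceTask, int numDestinations);
}

final class PerEdgeRouting {
    private final Map<String, EdgeRouter> routersByEdge = new HashMap<>();

    void register(String edgeId, EdgeRouter router) {
        routersByEdge.put(edgeId, router);
    }

    /** Wraps an older plugin so it can sit next to newer ones on a different edge. */
    void registerLegacy(String edgeId, LegacyEdgePlugin plugin) {
        routersByEdge.put(edgeId, (src, n) -> plugin.routeAll(src, n)[0]);
    }

    int route(String edgeId, int sourceTask, int numDestinations) {
        EdgeRouter router = routersByEdge.get(edgeId);
        if (router == null) {
            throw new IllegalArgumentException("no routing plugin for edge " + edgeId);
        }
        return router.destinationFor(sourceTask, numDestinations);
    }
}
```

 Because the lookup is per edge, one DAG can mix edges served by old-API and 
 new-API plugins without either having to change.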





[jira] [Comment Edited] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-05-18 Thread Hitesh Shah (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549441#comment-14549441
 ] 

Hitesh Shah edited comment on TEZ-2076 at 5/18/15 10:51 PM:


+1 once above usage help and related docs are fixed. 


was (Author: hitesh):
+1 once above usage docs are fixed. 

 Tez framework to extract/analyze data stored in ATS for specific dag
 

 Key: TEZ-2076
 URL: https://issues.apache.org/jira/browse/TEZ-2076
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
 Attachments: TEZ-2076.1.patch, TEZ-2076.10.patch, TEZ-2076.11.patch, 
 TEZ-2076.12.patch, TEZ-2076.13.patch, TEZ-2076.14.patch, TEZ-2076.15.patch, 
 TEZ-2076.2.patch, TEZ-2076.3.patch, TEZ-2076.4.patch, TEZ-2076.5.patch, 
 TEZ-2076.6.patch, TEZ-2076.7.patch, TEZ-2076.8.patch, TEZ-2076.9.patch, 
 TEZ-2076.WIP.2.patch, TEZ-2076.WIP.3.patch, TEZ-2076.WIP.patch


 - Users should be able to download ATS data pertaining to a DAG from Tez-UI 
 (more like a zip file containing DAG/Vertex/Task/TaskAttempt info).
 - This can be plugged into an analyzer which parses the data, adds semantics 
 and provides an in-memory representation for further analysis.
 - This will make it possible to write different analyzer rules, which can be 
 run on top of this in-memory representation to come up with an analysis of 
 the DAG.
 - Results of these analyzer rules can be rendered in a UI (standalone webapp) 
 at a later point in time.





[jira] [Commented] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-18 Thread Hitesh Shah (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549198#comment-14549198
 ] 

Hitesh Shah commented on TEZ-2456:
--

bq. DAGFinishedEvent may be logged multiple times. (DAG moves from SUCCEEDED 
to ERROR? Should we ignore this?)

No - as this affects final state. 

bq. VertexFinishedEvent should be logged before DAGFinishedEvent

There should never be cases where a vertex can change state after a dag has 
finished. We should make sure that the state machine ensures that this scenario 
can never occur. \cc [~bikassaha]. 

bq. RootInputDataInformation must be logged before VertexInitializedEvent

This depends on what criteria we choose to detect whether the root input 
initializer has completed running. If that flag is the vertex init event, that 
is fine. 

bq. DataMovement must be logged before TaskFinishedEvent

Rephrase this as: all events generated from all task attempts of a given task 
should be logged before the task finished event. This becomes tricky for 
retrospective failures. We need to look at what happens when we have events 
followed by task finished, then a task re-run, more events and a final 
finished event. There could be crashes at any stage in the list.

bq. TaskFinishedEvent must be logged before VertexFinishedEvent

Task re-runs? Retrospective failures? 

bq. VertexParallelismUpdatedEvent must be logged before TaskStartedEvent

This depends on the state machine. If the framework supports it, then recovery 
also should. 

bq. TaskAttemptFinishedEvent should be logged before TaskFinishedEvent

Retrospective failures? 
Also, nothing in the list relates to speculative attempts. How should those be 
handled?

bq. should only be logged once. Should not log again when it’s recovered.

There are multiple places where this is called out. Is there an issue if it 
gets logged twice? Will something break? Should there be checks to ensure it is 
logged only once or can the recovery handle it if the event is logged twice? 
What kind of problems do you see if it happens twice? 






 Refactor recovery event logging to ensure it meet the recovery event spec
 -

 Key: TEZ-2456
 URL: https://issues.apache.org/jira/browse/TEZ-2456
 Project: Apache Tez
  Issue Type: Sub-task
Reporter: Jeff Zhang
Assignee: Jeff Zhang

 Currently we don't have a spec for the recovery event logging, so recovery 
 is fragile to code changes. This jira tries to define the spec and refactor 
 the recovery event logging to ensure it meets the spec. [~hitesh] Please help 
 review the following spec I drafted.
 *DAG*
 * DAGSubmitted/DAGInitializedEvent/DAGStartedEvent must be logged once. 
 It should not be logged again when the DAG is recovered.
 * DAGFinishedEvent may be logged multiple times. (DAG moves from SUCCEEDED 
 to ERROR? Should we ignore this?)
 * VertexFinishedEvent should be logged before DAGFinishedEvent
 *Vertex* 
 * RootInputDataInformation must be logged before VertexInitializedEvent
 * DataMovement must be logged before TaskFinishedEvent
 * TaskFinishedEvent must be logged before VertexFinishedEvent
 * VertexInitializedEvent / VertexStartedEvent should only be logged once, 
 and should not be logged again when the vertex is recovered.
 * VertexFinishedEvent may be logged multiple times (e.g. Vertex moves from 
 SUCCEEDED to FAILED).
 * VertexParallelismUpdatedEvent must be logged before TaskStartedEvent
 * TaskFinishedEvent should be logged before VertexFinishedEvent
 *Task*
 * If there’s no TaskStartedEvent, TaskFinishedEvent may still be logged (e.g. 
 Task is killed in NEW). The current behavior is that TaskFinishedEvent won’t 
 be logged if there’s no TaskStartedEvent. 
 * TaskStartedEvent should only be logged once. It should not be logged again 
 when the task is recovered.
 * TaskFinishedEvent may be logged multiple times (e.g. Task moves from 
 SUCCEEDED to FAILED).
 * TaskAttemptFinishedEvent should be logged before TaskFinishedEvent
   
 *TaskAttempt*
 * If there’s no TaskAttemptStartedEvent, TaskAttemptFinishedEvent may still 
 be logged (e.g. TaskAttempt is killed in NEW). The current behavior is that 
 TaskAttemptFinishedEvent won’t be logged if there’s no 
 TaskAttemptStartedEvent.
 * TaskAttemptStartedEvent should only be logged once. It should not be logged 
 again when the attempt is recovered.
 * TaskAttemptFinishedEvent may be logged multiple times (e.g. TaskAttempt 
 moves from SUCCEEDED to FAILED).
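
 Rules of the "logged once" and "logged before" kind could be checked 
 mechanically against a recovery log. The sketch below is a simplified 
 illustration, not Tez recovery code: RecoveryLogRules is a hypothetical 
 class, events are reduced to their type names for a single entity, and 
 "before" is read as "the first X precedes the first Y", which is an 
 assumption that ignores re-runs and retrospective failures.

```java
import java.util.Collections;
import java.util.List;

// Hypothetical sketch; events are plain type names in the order they were logged.
final class RecoveryLogRules {
    /** "Should only be logged once": the type appears zero or one time. */
    static boolean loggedAtMostOnce(List<String> log, String type) {
        return Collections.frequency(log, type) <= 1;
    }

    /**
     * "X must be logged before Y": holds if Y is absent, or if the first X
     * appears ahead of the first Y. A Y with no earlier X is a violation.
     */
    static boolean loggedBefore(List<String> log, String earlier, String later) {
        int firstLater = log.indexOf(later);
        if (firstLater < 0) {
            return true;
        }
        int firstEarlier = log.indexOf(earlier);
        return firstEarlier >= 0 && firstEarlier < firstLater;
    }
}
```

 For example, a task log of TaskStartedEvent, TaskAttemptFinishedEvent, 
 TaskFinishedEvent, TaskFinishedEvent passes 
 loggedBefore(log, "TaskAttemptFinishedEvent", "TaskFinishedEvent") and 
 loggedAtMostOnce(log, "TaskStartedEvent"), while the repeated 
 TaskFinishedEvent is allowed by the spec and so is not checked for 
 uniqueness.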





[jira] [Commented] (TEZ-2458) Tez UI: killed vertices display diagnostics 'killed/failed due to:null'

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549173#comment-14549173
 ] 

TezQA commented on TEZ-2458:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733600/TEZ-2458.1.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/689//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/689//console

This message is automatically generated.

 Tez UI: killed vertices display diagnostics 'killed/failed due to:null'
 ---

 Key: TEZ-2458
 URL: https://issues.apache.org/jira/browse/TEZ-2458
 Project: Apache Tez
  Issue Type: Bug
  Components: UI
Reporter: Prakash Ramachandran
Assignee: Prakash Ramachandran
 Attachments: Screen Shot 2015-05-19 at 12.26.01 AM.png, 
 TEZ-2458.1.patch


 When you run a hive query and kill it in the middle of running it with 
 cmd+c, you will see 'Vertex killed due to user-initiated job kill.' in the 
 console.
 However, when you go to Tez UI, select that dag name, and choose Map/Reducer 
 vertex under View, Diagnostics shows 'killed/failed due to:null' as shown in 
 the attached screenshot.
 reported by [~taksaito] 





[jira] [Updated] (TEZ-2409) Allow different edges to have different routing plugins

2015-05-18 Thread Bikas Saha (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bikas Saha updated TEZ-2409:

Attachment: TEZ-2409.1.patch

 Allow different edges to have different routing plugins
 ---

 Key: TEZ-2409
 URL: https://issues.apache.org/jira/browse/TEZ-2409
 Project: Apache Tez
  Issue Type: Task
Reporter: Bikas Saha
Assignee: Bikas Saha
Priority: Critical
 Attachments: TEZ-2409.1.patch


 It may be useful to allow different edge manager plugin types based on 
 different requirements. In order to support this, we would need to support 
 different plugins per edge for routing the events on that edge. A motivating 
 scenario is when a custom plugin from an older release of a downstream 
 project is using older APIs while the latest release of that project has 
 moved on to newer APIs. This would allow both old and new releases to work 
 with the latest Tez framework as optimally as possible.





[jira] [Updated] (TEZ-2409) Allow different edges to have different routing plugins

2015-05-18 Thread Bikas Saha (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bikas Saha updated TEZ-2409:

Attachment: (was: TEZ-2409.1.patch)

 Allow different edges to have different routing plugins
 ---

 Key: TEZ-2409
 URL: https://issues.apache.org/jira/browse/TEZ-2409
 Project: Apache Tez
  Issue Type: Task
Reporter: Bikas Saha
Assignee: Bikas Saha
Priority: Critical

 It may be useful to allow different edge manager plugin types based on 
 different requirements. In order to support this, we would need to support 
 different plugins per edge for routing the events on that edge. A motivating 
 scenario is when a custom plugin from an older release of a downstream 
 project is using older APIs while the latest release of that project has 
 moved on to newer APIs. This would allow both old and new releases to work 
 with the latest Tez framework as optimally as possible.





[jira] [Updated] (TEZ-2409) Allow different edges to have different routing plugins

2015-05-18 Thread Bikas Saha (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bikas Saha updated TEZ-2409:

Attachment: TEZ-2409.1.patch

 Allow different edges to have different routing plugins
 ---

 Key: TEZ-2409
 URL: https://issues.apache.org/jira/browse/TEZ-2409
 Project: Apache Tez
  Issue Type: Task
Reporter: Bikas Saha
Assignee: Bikas Saha
Priority: Critical
 Attachments: TEZ-2409.1.patch


 It may be useful to allow different edge manager plugin types based on 
 different requirements. In order to support this, we would need to support 
 different plugins per edge for routing the events on that edge. A motivating 
 scenario is when a custom plugin from an older release of a downstream 
 project is using older APIs while the latest release of that project has 
 moved on to newer APIs. This would allow both old and new releases to work 
 with the latest Tez framework as optimally as possible.





[jira] [Commented] (TEZ-2454) Change FetcherOrderedGroup to work as Callables instead of blocking threads

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549248#comment-14549248
 ] 

TezQA commented on TEZ-2454:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733611/TEZ-2454.3.txt
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/690//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TEZ-Build/690//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/690//console

This message is automatically generated.

 Change FetcherOrderedGroup to work as Callables instead of blocking threads
 ---

 Key: TEZ-2454
 URL: https://issues.apache.org/jira/browse/TEZ-2454
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Attachments: TEZ-2454.1.wip.txt, TEZ-2454.2.txt, TEZ-2454.3.txt


 The Fetcher threads for Ordered Input currently run and block till merge 
 completes, which makes it difficult to use them via ThreadPools.
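
 The difference can be sketched as follows: a fetch unit that does its work 
 and returns can be handed to any thread pool, whereas a thread that blocks 
 until merge completes cannot. This is a minimal illustration with 
 hypothetical names (FetchTask, FetchScheduler); the actual fetch is replaced 
 by a placeholder that just reports how many outputs it was asked to fetch.

```java
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch; not the FetcherOrderedGrouped code.
final class FetchTask implements Callable<Integer> {
    private final String host;
    private final int pendingOutputs;

    FetchTask(String host, int pendingOutputs) {
        this.host = host;
        this.pendingOutputs = pendingOutputs;
    }

    @Override
    public Integer call() {
        // Placeholder for fetching from 'host'; returns as soon as its own work is done.
        return pendingOutputs;
    }
}

final class FetchScheduler {
    /** Runs all fetch tasks on a shared pool and returns the total outputs fetched. */
    static int fetchAll(List<FetchTask> tasks, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            int total = 0;
            for (Future<Integer> result : pool.invokeAll(tasks)) {
                total += result.get();
            }
            return total;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while fetching", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("fetch failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }
}
```

 Since each task finishes on its own, the pool size no longer has to match 
 the number of fetchers, and the same pool could be shared with other work.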





Success: TEZ-2409 PreCommit Build #691

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2409
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/691/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 2881 lines...]
[INFO] Final Memory: 71M/925M
[INFO] 




{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733617/TEZ-2409.1.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 5 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/691//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/691//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
37a8963f2705e32e7444048be50f016557b4b759 logged out


==
==
Finished build.
==
==


Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #688
Archived 44 artifacts
Archive block size is 32768
Received 4 blocks and 2679576 bytes
Compression is 4.7%
Took 1 sec
Description set: TEZ-2409
Recording test results
Email was triggered for: Success
Sending email for trigger: Success



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Assigned] (TEZ-2461) tez-history-parser compile fails with hadoop-2.4

2015-05-18 Thread Rajesh Balamohan (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajesh Balamohan reassigned TEZ-2461:
-

Assignee: Rajesh Balamohan

 tez-history-parser compile fails with hadoop-2.4
 

 Key: TEZ-2461
 URL: https://issues.apache.org/jira/browse/TEZ-2461
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Rajesh Balamohan
 Attachments: TEZ-2461.1.patch


 https://builds.apache.org/job/Tez-Build-Hadoop-2.4/98/console
 \cc [~rajesh.balamohan]





[jira] [Commented] (TEZ-2460) Temporary solution for issue due to YARN-2560

2015-05-18 Thread Jeff Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549723#comment-14549723
 ] 

Jeff Zhang commented on TEZ-2460:
-

[~pramachandran] [~bikassaha] [~hitesh] Please help review it. (No test is 
included; the version-incompatible case was verified manually.)

 Temporary solution for issue due to YARN-2560
 -

 Key: TEZ-2460
 URL: https://issues.apache.org/jira/browse/TEZ-2460
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Jeff Zhang
 Attachments: TEZ-2460-1.patch


 Due to YARN-2560, DAGClient can't get the correct diagnostics. This jira 
 provides a temporary solution until YARN-2560 is resolved.  Here's one 
 example in which the DAG fails because of incompatible versions.
 {noformat}
 15/05/18 17:34:18 INFO client.TezClient: Tez Client Version: [ 
 component=tez-api, version=0.7.0.2.3.0.0-1986, 
 revision=7a048b526519e53de5d6464493697e92b15718fc, 
 SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
 buildTime=20150514-0857 ]
 15/05/18 17:34:19 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:19 INFO client.RMProxy: Connecting to ResourceManager at 
 c6402.ambari.apache.org/192.168.64.102:8050
 15/05/18 17:34:19 INFO client.TezClient: Using 
 org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager to manage 
 Timeline ACLs
 15/05/18 17:34:19 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:21 INFO examples.OrderedWordCount: Running OrderedWordCount
 15/05/18 17:34:21 INFO client.TezClient: Submitting DAG application with id: 
 application_1431967480765_0017
 15/05/18 17:34:21 INFO client.TezClientUtils: Using tez.lib.uris value from 
 configuration: /hdp/apps/2.3.0.0-1986/tez/tez.tar.gz
 15/05/18 17:34:21 INFO client.TezClient: Tez system stage directory 
 hdfs://c1ha/tmp/ambari-qa/staging/.tez/application_1431967480765_0017 doesn't 
 exist and is created
 15/05/18 17:34:21 INFO acls.ATSHistoryACLPolicyManager: Created Timeline 
 Domain for History ACLs, domainId=Tez_ATS_application_1431967480765_0017
 15/05/18 17:34:22 INFO client.TezClient: Submitting DAG to YARN, 
 applicationId=application_1431967480765_0017, dagName=OrderedWordCount
 15/05/18 17:34:22 INFO impl.YarnClientImpl: Submitted application 
 application_1431967480765_0017
 15/05/18 17:34:22 INFO client.TezClient: The url to track the Tez AM: 
 http://c6402.ambari.apache.org:8088/proxy/application_1431967480765_0017/
 15/05/18 17:34:22 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:22 INFO client.RMProxy: Connecting to ResourceManager at 
 c6402.ambari.apache.org/192.168.64.102:8050
 15/05/18 17:34:22 INFO client.DAGClientImpl: Waiting for DAG to start running
 15/05/18 17:34:34 INFO client.DAGClientImpl: DAG initialized: 
 CurrentState=Running
 15/05/18 17:34:38 INFO client.DAGClientImpl: DAG completed. FinalState=FAILED
 15/05/18 17:34:38 INFO examples.OrderedWordCount: DAG diagnostics: []
 {noformat}
 {noformat}
 2015-05-18 16:57:46,807 INFO [main] app.DAGAppMaster: Created DAGAppMaster 
 for application appattempt_1431967480765_0008_01, versionInfo=[ 
 component=tez-dag, version=0.5.2.2.2.3.0-2611, 
 revision=2d3c6b639d5b1048bd20aad5736823a35edd2485, 
 SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
 buildTIme=20150314-1805 ]
 2015-05-18 16:57:47,931 INFO [main] app.DAGAppMaster: Comparing client 
 version with AM version, clientVersion=0.7.0.2.3.0.0-1986, 
 AMVersion=0.5.2.2.2.3.0-2611
 2015-05-18 16:57:47,932 FATAL [main] app.DAGAppMaster: Incompatible versions 
 found, clientVersion=0.7.0.2.3.0.0-1986, AMVersion=0.5.2.2.2.3.0-2611
 2015-05-18 16:57:49,348 INFO [main] app.DAGAppMaster: Adding session token to 
 jobTokenSecretManager for application
 2015-05-18 16:57:49,354 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.rm.container.AMContainerEventType for class 
 org.apache.tez.dag.app.rm.container.AMContainerMap
 2015-05-18 16:57:49,355 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.rm.node.AMNodeEventType for class 
 org.apache.tez.dag.app.rm.node.AMNodeTracker
 2015-05-18 16:57:49,356 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.DAGAppMasterEventType for class 
 org.apache.tez.dag.app.DAGAppMaster$DAGAppMasterEventHandler
 2015-05-18 16:57:49,356 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.DAGEventType for class 
 org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher
 2015-05-18 16:57:49,357 INFO [main] event.AsyncDispatcher: Registering class 
 

[jira] [Issue Comment Deleted] (TEZ-2461) tez-history-parser compile fails with hadoop-2.4

2015-05-18 Thread Hitesh Shah (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hitesh Shah updated TEZ-2461:
-
Comment: was deleted

(was: {code}
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.1:compile (default-compile) on 
project tez-history-parser: Compilation failure: Compilation failure:
[ERROR] 
https://builds.apache.org/job/Tez-Build-Hadoop-2.4/ws/tez-plugins/tez-history-parser/src/main/java/org/apache/tez/history/ATSImportTool.java:[44,37]
 package org.apache.directory.api.util does not exist
[ERROR] 
https://builds.apache.org/job/Tez-Build-Hadoop-2.4/ws/tez-plugins/tez-history-parser/src/main/java/org/apache/tez/history/parser/utils/Utils.java:[21,37]
 package org.apache.directory.api.util does not exist
{code}

Saw the directory api issue too. )

 tez-history-parser compile fails with hadoop-2.4
 

 Key: TEZ-2461
 URL: https://issues.apache.org/jira/browse/TEZ-2461
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Rajesh Balamohan
 Fix For: 0.8.0

 Attachments: TEZ-2461.1.patch


 https://builds.apache.org/job/Tez-Build-Hadoop-2.4/98/console
 \cc [~rajesh.balamohan]





[jira] [Commented] (TEZ-2454) Change FetcherOrderedGroup to work as Callables instead of blocking threads

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549715#comment-14549715
 ] 

TezQA commented on TEZ-2454:


{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733686/TEZ-2454.4.txt
  against master revision 11aa17e.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/694//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/694//console

This message is automatically generated.

 Change FetcherOrderedGroup to work as Callables instead of blocking threads
 ---

 Key: TEZ-2454
 URL: https://issues.apache.org/jira/browse/TEZ-2454
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Attachments: TEZ-2454.1.wip.txt, TEZ-2454.2.txt, TEZ-2454.3.txt, 
 TEZ-2454.4.txt


 The Fetcher threads for Ordered Input currently run and block till merge 
 completes, which makes it difficult to use them via ThreadPools.
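
 As a rough illustration of the change described above (not the contents of the TEZ-2454 patch), the sketch below models each fetch as a short Callable that returns when its own work is done, so a fixed-size pool can schedule many of them. The class CallableFetchSketch, the methods fetchTask and fetchAll, and the idea of returning a segment count are all hypothetical names chosen for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch, not Tez code: fetch work expressed as Callables.
final class CallableFetchSketch {

    /** One unit of fetch work for a host; it returns instead of blocking until merge completes. */
    static Callable<Integer> fetchTask(String host, int segments) {
        // Stand-in for a real fetch from `host`: just report how many segments were "fetched".
        return () -> segments;
    }

    /** Submits one Callable per host to a fixed-size pool and sums the results. */
    static int fetchAll(List<String> hosts, int segmentsPerHost, int poolSize) {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        try {
            List<Future<Integer>> futures = new ArrayList<>();
            for (String host : hosts) {
                futures.add(pool.submit(fetchTask(host, segmentsPerHost)));
            }
            int total = 0;
            for (Future<Integer> f : futures) {
                total += f.get(); // waits only for this task, not for the whole merge
            }
            return total;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
        }
    }
}
```

 Because each task finishes on its own, the pool can be smaller than the number of hosts, which is not possible when every worker thread blocks until the merge is done.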





Failed: TEZ-2460 PreCommit Build #695

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2460
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/695/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 3008 lines...]



{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733689/TEZ-2460-1.patch
  against master revision 11aa17e.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/695//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/695//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
97aedef2d25aed1edfc413165d616fc6077e6f09 logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #694
Archived 47 artifacts
Archive block size is 32768
Received 6 blocks and 2691232 bytes
Compression is 6.8%
Took 0.81 sec
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-18 Thread Jeff Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549781#comment-14549781
 ] 

Jeff Zhang commented on TEZ-2456:
-

[~hitesh] Thanks for the review. 

bq. TaskFinishedEvent must be logged before VertexFinishedEvent   ( 
Retrospective failures? )
Rephrase it to:   For VertexFinishedEvent (SUCCEEDED), there must be at least n 
TaskFinishedEvent (SUCCEEDED)

bq. TaskAttemptFinishedEvent should be logged before TaskFinishedEvent ( 
Retrospective failures? )
Rephrase it to:  For TaskFinishedEvent (SUCCEEDED), there must be at least one 
TaskAttemptFinishedEvent (SUCCEEDED)

bq. Also, nothing in the list related to speculated attempts. How should those 
be handled?
After TEZ-2249, all the task attempts should be finished before the task is 
finished. 

bq. There are multiple places where this is called out. Is there an issue if it 
gets logged twice? Will something break? Should there be checks to ensure it is 
logged only once or can the recovery handle it if the event is logged twice? 
What kind of problems do you see if it happens twice? 
Logging the same event multiple times causes two issues.
* The metrics will be incorrect, especially start_time and finished_time.
* If the AM is killed again, the next recovery will handle the same recovery 
event multiple times, which may cause problems. I think the 
restoreFromEvent method assumes every event is logged once. 
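
One way to picture the "logged only once" check raised in the review is a small guard in front of the recovery log. This is a minimal sketch under stated assumptions, not Tez code: the class OnceOnlyRecoveryLog, its log method, and the choice of which event names count as once-only are all hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch, not Tez code: suppresses repeat writes of once-only events.
final class OnceOnlyRecoveryLog {
    // Event type names assumed to be once-only for this illustration.
    private static final Set<String> ONCE_ONLY = new HashSet<>(Arrays.asList(
            "TaskStartedEvent", "TaskAttemptStartedEvent"));

    private final Set<String> seen = new HashSet<>();
    private final List<String> written = new ArrayList<>();

    /** Writes the event unless it is a once-only event already written for this entity. */
    boolean log(String eventType, String entityId) {
        String key = eventType + "/" + entityId;
        if (ONCE_ONLY.contains(eventType) && !seen.add(key)) {
            return false; // duplicate of a once-only event: skip it
        }
        written.add(key);
        return true;
    }

    /** Events actually written, in order. */
    List<String> written() {
        return written;
    }
}
```

With such a guard, a second TaskStartedEvent for the same task is dropped, while event types outside the once-only set are still written every time.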



 Refactor recovery event logging to ensure it meet the recovery event spec
 -

 Key: TEZ-2456
 URL: https://issues.apache.org/jira/browse/TEZ-2456
 Project: Apache Tez
  Issue Type: Sub-task
Reporter: Jeff Zhang
Assignee: Jeff Zhang

 Currently we don't have a spec for recovery event logging, so recovery is 
 fragile to code changes. This jira tries to define the spec and refactor the 
 recovery event logging to ensure it meets the spec. [~hitesh] Please help 
 review the following spec I drafted.
 *DAG*
 * DAGSubmitted/DAGInitializedEvent/DAGStartedEvent must be logged once, and 
 should not be logged again when the DAG is recovered.
 * DAGFinishedEvent may be logged multiple times.  ( DAG moves from SUCCEEDED 
 to ERROR ? Should we ignore this ? )
 * VertexFinishedEvent should be logged before DAGFinishedEvent
 *Vertex* 
 * RootInputDataInformation must be logged before VertexInitializedEvent
 * DataMovement must be logged before TaskFinishedEvent
 * TaskFinishedEvent must be logged before VertexFinishedEvent
 * VertexInitializedEvent / VertexStartedEvent should only be logged once, and 
 should not be logged again when the vertex is recovered.
 * VertexFinishedEvent may be logged multiple times. (e.g. Vertex moves from 
 SUCCEEDED to FAILED)
 * VertexParallelismUpdatedEvent must be logged before TaskStartedEvent
 * TaskFinishedEvent should be logged before VertexFinishedEvent
 *Task*
 * If there’s no TaskStartedEvent, TaskFinishedEvent may still be logged (e.g. 
 Task is killed in NEW ).  The current behavior is that TaskFinishedEvent won’t 
 be logged if there’s no TaskStartedEvent. 
 * TaskStartedEvent should only be logged once, and should not be logged again 
 when the task is recovered.
 * TaskFinishedEvent may be logged multiple times (e.g. Task moves from 
 SUCCEEDED to FAILED)
 * TaskAttemptFinishedEvent should be logged before TaskFinishedEvent
   
 *TaskAttempt*
 * If there’s no TaskAttemptStartedEvent, TaskAttemptFinishedEvent may still 
 be logged ( e.g. TaskAttempt is killed in NEW ).  The current behavior is that 
 TaskAttemptFinishedEvent won’t be logged if there’s no TaskAttemptStartedEvent.
 * TaskAttemptStartedEvent should only be logged once, and should not be logged 
 again when the task attempt is recovered.
 * TaskAttemptFinishedEvent may be logged multiple times. (e.g. TaskAttempt 
 moves from SUCCEEDED to FAILED)
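
 The "must be logged before" rules in the spec above can be checked mechanically against a recorded sequence of event type names. The sketch below is one possible reading, not part of the jira: RecoveryOrderCheck and respectsOrder are hypothetical names, events are plain strings, and a log that lacks either event of a pair is treated as trivially valid, which is an assumption the spec does not state.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch, not Tez code: checks one "A before B" rule on a log of event names.
final class RecoveryOrderCheck {

    /**
     * Returns true when the first occurrence of `before` precedes the first
     * occurrence of `after`. If either event is absent, the rule is treated
     * as satisfied (an assumption made for this example).
     */
    static boolean respectsOrder(List<String> log, String before, String after) {
        int firstBefore = log.indexOf(before);
        int firstAfter = log.indexOf(after);
        if (firstBefore < 0 || firstAfter < 0) {
            return true; // nothing to compare
        }
        return firstBefore < firstAfter;
    }

    public static void main(String[] args) {
        List<String> log = Arrays.asList(
                "VertexInitializedEvent", "VertexParallelismUpdatedEvent", "TaskStartedEvent");
        System.out.println(respectsOrder(log, "VertexParallelismUpdatedEvent", "TaskStartedEvent"));
    }
}
```

 A fuller checker would run one such call per rule in the spec and per DAG, vertex, task, or task attempt.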





[jira] [Comment Edited] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-18 Thread Jeff Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549781#comment-14549781
 ] 

Jeff Zhang edited comment on TEZ-2456 at 5/19/15 4:51 AM:
--

[~hitesh] Thanks for the review. 

bq. TaskFinishedEvent must be logged before VertexFinishedEvent   ( 
Retrospective failures? )
Rephrase it to:   For VertexFinishedEvent (SUCCEEDED), before it there must be 
at least n TaskFinishedEvent (SUCCEEDED)

bq. TaskAttemptFinishedEvent should be logged before TaskFinishedEvent ( 
Retrospective failures? )
Rephrase it to:  For TaskFinishedEvent (SUCCEEDED), before it there must be at 
least one TaskAttemptFinishedEvent (SUCCEEDED)

bq. Also, nothing in the list related to speculated attempts. How should those 
be handled?
After TEZ-2249, all the task attempts should be finished before the task is 
finished. 

bq. There are multiple places where this is called out. Is there an issue if it 
gets logged twice? Will something break? Should there be checks to ensure it is 
logged only once or can the recovery handle it if the event is logged twice? 
What kind of problems do you see if it happens twice? 
Logging the same event multiple times causes two issues.
* The metrics will be incorrect, especially start_time and finished_time.
* If the AM is killed again, the next recovery will handle the same recovery 
event multiple times, which may cause problems. I think the 
restoreFromEvent method assumes every event is logged once. 




was (Author: zjffdu):
[~hitesh] Thanks for the review. 

bq. TaskFinishedEvent must be logged before VertexFinishedEvent   ( 
Retrospective failures? )
Rephrase it to:   For VertexFinishedEvent (SUCCEEDED), there must be at least n 
TaskFinishedEvent (SUCCEEDED)

bq. TaskAttemptFinishedEvent should be logged before TaskFinishedEvent ( 
Retrospective failures? )
Rephrase it to:  For TaskFinishedEvent (SUCCEEDED), there must be at least one 
TaskAttemptFinishedEvent (SUCCEEDED)

bq. Also, nothing in the list related to speculated attempts. How should those 
be handled?
After TEZ-2249, all the task attempts should be finished before the task is 
finished. 

bq. There are multiple places where this is called out. Is there an issue if it 
gets logged twice? Will something break? Should there be checks to ensure it is 
logged only once or can the recovery handle it if the event is logged twice? 
What kind of problems do you see if it happens twice? 
Logging the same event multiple times causes two issues.
* The metrics will be incorrect, especially start_time and finished_time.
* If the AM is killed again, the next recovery will handle the same recovery 
event multiple times, which may cause problems. I think the 
restoreFromEvent method assumes every event is logged once. 



 Refactor recovery event logging to ensure it meet the recovery event spec
 -

 Key: TEZ-2456
 URL: https://issues.apache.org/jira/browse/TEZ-2456
 Project: Apache Tez
  Issue Type: Sub-task
Reporter: Jeff Zhang
Assignee: Jeff Zhang

 Currently we don't have a spec for recovery event logging, so recovery is 
 fragile to code changes. This jira tries to define the spec and refactor the 
 recovery event logging to ensure it meets the spec. [~hitesh] Please help 
 review the following spec I drafted.
 *DAG*
 * DAGSubmitted/DAGInitializedEvent/DAGStartedEvent must be logged once, and 
 should not be logged again when the DAG is recovered.
 * DAGFinishedEvent may be logged multiple times.  ( DAG moves from SUCCEEDED 
 to ERROR ? Should we ignore this ? )
 * VertexFinishedEvent should be logged before DAGFinishedEvent
 *Vertex* 
 * RootInputDataInformation must be logged before VertexInitializedEvent
 * DataMovement must be logged before TaskFinishedEvent
 * TaskFinishedEvent must be logged before VertexFinishedEvent
 * VertexInitializedEvent / VertexStartedEvent should only be logged once, and 
 should not be logged again when the vertex is recovered.
 * VertexFinishedEvent may be logged multiple times. (e.g. Vertex moves from 
 SUCCEEDED to FAILED)
 * VertexParallelismUpdatedEvent must be logged before TaskStartedEvent
 * TaskFinishedEvent should be logged before VertexFinishedEvent
 *Task*
 * If there’s no TaskStartedEvent, TaskFinishedEvent may still be logged (e.g. 
 Task is killed in NEW ).  The current behavior is that TaskFinishedEvent won’t 
 be logged if there’s no TaskStartedEvent. 
 * TaskStartedEvent should only be logged once, and should not be logged again 
 when the task is recovered.
 * TaskFinishedEvent may be logged multiple times (e.g. Task moves from 
 SUCCEEDED to FAILED)
 * TaskAttemptFinishedEvent should be logged before TaskFinishedEvent
   
 *TaskAttempt*
 * If there’s no TaskAttemptStartedEvent, TaskAttemptFinishedEvent may still 
 be logged ( 

[jira] [Commented] (TEZ-2461) tez-history-parser compile fails with hadoop-2.4

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549738#comment-14549738
 ] 

TezQA commented on TEZ-2461:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733697/TEZ-2461.1.patch
  against master revision 11aa17e.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/696//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/696//console

This message is automatically generated.

 tez-history-parser compile fails with hadoop-2.4
 

 Key: TEZ-2461
 URL: https://issues.apache.org/jira/browse/TEZ-2461
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Rajesh Balamohan
 Attachments: TEZ-2461.1.patch


 https://builds.apache.org/job/Tez-Build-Hadoop-2.4/98/console
 \cc [~rajesh.balamohan]





[jira] [Commented] (TEZ-2461) tez-history-parser compile fails with hadoop-2.4

2015-05-18 Thread Hitesh Shah (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549791#comment-14549791
 ] 

Hitesh Shah commented on TEZ-2461:
--

{code}
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.1:compile (default-compile) on 
project tez-history-parser: Compilation failure: Compilation failure:
[ERROR] 
https://builds.apache.org/job/Tez-Build-Hadoop-2.4/ws/tez-plugins/tez-history-parser/src/main/java/org/apache/tez/history/ATSImportTool.java:[44,37]
 package org.apache.directory.api.util does not exist
[ERROR] 
https://builds.apache.org/job/Tez-Build-Hadoop-2.4/ws/tez-plugins/tez-history-parser/src/main/java/org/apache/tez/history/parser/utils/Utils.java:[21,37]
 package org.apache.directory.api.util does not exist
{code}

Saw the directory api issue too. 

 tez-history-parser compile fails with hadoop-2.4
 

 Key: TEZ-2461
 URL: https://issues.apache.org/jira/browse/TEZ-2461
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Rajesh Balamohan
 Fix For: 0.8.0

 Attachments: TEZ-2461.1.patch


 https://builds.apache.org/job/Tez-Build-Hadoop-2.4/98/console
 \cc [~rajesh.balamohan]





[jira] [Commented] (TEZ-2460) Temporary solution for issue due to YARN-2560

2015-05-18 Thread Bikas Saha (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549803#comment-14549803
 ] 

Bikas Saha commented on TEZ-2460:
-

This workaround may need to stay around longer than temporarily, depending on 
how much effort it takes to resolve the root cause in YARN. The patch looks 
fine, though we should add a private configuration to determine how long the 
timeout should be. The rest looks good.
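
A configurable timeout of the kind suggested here could look roughly like the sketch below. The patch itself is not shown in this thread, so everything in the example is an assumption: the class TimeoutSetting, the key "example.timeout-ms", and the 10-second default are placeholders, and java.util.Properties stands in for whatever configuration object the real code uses.

```java
import java.util.Properties;

// Hypothetical sketch, not Tez code: reads a timeout from configuration with a fallback.
final class TimeoutSetting {
    // Placeholder key and default; neither comes from the patch under review.
    static final String KEY = "example.timeout-ms";
    static final long DEFAULT_MS = 10_000L;

    /** Returns the configured timeout in milliseconds, or the default if unset or invalid. */
    static long timeoutMs(Properties conf) {
        String raw = conf.getProperty(KEY);
        if (raw == null) {
            return DEFAULT_MS;
        }
        try {
            long value = Long.parseLong(raw.trim());
            return value > 0 ? value : DEFAULT_MS; // reject zero and negative values
        } catch (NumberFormatException e) {
            return DEFAULT_MS; // unparsable value: fall back to the default
        }
    }
}
```

Keeping the key private (undocumented) lets the value be tuned without turning the workaround into a supported setting.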

 Temporary solution for issue due to YARN-2560
 -

 Key: TEZ-2460
 URL: https://issues.apache.org/jira/browse/TEZ-2460
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Jeff Zhang
 Attachments: TEZ-2460-1.patch


 Due to YARN-2560, DAGClient can't get the correct diagnostics. This jira 
 provides a temporary solution until YARN-2560 is resolved.  Here's one 
 example in which the DAG fails because of incompatible versions.
 {noformat}
 15/05/18 17:34:18 INFO client.TezClient: Tez Client Version: [ 
 component=tez-api, version=0.7.0.2.3.0.0-1986, 
 revision=7a048b526519e53de5d6464493697e92b15718fc, 
 SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
 buildTime=20150514-0857 ]
 15/05/18 17:34:19 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:19 INFO client.RMProxy: Connecting to ResourceManager at 
 c6402.ambari.apache.org/192.168.64.102:8050
 15/05/18 17:34:19 INFO client.TezClient: Using 
 org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager to manage 
 Timeline ACLs
 15/05/18 17:34:19 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:21 INFO examples.OrderedWordCount: Running OrderedWordCount
 15/05/18 17:34:21 INFO client.TezClient: Submitting DAG application with id: 
 application_1431967480765_0017
 15/05/18 17:34:21 INFO client.TezClientUtils: Using tez.lib.uris value from 
 configuration: /hdp/apps/2.3.0.0-1986/tez/tez.tar.gz
 15/05/18 17:34:21 INFO client.TezClient: Tez system stage directory 
 hdfs://c1ha/tmp/ambari-qa/staging/.tez/application_1431967480765_0017 doesn't 
 exist and is created
 15/05/18 17:34:21 INFO acls.ATSHistoryACLPolicyManager: Created Timeline 
 Domain for History ACLs, domainId=Tez_ATS_application_1431967480765_0017
 15/05/18 17:34:22 INFO client.TezClient: Submitting DAG to YARN, 
 applicationId=application_1431967480765_0017, dagName=OrderedWordCount
 15/05/18 17:34:22 INFO impl.YarnClientImpl: Submitted application 
 application_1431967480765_0017
 15/05/18 17:34:22 INFO client.TezClient: The url to track the Tez AM: 
 http://c6402.ambari.apache.org:8088/proxy/application_1431967480765_0017/
 15/05/18 17:34:22 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:22 INFO client.RMProxy: Connecting to ResourceManager at 
 c6402.ambari.apache.org/192.168.64.102:8050
 15/05/18 17:34:22 INFO client.DAGClientImpl: Waiting for DAG to start running
 15/05/18 17:34:34 INFO client.DAGClientImpl: DAG initialized: 
 CurrentState=Running
 15/05/18 17:34:38 INFO client.DAGClientImpl: DAG completed. FinalState=FAILED
 15/05/18 17:34:38 INFO examples.OrderedWordCount: DAG diagnostics: []
 {noformat}
 {noformat}
 2015-05-18 16:57:46,807 INFO [main] app.DAGAppMaster: Created DAGAppMaster 
 for application appattempt_1431967480765_0008_01, versionInfo=[ 
 component=tez-dag, version=0.5.2.2.2.3.0-2611, 
 revision=2d3c6b639d5b1048bd20aad5736823a35edd2485, 
 SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
 buildTIme=20150314-1805 ]
 2015-05-18 16:57:47,931 INFO [main] app.DAGAppMaster: Comparing client 
 version with AM version, clientVersion=0.7.0.2.3.0.0-1986, 
 AMVersion=0.5.2.2.2.3.0-2611
 2015-05-18 16:57:47,932 FATAL [main] app.DAGAppMaster: Incompatible versions 
 found, clientVersion=0.7.0.2.3.0.0-1986, AMVersion=0.5.2.2.2.3.0-2611
 2015-05-18 16:57:49,348 INFO [main] app.DAGAppMaster: Adding session token to 
 jobTokenSecretManager for application
 2015-05-18 16:57:49,354 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.rm.container.AMContainerEventType for class 
 org.apache.tez.dag.app.rm.container.AMContainerMap
 2015-05-18 16:57:49,355 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.rm.node.AMNodeEventType for class 
 org.apache.tez.dag.app.rm.node.AMNodeTracker
 2015-05-18 16:57:49,356 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.DAGAppMasterEventType for class 
 org.apache.tez.dag.app.DAGAppMaster$DAGAppMasterEventHandler
 2015-05-18 16:57:49,356 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.DAGEventType for class 
 

Failed: TEZ-2461 PreCommit Build #696

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2461
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/696/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 3009 lines...]



{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733697/TEZ-2461.1.patch
  against master revision 11aa17e.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/696//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/696//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
a5aff1cc25b5559b0f2d6be8d21bff33b2423fb1 logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #694
Archived 47 artifacts
Archive block size is 32768
Received 6 blocks and 2689005 bytes
Compression is 6.8%
Took 1.6 sec
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-2461) tez-history-parser compile fails with hadoop-2.4

2015-05-18 Thread Jeff Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14549774#comment-14549774
 ] 

Jeff Zhang commented on TEZ-2461:
-

+1

 tez-history-parser compile fails with hadoop-2.4
 

 Key: TEZ-2461
 URL: https://issues.apache.org/jira/browse/TEZ-2461
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Rajesh Balamohan
 Attachments: TEZ-2461.1.patch


 https://builds.apache.org/job/Tez-Build-Hadoop-2.4/98/console
 \cc [~rajesh.balamohan]





[jira] [Updated] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-05-18 Thread Rajesh Balamohan (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajesh Balamohan updated TEZ-2076:
--
Attachment: TEZ-2076.16.patch

Reduced the options to 2 and fixed javadoc to reflect this.
Thanks [~hitesh]. Will wait for precommit build and commit to master.  Will 
create a followup jira for --allow-incomplete-data flag

 Tez framework to extract/analyze data stored in ATS for specific dag
 

 Key: TEZ-2076
 URL: https://issues.apache.org/jira/browse/TEZ-2076
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
 Attachments: TEZ-2076.1.patch, TEZ-2076.10.patch, TEZ-2076.11.patch, 
 TEZ-2076.12.patch, TEZ-2076.13.patch, TEZ-2076.14.patch, TEZ-2076.15.patch, 
 TEZ-2076.16.patch, TEZ-2076.2.patch, TEZ-2076.3.patch, TEZ-2076.4.patch, 
 TEZ-2076.5.patch, TEZ-2076.6.patch, TEZ-2076.7.patch, TEZ-2076.8.patch, 
 TEZ-2076.9.patch, TEZ-2076.WIP.2.patch, TEZ-2076.WIP.3.patch, 
 TEZ-2076.WIP.patch


 - Users should be able to download ATS data pertaining to a DAG from Tez-UI 
 (more like a zip file containing DAG/Vertex/Task/TaskAttempt info).
 - This can be plugged into an analyzer which parses the data, adds semantics 
 and provides an in-memory representation for further analysis.
 - This will make it possible to write different analyzer rules, which can be 
 run on top of this in-memory representation to produce an analysis of the DAG.
 - Results of these analyzer rules can be rendered in a UI (standalone webapp) 
 at a later point in time.





[jira] [Updated] (TEZ-2454) Change FetcherOrderedGroup to work as Callables instead of blocking threads

2015-05-18 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated TEZ-2454:

Attachment: TEZ-2454.4.txt

findbugs ignored properly this time.

 Change FetcherOrderedGroup to work as Callables instead of blocking threads
 ---

 Key: TEZ-2454
 URL: https://issues.apache.org/jira/browse/TEZ-2454
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Attachments: TEZ-2454.1.wip.txt, TEZ-2454.2.txt, TEZ-2454.3.txt, 
 TEZ-2454.4.txt


 The Fetcher threads for Ordered Input currently run and block till merge 
 completes, which makes it difficult to use them via ThreadPools.
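
 As a rough illustration of the Callable approach described above (not the code in the attached patches), a fetch unit can do a bounded piece of work and return, so that an ordinary thread pool can schedule it. FetchTask and FetchPoolDemo are hypothetical names, and the "fetch" is simulated by returning a count.

```java
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical fetch unit: does a bounded amount of work and returns,
// instead of looping and blocking until the merge completes.
final class FetchTask implements Callable<Integer> {
    private final String host;
    private final int outputsToFetch;

    FetchTask(String host, int outputsToFetch) {
        this.host = host;
        this.outputsToFetch = outputsToFetch;
    }

    @Override
    public Integer call() {
        // A real fetcher would connect to 'host' and copy map outputs here;
        // this sketch only reports how many outputs it was asked to fetch.
        return outputsToFetch;
    }
}

final class FetchPoolDemo {
    // Runs all tasks on a fixed-size pool and sums the number of fetched outputs.
    static int fetchAll(List<FetchTask> tasks, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            int total = 0;
            for (Future<Integer> result : pool.invokeAll(tasks)) {
                total += result.get();
            }
            return total;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while fetching", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("fetch failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }
}
```

 Because each task terminates on its own, the pool size no longer has to match the number of hosts being fetched from.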





Failed: TEZ-2454 PreCommit Build #690

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2454
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/690/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 2881 lines...]




{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733611/TEZ-2454.3.txt
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/690//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TEZ-Build/690//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/690//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
f2606419b5a31bab23cce0d96ba1b66745001750 logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #688
Archived 44 artifacts
Archive block size is 32768
Received 20 blocks and 2135483 bytes
Compression is 23.5%
Took 1.7 sec
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
All tests passed

Success: TEZ-2076 PreCommit Build #693

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2076
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/693/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 3021 lines...]
[INFO] Final Memory: 87M/1470M
[INFO] 




{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733662/TEZ-2076.16.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/693//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/693//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
3dd8d5ce1c11fdeef9f1a2bbbcbb64c36ae54460 logged out


==
==
Finished build.
==
==


Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #691
Archived 47 artifacts
Archive block size is 32768
Received 4 blocks and 2736290 bytes
Compression is 4.6%
Took 0.76 sec
Description set: TEZ-2076
Recording test results
Email was triggered for: Success
Sending email for trigger: Success



###
## FAILED TESTS (if any) 
##
All tests passed

Failed: TEZ-2409 PreCommit Build #692

2015-05-18 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/TEZ-2409
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/692/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 2474 lines...]




{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733655/TEZ-2409.1.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 6 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 core tests{color}.  The following test timeouts occurred in :
 org.apache.tez.dag.app.dag.impl.TestVertexImpl

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/692//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/692//console

This message is automatically generated.


==
==
Adding comment to Jira.
==
==


Comment added.
1b8cc91d36a0437e0ad6f9073c3dc9cd7920fcf5 logged out


==
==
Finished build.
==
==


Build step 'Execute shell' marked build as failure
Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #691
Archived 44 artifacts
Archive block size is 32768
Received 4 blocks and 2625496 bytes
Compression is 4.8%
Took 0.84 sec
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Commented] (TEZ-2409) Allow different edges to have different routing plugins

2015-05-18 Thread TezQA (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14549501#comment-14549501
 ] 

TezQA commented on TEZ-2409:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment
  http://issues.apache.org/jira/secure/attachment/12733655/TEZ-2409.1.patch
  against master revision 12fc2c7.

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 6 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 core tests{color}.  The following test timeouts occurred in :
 org.apache.tez.dag.app.dag.impl.TestVertexImpl

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/692//testReport/
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/692//console

This message is automatically generated.

 Allow different edges to have different routing plugins
 ---

 Key: TEZ-2409
 URL: https://issues.apache.org/jira/browse/TEZ-2409
 Project: Apache Tez
  Issue Type: Task
Reporter: Bikas Saha
Assignee: Bikas Saha
Priority: Critical
 Attachments: TEZ-2409.1.patch


 It may be useful to allow different edge manager plugin types based on 
 different requirements. In order to support this, we would need to support 
 different plugins per edge for routing the events on that edge. A motivating 
 scenario is when a custom plugin from an older release of a downstream 
 project is using older API's while the latest release of that project has 
 moved on to newer API's. This would allow both old and new releases to work 
 with the latest Tez framework as optimally as possible.





[jira] [Created] (TEZ-2459) Add --allow-incomplete-data flag to ATSImportTool

2015-05-18 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created TEZ-2459:
-

 Summary: Add --allow-incomplete-data flag to ATSImportTool
 Key: TEZ-2459
 URL: https://issues.apache.org/jira/browse/TEZ-2459
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Rajesh Balamohan


Provide a way to warn the user if the import tool is used on an in-progress dag 
- maybe this tool should just error out for an in-progress dag and only work if 
there is a special --allow-incomplete-data or similar flag set
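
One possible shape for the behaviour proposed above, as a minimal sketch: refuse an in-progress DAG by default and only warn when the flag is set. ImportGuard and DagState are hypothetical names used for illustration, not part of ATSImportTool, and the messages are invented.

```java
import java.util.Optional;

// Hypothetical guard; none of these names come from ATSImportTool itself.
final class ImportGuard {
    enum DagState { RUNNING, SUCCEEDED, FAILED, KILLED }

    // Returns a warning to print, or empty if the DAG has already finished.
    // Throws if the DAG is in progress and the flag was not given.
    static Optional<String> check(DagState state, boolean allowIncompleteData) {
        if (state != DagState.RUNNING) {
            return Optional.empty();
        }
        if (!allowIncompleteData) {
            throw new IllegalStateException(
                "DAG is still in progress; pass --allow-incomplete-data to import partial data");
        }
        return Optional.of("WARNING: importing an in-progress DAG; data may be incomplete");
    }
}
```

Failing by default keeps a partial download from being mistaken for a complete one, while the flag leaves an explicit opt-in for users who want a snapshot.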





[jira] [Updated] (TEZ-2460) Temporary solution for issue due to YARN-2560

2015-05-18 Thread Jeff Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Zhang updated TEZ-2460:

Attachment: TEZ-2460-1.patch

 Temporary solution for issue due to YARN-2560
 -

 Key: TEZ-2460
 URL: https://issues.apache.org/jira/browse/TEZ-2460
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jeff Zhang
Assignee: Jeff Zhang
 Attachments: TEZ-2460-1.patch


 Due to YARN-2560, DAGClient can't get the correct diagnostics. This jira is 
 to provide a temporary solution until YARN-2560 is resolved.  Here's one 
 example where the dag failed due to incompatible versions.
 {noformat}
 15/05/18 17:34:18 INFO client.TezClient: Tez Client Version: [ 
 component=tez-api, version=0.7.0.2.3.0.0-1986, 
 revision=7a048b526519e53de5d6464493697e92b15718fc, 
 SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
 buildTime=20150514-0857 ]
 15/05/18 17:34:19 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:19 INFO client.RMProxy: Connecting to ResourceManager at 
 c6402.ambari.apache.org/192.168.64.102:8050
 15/05/18 17:34:19 INFO client.TezClient: Using 
 org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager to manage 
 Timeline ACLs
 15/05/18 17:34:19 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:21 INFO examples.OrderedWordCount: Running OrderedWordCount
 15/05/18 17:34:21 INFO client.TezClient: Submitting DAG application with id: 
 application_1431967480765_0017
 15/05/18 17:34:21 INFO client.TezClientUtils: Using tez.lib.uris value from 
 configuration: /hdp/apps/2.3.0.0-1986/tez/tez.tar.gz
 15/05/18 17:34:21 INFO client.TezClient: Tez system stage directory 
 hdfs://c1ha/tmp/ambari-qa/staging/.tez/application_1431967480765_0017 doesn't 
 exist and is created
 15/05/18 17:34:21 INFO acls.ATSHistoryACLPolicyManager: Created Timeline 
 Domain for History ACLs, domainId=Tez_ATS_application_1431967480765_0017
 15/05/18 17:34:22 INFO client.TezClient: Submitting DAG to YARN, 
 applicationId=application_1431967480765_0017, dagName=OrderedWordCount
 15/05/18 17:34:22 INFO impl.YarnClientImpl: Submitted application 
 application_1431967480765_0017
 15/05/18 17:34:22 INFO client.TezClient: The url to track the Tez AM: 
 http://c6402.ambari.apache.org:8088/proxy/application_1431967480765_0017/
 15/05/18 17:34:22 INFO impl.TimelineClientImpl: Timeline service address: 
 http://c6402.ambari.apache.org:8188/ws/v1/timeline/
 15/05/18 17:34:22 INFO client.RMProxy: Connecting to ResourceManager at 
 c6402.ambari.apache.org/192.168.64.102:8050
 15/05/18 17:34:22 INFO client.DAGClientImpl: Waiting for DAG to start running
 15/05/18 17:34:34 INFO client.DAGClientImpl: DAG initialized: 
 CurrentState=Running
 15/05/18 17:34:38 INFO client.DAGClientImpl: DAG completed. FinalState=FAILED
 15/05/18 17:34:38 INFO examples.OrderedWordCount: DAG diagnostics: []
 {noformat}
 {noformat}
 2015-05-18 16:57:46,807 INFO [main] app.DAGAppMaster: Created DAGAppMaster 
 for application appattempt_1431967480765_0008_01, versionInfo=[ 
 component=tez-dag, version=0.5.2.2.2.3.0-2611, 
 revision=2d3c6b639d5b1048bd20aad5736823a35edd2485, 
 SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
 buildTIme=20150314-1805 ]
 2015-05-18 16:57:47,931 INFO [main] app.DAGAppMaster: Comparing client 
 version with AM version, clientVersion=0.7.0.2.3.0.0-1986, 
 AMVersion=0.5.2.2.2.3.0-2611
 2015-05-18 16:57:47,932 FATAL [main] app.DAGAppMaster: Incompatible versions 
 found, clientVersion=0.7.0.2.3.0.0-1986, AMVersion=0.5.2.2.2.3.0-2611
 2015-05-18 16:57:49,348 INFO [main] app.DAGAppMaster: Adding session token to 
 jobTokenSecretManager for application
 2015-05-18 16:57:49,354 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.rm.container.AMContainerEventType for class 
 org.apache.tez.dag.app.rm.container.AMContainerMap
 2015-05-18 16:57:49,355 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.rm.node.AMNodeEventType for class 
 org.apache.tez.dag.app.rm.node.AMNodeTracker
 2015-05-18 16:57:49,356 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.DAGAppMasterEventType for class 
 org.apache.tez.dag.app.DAGAppMaster$DAGAppMasterEventHandler
 2015-05-18 16:57:49,356 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.DAGEventType for class 
 org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher
 2015-05-18 16:57:49,357 INFO [main] event.AsyncDispatcher: Registering class 
 org.apache.tez.dag.app.dag.event.VertexEventType for class 
 org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher
 2015-05-18 16:57:49,358 INFO [main] 

[jira] [Commented] (TEZ-2450) support async http clients in ordered & unordered inputs

2015-05-18 Thread Siddharth Seth (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547618#comment-14547618
 ] 

Siddharth Seth commented on TEZ-2450:
-

Minor stuff
- Default value for TEZ_RUNTIME_SHUFFLE_USE_ASYNC_HTTP in TezConfiguration
- ShuffleUtils.getHttpConnection - Avoid reading from config on each request.
- TestPipelinedShuffle - 
conf.setBoolean(TezRuntimeConfiguration.TEZ_RUNTIME_SHUFFLE_USE_ASYNC_HTTP, 
true); and later set to true again in the test. Think all tests may end up 
running with the AsyncClient

More important
- CompletionHandler - this throws exceptions from the various callbacks. In 
case of errors (Connection Validation), this probably needs to inform the 
ShuffleManager / ShuffleScheduler - like the Fetcher does. Not sure how the 
library handles exceptions thrown by the callback.
- response = httpAsyncClient.executeRequest(request, new 
CompletionHandler()).get(); - Will this only return after all data has been 
fetched, or will it return right after the Connection establishment ? Will this 
need to go to disk / memory based on the response size - using the 
onPartReceived callback ?
Given the async nature and callbacks when data is received - I'm guessing the 
requirement for N fetcher threads probably goes away in the future, to be 
replaced by thread management done in the library itself ?
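
For the "avoid reading from config on each request" point, one option is to resolve the flag once when the connection factory is built and reuse the cached value. The sketch below uses hypothetical names (CountingConf, ConnectionFactory) and a made-up key string; it does not use the real Tez configuration classes or the actual value of TEZ_RUNTIME_SHUFFLE_USE_ASYNC_HTTP.

```java
import java.util.HashMap;
import java.util.Map;

// Stand-in for a configuration object; counts lookups so the effect is visible.
final class CountingConf {
    private final Map<String, String> values = new HashMap<>();
    int reads = 0;

    void set(String key, String value) {
        values.put(key, value);
    }

    boolean getBoolean(String key, boolean defaultValue) {
        reads++;
        String value = values.get(key);
        return value == null ? defaultValue : Boolean.parseBoolean(value);
    }
}

final class ConnectionFactory {
    // Hypothetical key, used only for this illustration.
    static final String USE_ASYNC_KEY = "example.shuffle.use.async.http";

    private final boolean useAsync;

    ConnectionFactory(CountingConf conf) {
        // Read the setting once; later requests reuse the cached field.
        this.useAsync = conf.getBoolean(USE_ASYNC_KEY, false);
    }

    String clientKindForRequest() {
        return useAsync ? "async" : "jdk";
    }
}
```

With this shape the per-request path touches only a final field, so the configuration is consulted once no matter how many connections are created.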

 support async http clients in ordered & unordered inputs
 

 Key: TEZ-2450
 URL: https://issues.apache.org/jira/browse/TEZ-2450
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
 Attachments: TEZ-2450.1.patch, TEZ-2450.WIP.patch


 It will be helpful to switch between JDK & other async http impls.  For LLAP 
 scenarios, it would be useful to make http clients interruptible which is 
 supported in async libraries.





[jira] [Commented] (TEZ-2450) support async http clients in ordered & unordered inputs

2015-05-18 Thread Rajesh Balamohan (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547773#comment-14547773
 ] 

Rajesh Balamohan commented on TEZ-2450:
---

Thanks for the review [~sseth].  Canceling the patch as the onBodyPartReceived() 
implementation needs more thought, since it would impact the way MapOutputs 
are generated.  It is possible to stream the contents of onBodyPartReceived to 
a file. But even for that decision (file/mem), we need to read the ShuffleHeader 
from the inputStream, which does not look straightforward using 
onBodyPartReceived. It appears that response.getResponseBodyAsStream() 
wraps the bytes in an inputstream, which wouldn't be very useful for large 
downloads.

 support async http clients in ordered & unordered inputs
 

 Key: TEZ-2450
 URL: https://issues.apache.org/jira/browse/TEZ-2450
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
 Attachments: TEZ-2450.1.patch, TEZ-2450.WIP.patch


 It will be helpful to switch between JDK & other async http impls.  For LLAP 
 scenarios, it would be useful to make http clients interruptible which is 
 supported in async libraries.





[jira] [Updated] (TEZ-2455) Tez UI: Dag view caching, error handling and minor layout changes

2015-05-18 Thread Prakash Ramachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prakash Ramachandran updated TEZ-2455:
--
Summary: Tez UI: Dag view caching, error handling and minor layout changes  
(was: Tez UI: Dag view changes and data load changes)

 Tez UI: Dag view caching, error handling and minor layout changes
 -

 Key: TEZ-2455
 URL: https://issues.apache.org/jira/browse/TEZ-2455
 Project: Apache Tez
  Issue Type: Bug
Reporter: Sreenath Somarajapuram
Assignee: Sreenath Somarajapuram
 Attachments: TEZ-2455.1.patch, TEZ-2455.2.patch, TEZ-2455.3.patch


 # Enable caching in Dag View.
 # View throws error when all vertex data are not loaded
 # Display single outputs directly below the vertex
 # Vertex & Input nodes at the same level slightly overlap at times
 # Entities under a DAG not loading in IE
 # Prevent 'All DAGs' page from hitting ATS with a huge limit when rowCount is 
 manually edited





[jira] [Commented] (TEZ-2455) Tez UI: Dag view caching, error handling and minor layout changes

2015-05-18 Thread Prakash Ramachandran (JIRA)

[ 
https://issues.apache.org/jira/browse/TEZ-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547864#comment-14547864
 ] 

Prakash Ramachandran commented on TEZ-2455:
---

+1 LGTM. committing. 

 Tez UI: Dag view caching, error handling and minor layout changes
 -

 Key: TEZ-2455
 URL: https://issues.apache.org/jira/browse/TEZ-2455
 Project: Apache Tez
  Issue Type: Bug
Reporter: Sreenath Somarajapuram
Assignee: Sreenath Somarajapuram
 Attachments: TEZ-2455.1.patch, TEZ-2455.2.patch, TEZ-2455.3.patch


 # Enable caching in Dag View.
 # View throws error when all vertex data are not loaded
 # Display single outputs directly below the vertex
 # Vertex & Input nodes at the same level slightly overlap at times
 # Entities under a DAG not loading in IE
 # Prevent 'All DAGs' page from hitting ATS with a huge limit when rowCount is 
 manually edited


