[jira] [Assigned] (TEZ-4570) Implement data-via-events for ordered outputs

2024-06-18 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles reassigned TEZ-4570:
---

Assignee: Jonathan Turner Eagles

> Implement data-via-events for ordered outputs
> -
>
> Key: TEZ-4570
> URL: https://issues.apache.org/jira/browse/TEZ-4570
> Project: Apache Tez
>  Issue Type: New Feature
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>
> Currently, data-via-events is only implemented by the unordered outputs and 
> unordered fetch.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4571) Shared fetch enabled fetches all partitions on task 0000s directly to disk for non-broadcast edges

2024-06-14 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4571:
---

 Summary: Shared fetch enabled fetches all partitions on task s 
directly to disk for non-broadcast edges
 Key: TEZ-4571
 URL: https://issues.apache.org/jira/browse/TEZ-4571
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Jonathan Turner Eagles


I think the idea of shared fetch is to fetch once per node for broadcast input. 
However, the enabled in the fetcher doesn't check the edge type but only that 
1) shared fetch is enabled and 2) task for the vertex is . For broadcast 
edge this is correct perhaps, but for non-broadcast edges, all partitions are 
fetched to disk without possibility of sharing



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4570) Implement data-via-events for ordered outputs

2024-06-14 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4570:
---

 Summary: Implement data-via-events for ordered outputs
 Key: TEZ-4570
 URL: https://issues.apache.org/jira/browse/TEZ-4570
 Project: Apache Tez
  Issue Type: New Feature
Reporter: Jonathan Turner Eagles


Currently, data-via-events is only implemented by the unordered outputs and 
unordered fetch.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4565) TestAnalyzer subtest testInternalPreemption is flaky

2024-05-16 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4565:
---

 Summary: TestAnalyzer subtest testInternalPreemption is flaky
 Key: TEZ-4565
 URL: https://issues.apache.org/jira/browse/TEZ-4565
 Project: Apache Tez
  Issue Type: Test
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4562) Fix Tez Job Analyzer after TEZ_DAG_EXTRA_INFO

2024-05-07 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4562:
---

 Summary: Fix Tez Job Analyzer after TEZ_DAG_EXTRA_INFO
 Key: TEZ-4562
 URL: https://issues.apache.org/jira/browse/TEZ-4562
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles


TEZ-3611 split DAG INFO and DAG EXTRA INFO but tez job analyzer wasn't updated 
to account for the change.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (TEZ-4558) Update build setup maven version and enforcer minimum to correct minimum

2024-05-07 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4558.
-
Fix Version/s: 0.10.4
   Resolution: Fixed

> Update build setup maven version and enforcer minimum to correct minimum
> 
>
> Key: TEZ-4558
> URL: https://issues.apache.org/jira/browse/TEZ-4558
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.10.4
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> Build can't succeed according to build instructions and maven required 
> version enforcement.
> maven-enforcer-plugin: requireMavenVersion 3.0.2
> [MVNVM] Using maven: 3.1.0
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-enforcer-plugin:3.0.0:enforce 
> (enforce-maven-version) on project tez: The plugin 
> org.apache.maven.plugins:maven-enforcer-plugin:3.0.0 requires Maven version 
> 3.1.1 -> [Help 1]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TEZ-4558) Update build setup maven version and enforcer minimum to correct minimum

2024-05-02 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17843003#comment-17843003
 ] 

Jonathan Turner Eagles commented on TEZ-4558:
-

Personally, I've been using 3.6.3. But have been using mvnvm recently which 
grabs the maven required version from the pom file and uses that maven version 
to build the project with.

> Update build setup maven version and enforcer minimum to correct minimum
> 
>
> Key: TEZ-4558
> URL: https://issues.apache.org/jira/browse/TEZ-4558
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Build can't succeed according to build instructions and maven required 
> version enforcement.
> maven-enforcer-plugin: requireMavenVersion 3.0.2
> [MVNVM] Using maven: 3.1.0
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-enforcer-plugin:3.0.0:enforce 
> (enforce-maven-version) on project tez: The plugin 
> org.apache.maven.plugins:maven-enforcer-plugin:3.0.0 requires Maven version 
> 3.1.1 -> [Help 1]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (TEZ-4558) Update build setup maven version and enforcer minimum to correct minimum

2024-05-02 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4558:

Description: 
Build can't succeed according to build instructions and maven required version 
enforcement.

maven-enforcer-plugin: requireMavenVersion 3.0.2
[MVNVM] Using maven: 3.1.0
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-enforcer-plugin:3.0.0:enforce 
(enforce-maven-version) on project tez: The plugin 
org.apache.maven.plugins:maven-enforcer-plugin:3.0.0 requires Maven version 
3.1.1 -> [Help 1]

> Update build setup maven version and enforcer minimum to correct minimum
> 
>
> Key: TEZ-4558
> URL: https://issues.apache.org/jira/browse/TEZ-4558
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>
> Build can't succeed according to build instructions and maven required 
> version enforcement.
> maven-enforcer-plugin: requireMavenVersion 3.0.2
> [MVNVM] Using maven: 3.1.0
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-enforcer-plugin:3.0.0:enforce 
> (enforce-maven-version) on project tez: The plugin 
> org.apache.maven.plugins:maven-enforcer-plugin:3.0.0 requires Maven version 
> 3.1.1 -> [Help 1]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4558) Update build setup maven version and enforcer minimum to correct minimum

2024-05-02 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4558:
---

 Summary: Update build setup maven version and enforcer minimum to 
correct minimum
 Key: TEZ-4558
 URL: https://issues.apache.org/jira/browse/TEZ-4558
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (TEZ-4553) Default task scheduler to DagAwareTaskScheduler to avoid hang in TEZ-3535

2024-05-01 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4553.
-
Fix Version/s: 0.10.4
   Resolution: Fixed

> Default task scheduler to DagAwareTaskScheduler to avoid hang in TEZ-3535
> -
>
> Key: TEZ-4553
> URL: https://issues.apache.org/jira/browse/TEZ-4553
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.10.4
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TEZ-4553) Default task scheduler to DagAwareTaskScheduler to avoid hang in TEZ-3535

2024-04-11 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4553:
---

 Summary: Default task scheduler to DagAwareTaskScheduler to avoid 
hang in TEZ-3535
 Key: TEZ-4553
 URL: https://issues.apache.org/jira/browse/TEZ-4553
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TEZ-2119) Counter for launched containers

2023-08-24 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-2119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17758649#comment-17758649
 ] 

Jonathan Turner Eagles commented on TEZ-2119:
-

I think this looks good. Adding review to github. Good work, [~abstractdog]

> Counter for launched containers
> ---
>
> Key: TEZ-2119
> URL: https://issues.apache.org/jira/browse/TEZ-2119
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Rohini Palaniswamy
>Assignee: László Bodor
>Priority: Major
> Attachments: TEZ-2119.01.patch
>
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> org.apache.tez.common.counters.DAGCounter
> NUM_SUCCEEDED_TASKS=32976
> TOTAL_LAUNCHED_TASKS=32976
> OTHER_LOCAL_TASKS=2
> DATA_LOCAL_TASKS=9147
> RACK_LOCAL_TASKS=23761
> It would be very nice to have TOTAL_LAUNCHED_CONTAINERS counter added to 
> this. The difference between TOTAL_LAUNCHED_CONTAINERS and 
> TOTAL_LAUNCHED_TASKS should make it easy to see how much container reuse is 
> happening. It is very hard to find out now.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TEZ-4419) Upgrade node and yarn version and fix npm security issues in Tez UI module

2022-06-07 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17551189#comment-17551189
 ] 

Jonathan Turner Eagles commented on TEZ-4419:
-

I think many or all of these are solved by TEZ-4337

> Upgrade node and yarn version and fix npm security issues in Tez UI module
> --
>
> Key: TEZ-4419
> URL: https://issues.apache.org/jira/browse/TEZ-4419
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Nikhil Gupta
>Assignee: Aman Raj
>Priority: Major
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Commented] (TEZ-4337) [TEZ UI] Upgrade to Ember 3

2021-10-19 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17430794#comment-17430794
 ] 

Jonathan Turner Eagles commented on TEZ-4337:
-

Documenting the steps for testing puppeteer in ubuntu docker

{code:bash}
% docker build -f build-tools/docker/Dockerfile .
% docker images
REPOSITORY   TAG   IMAGE ID   CREATED  SIZE
   87e0e291e68f   About a minute ago   2.02GB
# Use image id to run new container
docker run -it 
npm install -g yarn
adduser me
su me
cd
yarn add puppeteer
cat > run-chrome.js
const puppeteer = require("puppeteer");
const { execFileSync } = require("child_process");

let exePath = puppeteer.executablePath();
let args = process.argv.slice(2);
console.log('exePath', exePath);
console.log('args', args);
execFileSync(exePath, args);

node run-chrome.js --headless --no-sandbox
exit
exit
docker container ls --all
docker rm 
docker rmi 
{code}


> [TEZ UI] Upgrade to Ember 3
> ---
>
> Key: TEZ-4337
> URL: https://issues.apache.org/jira/browse/TEZ-4337
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4337) [TEZ UI] Upgrade to Ember 3

2021-10-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17427948#comment-17427948
 ] 

Jonathan Turner Eagles commented on TEZ-4337:
-

In addition to upgrading the tez ui component to Ember 3, this jira 
accomplishes a few other things.
* Reduces code size with tree-shaking
* Remove ember bootstrap code and dependency and switch to bootstrap less only 
dependency
* Remove jquery dependency and disabled ember optional jquery dependency
* Replace moment with date-fns
* Upgrade to Node LTS 14, Ember LTS 3.24, Yarn 1.22
* Removed bower dependencies
* Convert templates from deprecated dasherized format to Camel case
* Upgraded to qunit 5 tests
* Upgrade to mode d3 v5. (tree shaking)
* Increased security with content security and removal of eval and Function 
(see exception alasql)
* Increased dag loading performance on large dag
* Remove tez ui verbose build warnings

> [TEZ UI] Upgrade to Ember 3
> ---
>
> Key: TEZ-4337
> URL: https://issues.apache.org/jira/browse/TEZ-4337
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4337) [TEZ UI] Upgrade to Ember 3

2021-09-27 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4337:
---

 Summary: [TEZ UI] Upgrade to Ember 3
 Key: TEZ-4337
 URL: https://issues.apache.org/jira/browse/TEZ-4337
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4330) Import external tez component em-tgraph

2021-09-02 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17409022#comment-17409022
 ] 

Jonathan Turner Eagles commented on TEZ-4330:
-

[~Sreenath] / [~abstractdog], em-helpers and em-table and imported. Is there 
anything else needed for em-tgraph?

> Import external tez component em-tgraph
> ---
>
> Key: TEZ-4330
> URL: https://issues.apache.org/jira/browse/TEZ-4330
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4329) Import external tez component em-table

2021-08-26 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4329.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

Thanks for review. Merged to master and branch-0.9

> Import external tez component em-table
> --
>
> Key: TEZ-4329
> URL: https://issues.apache.org/jira/browse/TEZ-4329
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Sreenath Somarajapuram
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (TEZ-4329) Import external tez component em-table

2021-08-26 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles reassigned TEZ-4329:
---

Assignee: Sreenath Somarajapuram

> Import external tez component em-table
> --
>
> Key: TEZ-4329
> URL: https://issues.apache.org/jira/browse/TEZ-4329
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Sreenath Somarajapuram
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4330) Import external tez component em-tgraph

2021-08-25 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17404657#comment-17404657
 ] 

Jonathan Turner Eagles commented on TEZ-4330:
-

FYI, [~Sreenath].

[~abstractdog], straight forward code move into our code base.

> Import external tez component em-tgraph
> ---
>
> Key: TEZ-4330
> URL: https://issues.apache.org/jira/browse/TEZ-4330
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4329) Import external tez component em-table

2021-08-25 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17404494#comment-17404494
 ] 

Jonathan Turner Eagles commented on TEZ-4329:
-

[~abstractdog], do you think you could help with this PR? Essentially moved 
code from external repo to internal repo.

> Import external tez component em-table
> --
>
> Key: TEZ-4329
> URL: https://issues.apache.org/jira/browse/TEZ-4329
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4329) Import external tez component em-table

2021-08-24 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17404155#comment-17404155
 ] 

Jonathan Turner Eagles commented on TEZ-4329:
-

[~Sreenath], imported em-table into tez-ui with apache license header.

> Import external tez component em-table
> --
>
> Key: TEZ-4329
> URL: https://issues.apache.org/jira/browse/TEZ-4329
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4328) Import external tez component em-helpers

2021-08-24 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4328.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

Thanks, [~Sreenath].

> Import external tez component em-helpers
> 
>
> Key: TEZ-4328
> URL: https://issues.apache.org/jira/browse/TEZ-4328
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (TEZ-4328) Import external tez component em-helpers

2021-08-24 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles reassigned TEZ-4328:
---

Assignee: Sreenath Somarajapuram

> Import external tez component em-helpers
> 
>
> Key: TEZ-4328
> URL: https://issues.apache.org/jira/browse/TEZ-4328
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Sreenath Somarajapuram
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4328) Import external tez component em-helpers

2021-08-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17403499#comment-17403499
 ] 

Jonathan Turner Eagles commented on TEZ-4328:
-

I will model the other imports off of this one. Hopefully, finishing this week.

> Import external tez component em-helpers
> 
>
> Key: TEZ-4328
> URL: https://issues.apache.org/jira/browse/TEZ-4328
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4328) Import external tez component em-helpers

2021-08-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17403496#comment-17403496
 ] 

Jonathan Turner Eagles commented on TEZ-4328:
-

If there is any specific needs, I will reach out and communicate it to you soon

> Import external tez component em-helpers
> 
>
> Key: TEZ-4328
> URL: https://issues.apache.org/jira/browse/TEZ-4328
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4328) Import external tez component em-helpers

2021-08-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17403477#comment-17403477
 ] 

Jonathan Turner Eagles commented on TEZ-4328:
-

[~Sreenath], this is a first import for em-helpers. You don't have to do a full 
review, but there is some signoff as code owner. I have moved all functionality 
into the tez ui itself. I have added the Apache license header. As owner for 
this code it will be important for you to sign off on the license. 

https://github.com/apache/tez/pull/145/files

> Import external tez component em-helpers
> 
>
> Key: TEZ-4328
> URL: https://issues.apache.org/jira/browse/TEZ-4328
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4331) Import external tez component mock-ats

2021-08-23 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4331:
---

 Summary: Import external tez component mock-ats
 Key: TEZ-4331
 URL: https://issues.apache.org/jira/browse/TEZ-4331
 Project: Apache Tez
  Issue Type: Sub-task
Reporter: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4330) Import external tez component em-tgraph

2021-08-23 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4330:
---

 Summary: Import external tez component em-tgraph
 Key: TEZ-4330
 URL: https://issues.apache.org/jira/browse/TEZ-4330
 Project: Apache Tez
  Issue Type: Sub-task
Reporter: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4329) Import external tez component em-table

2021-08-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4329:

Parent: TEZ-4327
Issue Type: Sub-task  (was: Bug)

> Import external tez component em-table
> --
>
> Key: TEZ-4329
> URL: https://issues.apache.org/jira/browse/TEZ-4329
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4329) Import external tez component em-table

2021-08-23 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4329:
---

 Summary: Import external tez component em-table
 Key: TEZ-4329
 URL: https://issues.apache.org/jira/browse/TEZ-4329
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4328) Import external tez component em-helpers

2021-08-23 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4328:
---

 Summary: Import external tez component em-helpers
 Key: TEZ-4328
 URL: https://issues.apache.org/jira/browse/TEZ-4328
 Project: Apache Tez
  Issue Type: Sub-task
Reporter: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4327) Migrate external Tez UI components under apache tez control

2021-08-19 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4327:
---

 Summary: Migrate external Tez UI components under apache tez 
control
 Key: TEZ-4327
 URL: https://issues.apache.org/jira/browse/TEZ-4327
 Project: Apache Tez
  Issue Type: Improvement
Reporter: Jonathan Turner Eagles


External dependencies that are owned by tez but not part of tez git.
https://github.com/sreenaths/mock-ats
https://github.com/sreenaths/em-helpers
https://github.com/sreenaths/em-table
https://github.com/sreenaths/em-tgraph

We should make a plan regarding importing these into the git repo.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4069) Avoid repeated computation of preferred locations in split grouping.

2021-08-16 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17399946#comment-17399946
 ] 

Jonathan Turner Eagles commented on TEZ-4069:
-

[~abstractdog]/ [~odraese], This patch relies on correct implementation of 
hashCode and equals for SplitContainers. With this, I'm afraid the optimization 
will have worse performance than the current implementation. Can we confirm 
performance increase across other implementations 
(HostAffinitySplitLocationProvider mentioned above)? Can we confirm default 
hashCode and equals?

> Avoid repeated computation of preferred locations in split grouping.
> 
>
> Key: TEZ-4069
> URL: https://issues.apache.org/jira/browse/TEZ-4069
> Project: Apache Tez
>  Issue Type: Improvement
>Affects Versions: 0.9.2
>Reporter: Oliver Draese
>Priority: Major
> Attachments: TEZ-4069.1.patch, TEZ-4069.patch
>
>
> The TezSplitGrouper iterates through the list of splits multiple times, when 
> trying to group the splits (see getGroupedSplits). Each time, it asks the 
> locationProvider to return the array of preferred locations for the splits. 
> This has two side effects:
>  * generating the list of preferred locations can cause some CPU overhead 
> (i.e. calculating the consistent hash in HostAffinitySplitLocationProvider), 
> which can be avoided
>  * if the list of preferred location is changing between the different loops 
> of getGroupedSplits, we might encounter a NullPointerException. This happens 
> if a new location appears, that was not part of the initial set of locations 
> when populating the distinctLocations map.
> The getGroupedSplits should query the preferred locations only once (for each 
> split) via the location provider and then memorize these instead of asking 
> the location provider repeatedly.
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4325) Counter for sort time

2021-08-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17398100#comment-17398100
 ] 

Jonathan Turner Eagles commented on TEZ-4325:
-

These open jiras regarding sort counters have some interesting discussion. 
Hopefully we can build off of those learnings.

> Counter for sort time
> -
>
> Key: TEZ-4325
> URL: https://issues.apache.org/jira/browse/TEZ-4325
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Attachments: Counters_SORT_TIME_TPCDS_q40.txt
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> It's already logged for every SortSpan in PipelinedSorter, but it can be 
> exposed via a counter to get better insights quickly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2262) DAG/Tasks should not fail if counter limits are exceeded.

2021-08-12 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2262.
-
Resolution: Incomplete

Creating an option to make counters limits exceeded non-fatal is valid. This 
can be achieved by setting a very high limit as a work around. Closing as 
incomplete

> DAG/Tasks should not fail if counter limits are exceeded.
> -
>
> Key: TEZ-2262
> URL: https://issues.apache.org/jira/browse/TEZ-2262
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.5.0
>Reporter: Mostafa Mokhtar
>Priority: Major
>
> Running TPC-DS Q64 failed due to exceeding the max number of counters.
> DAG should succeed and include a warning in the diagnostics stating that the 
> error got truncated.
> {code}
> 18043560327-2015-04-01 16:23:08,509 INFO [AsyncDispatcher event handler] 
> impl.DAGImpl: No output committers for vertex: Reducer 9
> 18043560445-2015-04-01 16:23:08,857 FATAL [AsyncDispatcher event handler] 
> event.AsyncDispatcher: Error in dispatcher thread
> 18043560557:org.apache.tez.common.counters.LimitExceededException: Too many 
> counters: 1201 max=1200
> 18043560645-  at 
> org.apache.tez.common.counters.Limits.checkCounters(Limits.java:87)
> 18043560717-  at 
> org.apache.tez.common.counters.Limits.incrCounters(Limits.java:94)
> 18043560788-  at 
> org.apache.tez.common.counters.AbstractCounterGroup.addCounter(AbstractCounterGroup.java:75)
> 18043560885-  at 
> org.apache.tez.common.counters.AbstractCounterGroup.addCounterImpl(AbstractCounterGroup.java:92)
> 18043560986-  at 
> org.apache.tez.common.counters.AbstractCounterGroup.findCounter(AbstractCounterGroup.java:103)
> 18043561085-  at 
> org.apache.tez.common.counters.AbstractCounterGroup.incrAllCounters(AbstractCounterGroup.java:198)
> 18043561188-  at 
> org.apache.tez.common.counters.AbstractCounters.incrAllCounters(AbstractCounters.java:363)
> 18043561283-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.incrTaskCounters(DAGImpl.java:598)
> 18043561362-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.getAllCounters(DAGImpl.java:588)
> 18043561439-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.logJobHistoryFinishedEvent(DAGImpl.java:994)
> 18043561528-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.finished(DAGImpl.java:1135)
> 18043561600-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.checkDAGForCompletion(DAGImpl.java:1048)
> 18043561685-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl$VertexCompletedTransition.transition(DAGImpl.java:1708)
> 18043561785-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl$VertexCompletedTransition.transition(DAGImpl.java:1665)
> 18043561885-  at 
> org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:385)
> 18043562001-  at 
> org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:302)
> 18043562097-  at 
> org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
> 18043562190-  at 
> org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
> 18043562307-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.handle(DAGImpl.java:944)
> 18043562376-  at 
> org.apache.tez.dag.app.dag.impl.DAGImpl.handle(DAGImpl.java:126)
> 18043562445-  at 
> org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher.handle(DAGAppMaster.java:1686)
> 18043562535-  at 
> org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher.handle(DAGAppMaster.java:1677)
> 18043562625-  at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:173)
> 18043562709-  at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:106)
> 18043562790-  at java.lang.Thread.run(Thread.java:745)
> 18043562832-2015-04-01 16:23:08,882 INFO [AsyncDispatcher event handler] 
> event.AsyncDispatcher: Exiting, bbye..
> 18043562932-2015-04-01 16:23:08,885 INFO [Thread-1] app.DAGAppMaster: 
> DAGAppMasterShutdownHook invoked
> 18043563023-2015-04-01 16:23:08,885 INFO [Thread-1] app.DAGAppMaster: 
> DAGAppMaster received a signal. Signaling TaskScheduler
> 18043563137-2015-04-01 16:23:08,885 INFO [Thread-1] 
> rm.TaskSchedulerEventHandler: TaskScheduler notified that iSignalled was : 
> true
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4293) Use Tez credentials for CI

2021-08-11 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397382#comment-17397382
 ] 

Jonathan Turner Eagles commented on TEZ-4293:
-

Thanks for the patch, [~abstractdog]. 

> Use Tez credentials for CI
> --
>
> Key: TEZ-4293
> URL: https://issues.apache.org/jira/browse/TEZ-4293
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> TEZ-4291 only managed to work with hadoop credentials, this needs to be fixed.
> For yetus debugging, this revision can be used:
> https://github.com/abstractdog/yetus/commit/aa8223ecdf84eb7cd963c3e5acb2e31b60a19b97#diff-9cb6a2dcf2678adfcaa19a33ce63447f9545744d22c511a4b2bb3d909dee2bc1R587-R593
> For manual testing:
> {code}
> curl -X POST -H "Accept: application/vnd.github.v3.full+json" -H 
> "Content-Type: application/json" -H "Authorization: token ${GITHUB_TOKEN}" -d 
> '{"body":"HELLO TEST COMMENT"}' --location 
> "https://api.github.com/repos/apache/tez/issues/112/comments";
> {code}
> Last error message which is shown from jenkins with the logging yetus 
> revision is:
> {code}
> 23:56:30  
> 
> 23:56:30  
> 
> 23:56:30   Adding comment to Github
> 23:56:30  
> 
> 23:56:30  
> 
> 23:56:30  
> 23:56:30  
> 23:56:30  authenticating with token 
> 23:56:30  POSTING TO 
> https://api.github.com/repos/apache/tez/issues/112/comments
> 23:56:30% Total% Received % Xferd  Average Speed   TimeTime 
> Time  Current
> 23:56:30   Dload  Upload   Total   Spent
> Left  Speed
> 23:56:30  
>   0 00 00 0  0  0 --:--:-- --:--:-- --:--:-- 0
> 100  19180 0  100  1918  0   8678 --:--:-- --:--:-- --:--:--  8639
> 100  2072  100   154  100  1918603   7521 --:--:-- --:--:-- --:--:--  8093
> 23:56:30  {
> 23:56:30"message": "Resource not accessible by integration",
> 23:56:30"documentation_url": 
> "https://docs.github.com/rest/reference/issues#create-an-issue-comment";
> 23:56:30  }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4321) Do Not Wrap Singleton Collections

2021-08-09 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4321.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

Thanks, [~belugabehr]. Simple straight-forward improvement.

> Do Not Wrap Singleton Collections
> -
>
> Key: TEZ-4321
> URL: https://issues.apache.org/jira/browse/TEZ-4321
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Singleton Collections are by their design immutable.  There is no need to 
> wrap them in an unmodifiableCollection.
> https://docs.oracle.com/javase/9/docs/api/java/util/Collections.html#singletonList-T-
> {code:none}
> public static  List singletonList​(T o)
> Returns an immutable list containing only the specified object.
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4314) Fix some plugin versions in site generation

2021-06-30 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4314.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

> Fix some plugin versions in site generation
> ---
>
> Key: TEZ-4314
> URL: https://issues.apache.org/jira/browse/TEZ-4314
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> {code}
> mvn site
> {code}
> command failed for me with various errors, which I fixed as below:
> {code}
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-site-plugin:3.7.1:site (default-site) on 
> project tez-docs: failed to get report for 
> org.apache.maven.plugins:maven-project-info-reports-plugin: Could not find 
> goal 'project-team' in plugin 
> org.apache.maven.plugins:maven-project-info-reports-plugin:3.0.0 among 
> available goals ci-management, dependencies, dependency-convergence, 
> dependency-info, dependency-management, distribution-management, help, index, 
> issue-management, licenses, mailing-lists, modules, plugin-management, 
> plugins, scm, summary, team -> [Help 1]
> [ERROR]
> [ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
> switch.
> [ERROR] Re-run Maven using the -X switch to enable full debug logging.
> [ERROR]
> [ERROR] For more information about the errors and possible solutions, please 
> read the following articles:
> [ERROR] [Help 1] 
> http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
> [ERROR]
> [ERROR] After correcting the problems, you can resume the build with the 
> command
> [ERROR]   mvn  -rf :tez-docs
> {code}
> 1. maven-site-plugin
> 3.x doesn't like some html elements within XML, instead of CDATA and other 
> hacks, I fixed with an old plugin version to pass for now
> error was like:
> {code}
> Caused by: org.codehaus.plexus.util.xml.pull.XmlPullParserException: TEXT 
> must be immediately followed by END_TAG and not START_TAG (position: 
> START_TAG seen ...\n   End of Google analytics -->\n   
> 

[jira] [Resolved] (TEZ-4267) Remove Superfluous Code from DAGAppMaster

2021-06-30 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4267.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

+1. Thanks, [~belugabehr]. Committed this change and cherry-picked to 
branch-0.9 with a few minor conflicts.

> Remove Superfluous Code from DAGAppMaster
> -
>
> Key: TEZ-4267
> URL: https://issues.apache.org/jira/browse/TEZ-4267
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4313) Apache Tez Release 0.10.1

2021-06-30 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4313:

Fix Version/s: (was: 0.10.2)
   0.10.1

> Apache Tez Release 0.10.1
> -
>
> Key: TEZ-4313
> URL: https://issues.apache.org/jira/browse/TEZ-4313
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4313) Apache Tez Release 0.10.1

2021-06-30 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4313.
-
Fix Version/s: 0.10.2
   Resolution: Fixed

I'm going to assume the failures are ok in this particular case.

> Apache Tez Release 0.10.1
> -
>
> Key: TEZ-4313
> URL: https://issues.apache.org/jira/browse/TEZ-4313
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.2
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4308) Add Whitespace in ShuffleScheduler Error Message

2021-06-30 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4308.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

> Add Whitespace in ShuffleScheduler Error Message
> 
>
> Key: TEZ-4308
> URL: https://issues.apache.org/jira/browse/TEZ-4308
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Trivial
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4275) Use Google Guava Intern Facility

2021-06-30 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4275.
-
Fix Version/s: 0.10.2
   0.9.3
   Resolution: Fixed

Thanks, [~belugabehr]. Committed to master and branch-0.9.

> Use Google Guava Intern Facility
> 
>
> Key: TEZ-4275
> URL: https://issues.apache.org/jira/browse/TEZ-4275
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
>  Time Spent: 3h 50m
>  Remaining Estimate: 0h
>
> Google Guava has a pretty cool Interner facility.
>  
>  * More memory efficient than current offering.  Map contains a weak key and 
> a static dummy value (current implementation uses a weak value)
>  * Current implementation has a single lock around the entire data structure. 
>  Google segments their data structure into (default: 4) segments for better 
> concurrency
>  * All the other thoughtful stuff Google has added into this feature



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4298) ShuffleHandler is not source compatible with Hadoop 3.3.0

2021-06-25 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369677#comment-17369677
 ] 

Jonathan Turner Eagles commented on TEZ-4298:
-

[~tasanuma], I have pushed this change to branch-0.9. There is nothing specific 
to hadoop 3.3 about this change and it will help support both lines as it will 
be easier to back port other changes to branch-0.9 without conflict.

As far as running tez 0.9 releases on hadoop 3.3, that is problematic. There 
are a number conflicting dependencies with hadoop 3.3 that prevent hadoop 3.3 
and tez 0.9 being in the same classpath (jetty, protobuf) during runtime. Tez 
0.9 and 0.10 are nearly identical with the most significant change being hadoop 
3.x compatibility. 

> ShuffleHandler is not source compatible with Hadoop 3.3.0
> -
>
> Key: TEZ-4298
> URL: https://issues.apache.org/jira/browse/TEZ-4298
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
> Attachments: TEZ-4298.001.patch, TEZ-4298.001.patch, 
> TEZ-4298.002.patch
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> {noformat}
> mvn clean package -DskipTests -pl '!tez-ui' -Dhadoop.version=3.3.0
> [ERROR] COMPILATION ERROR : 
> [INFO] -
> [ERROR] 
> tez/tez-plugins/tez-aux-services/src/main/java/org/apache/tez/auxservices/ShuffleHandler.java:[764,45]
>  incompatible types: com.google.protobuf.ByteString cannot be converted to 
> org.apache.hadoop.thirdparty.protobuf.ByteString
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4298) ShuffleHandler is not source compatible with Hadoop 3.3.0

2021-06-25 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4298:

Fix Version/s: 0.9.3

> ShuffleHandler is not source compatible with Hadoop 3.3.0
> -
>
> Key: TEZ-4298
> URL: https://issues.apache.org/jira/browse/TEZ-4298
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.9.3, 0.10.2
>
> Attachments: TEZ-4298.001.patch, TEZ-4298.001.patch, 
> TEZ-4298.002.patch
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> {noformat}
> mvn clean package -DskipTests -pl '!tez-ui' -Dhadoop.version=3.3.0
> [ERROR] COMPILATION ERROR : 
> [INFO] -
> [ERROR] 
> tez/tez-plugins/tez-aux-services/src/main/java/org/apache/tez/auxservices/ShuffleHandler.java:[764,45]
>  incompatible types: com.google.protobuf.ByteString cannot be converted to 
> org.apache.hadoop.thirdparty.protobuf.ByteString
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4298) ShuffleHandler is not source compatible with Hadoop 3.3.0

2021-06-22 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17367670#comment-17367670
 ] 

Jonathan Turner Eagles commented on TEZ-4298:
-

+1. From me on 002 patch changes. I will approve PR and merge.

> ShuffleHandler is not source compatible with Hadoop 3.3.0
> -
>
> Key: TEZ-4298
> URL: https://issues.apache.org/jira/browse/TEZ-4298
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Attachments: TEZ-4298.001.patch, TEZ-4298.001.patch, 
> TEZ-4298.002.patch
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> {noformat}
> mvn clean package -DskipTests -pl '!tez-ui' -Dhadoop.version=3.3.0
> [ERROR] COMPILATION ERROR : 
> [INFO] -
> [ERROR] 
> tez/tez-plugins/tez-aux-services/src/main/java/org/apache/tez/auxservices/ShuffleHandler.java:[764,45]
>  incompatible types: com.google.protobuf.ByteString cannot be converted to 
> org.apache.hadoop.thirdparty.protobuf.ByteString
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4297) Hive CLI not working after upgrading from Oracle JDK 8u112 to 8u281 and have errors with TEZ

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331068#comment-17331068
 ] 

Jonathan Turner Eagles commented on TEZ-4297:
-

The code above is all part of hive and hadoop, so I think there is no Tez issue 
here. [~norbert.k.b.maclang], is this still a problem? From my looking at the 
log above, it seems there is a misconfiguration of KMS.

> Hive CLI not working after upgrading from Oracle JDK 8u112 to 8u281 and have 
> errors with TEZ
> 
>
> Key: TEZ-4297
> URL: https://issues.apache.org/jira/browse/TEZ-4297
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Norbert Kiam Maclang
>Priority: Major
>
> After upgrading Oracle JDK version from jdk-8u112 to jdk-8u281, Hive CLI is 
> not working anymore and gives below error when logging in.
> {code:java}
> WARNING: Use "yarn jar" to launch YARN applications.
> 21/03/09 11:00:04 WARN conf.HiveConf: HiveConf of name 
> hive.server2.enable.impersonation does not existLogging initialized using 
> configuration in file:/etc/hive/2.4.3.0-227/0/hive-log4j.properties
> Exception in thread "main" java.lang.RuntimeException: java.io.IOException: 
> Previous writer likely failed to write 
> hdfs://ppcontent-nn1.pp-content.dataplatform.com:8020/tmp/hive/hive/_tez_session_dir/96b21825-63f4-4316-9c43-20ebe641d9c9/hive-hcatalog-core.jar.
>  Failing because I am unlikely to write too.
> at 
> org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:544)
> at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:680)
> at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:624)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
> at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
> Caused by: java.io.IOException: Previous writer likely failed to write 
> hdfs://ppcontent-nn1.pp-content.dataplatform.com:8020/tmp/hive/hive/_tez_session_dir/96b21825-63f4-4316-9c43-20ebe641d9c9/hive-hcatalog-core.jar.
>  Failing because I am unlikely to write too.
> at 
> org.apache.hadoop.hive.ql.exec.tez.DagUtils.localizeResource(DagUtils.java:982)
> at 
> org.apache.hadoop.hive.ql.exec.tez.DagUtils.addTempResources(DagUtils.java:862)
> at 
> org.apache.hadoop.hive.ql.exec.tez.DagUtils.localizeTempFilesFromConf(DagUtils.java:805)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionState.refreshLocalResourcesFromConf(TezSessionState.java:233)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionState.open(TezSessionState.java:158)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionState.open(TezSessionState.java:117)
> at 
> org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:541)
> ... 8 more
> {code}
> Version we are using:
>  * Ambari 2.2.2
>  * Hive 1.2.1
>  * Hadoop 2.7
>  * Spark 1.6
>  * HDP 2.4
>  * Tez 0.7.0.2.4



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-3987) Schedule giving priorities based on topological order

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-3987.
-
Resolution: Later

If this feature is still needed in addition to TEZ-394, we can reopen.

> Schedule giving priorities based on topological order
> -
>
> Key: TEZ-3987
> URL: https://issues.apache.org/jira/browse/TEZ-3987
> Project: Apache Tez
>  Issue Type: New Feature
>Reporter: Jaume M
>Assignee: Jaume M
>Priority: Major
>
> It'd be an improvement for some DAGs to be scheduled in a topological order 
> as opposed to the scheduling based on distance from the root from 
> {{DAGScheduler}} and {{DAGSchedulerControlled}}.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4245) Optimise split grouping when locality information is set to null/empty

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331064#comment-17331064
 ] 

Jonathan Turner Eagles commented on TEZ-4245:
-

[~rajesh.balamohan], is this patch ready for review?

> Optimise split grouping when locality information is set to null/empty
> --
>
> Key: TEZ-4245
> URL: https://issues.apache.org/jira/browse/TEZ-4245
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Rajesh Balamohan
>Priority: Major
> Attachments: TEZ-4245.1.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> In objectstores like S3, locality information always shows up as "localhost". 
>  Having this information in inputsplit slows down scheduling as explained in 
> https://issues.apache.org/jira/browse/HIVE-14060 Systems like hive remove 
> "localhost" information from splits.
>  
> Split information without any locality information (localhost/null/empty) 
> should be treated equally, so that split grouping can do meaningful grouping 
> based on cluster size. This is to avoid creating small split groups, which 
> can significantly increase runtime due to sequential processing (i.e same map 
> task getting lots of inputs and system ends up spending time in 
> open/seek/close on objectstores).
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4192) Attach task specific info in scheduler and retrieve in communicator

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331061#comment-17331061
 ] 

Jonathan Turner Eagles commented on TEZ-4192:
-

[~mustafaiman], this patch seems to be coming along. Are you looking for a 
review?

> Attach task specific info in scheduler and retrieve in communicator
> ---
>
> Key: TEZ-4192
> URL: https://issues.apache.org/jira/browse/TEZ-4192
> Project: Apache Tez
>  Issue Type: Improvement
>Affects Versions: 0.9.2
>Reporter: Mustafa İman
>Assignee: Mustafa İman
>Priority: Major
> Attachments: TEZ-4192.1.patch, TEZ-4192.2.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> There is no way to pass information from scheduler to communicator. Sometimes 
> we want to let communicator know about extra properties of the task other 
> than its spec. One example is if a task is preemptable.
> When scheduler receives a task, its taskspec is already constructed. It is a 
> nice immutable object at this point. We do not want to modify and put more 
> info in it at that stage. So we pass a new piece of information to 
> TaskSchedulerContext#taskAllocated along with the allocation. This info is 
> passed to the communicator untouched. It is up to the communicator 
> implementation to make use of it. Communicator implementation needs to 
> override a new registerRunningTaskAttempt that accepts this additional object 
> along with all the other arguments in the original registerRunningTaskAttempt 
> method.
> The new registerRunningTaskAttempt method has a default implementation that 
> falls back to the old method. So if the user does not override the new 
> method, the same old behavior is preserved. Similarly on scheduler side, the 
> user needs to call new taskAllocated method to pass the additional info 
> object. If they use the old taskAllocated method, old behavior is preserved.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4272) Service org.apache.tez.dag.app.DAGAppMaster failed in state STARTED org.apache.hadoop.service.ServiceStateException: java.lang.ExceptionInInitializerError

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331060#comment-17331060
 ] 

Jonathan Turner Eagles commented on TEZ-4272:
-

Currently, there is not support for hadoop 3.3. Please use TEZ-4150 to track

> Service org.apache.tez.dag.app.DAGAppMaster failed in state STARTED 
> org.apache.hadoop.service.ServiceStateException: 
> java.lang.ExceptionInInitializerError
> --
>
> Key: TEZ-4272
> URL: https://issues.apache.org/jira/browse/TEZ-4272
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.9.2, 0.10.0
>Reporter: Nomen'Aina
>Priority: Major
> Attachments: log.txt
>
>
> Hello to all.
> Currently I am trying to install Apache tez by following step by step, the 
> instructions on the official website, and when I run the command: 
> $HADOOP_PREFIX/bin/hadoop jar tez-examples.jar orderedwordcount 
> /tmp/input/file.txt /tmp/outout
> I get the following errors:
> Service org.apache.tez.dag.app.rm.TaskSchedulerManager failed in state 
> STARTED java.lang.NullPointerException at 
> org.apache.tez.dag.app.rm.TaskSchedulerManager.serviceStart(TaskSchedulerManager.java:653)
> and
> Error starting DAGAppMaster org.apache.hadoop.service.ServiceStateException: 
> java.lang.ExceptionInInitializerError
>  
>  
> hadoop version:3.3.0 / tez version: 0.10.0



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (TEZ-1019) Re-factor routing of events to use common code path for normal and recovery flow.

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331048#comment-17331048
 ] 

Jonathan Turner Eagles edited comment on TEZ-1019 at 4/23/21, 9:19 PM:
---

Closing recovery v1 bugs/features.


was (Author: jeagles):
VertexEventSourceVertexRecovered

> Re-factor routing of events to use common code path for normal and recovery 
> flow.
> -
>
> Key: TEZ-1019
> URL: https://issues.apache.org/jira/browse/TEZ-1019
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Assignee: Jeff Zhang
>Priority: Major
> Attachments: TEZ-1019-2.patch, TEZ-1019-3.patch, TEZ-1019-4.patch, 
> TEZ-1019-5.patch, Tez-1019.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-15) Support for DAG AM recovery

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-15?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-15.
---
Resolution: Fixed

> Support for DAG AM recovery
> ---
>
> Key: TEZ-15
> URL: https://issues.apache.org/jira/browse/TEZ-15
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: Bikas Saha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2544) Incorrect dag result due to wrong TaskSpec in recovering

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2544.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Incorrect dag result due to wrong TaskSpec in recovering
> 
>
> Key: TEZ-2544
> URL: https://issues.apache.org/jira/browse/TEZ-2544
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jeff Zhang
>Assignee: Jeff Zhang
>Priority: Critical
>  Labels: Recovery
>
> Expected TaskSpec
> {noformat}
> DAGName : OrderedWordCount, VertexName: Summation, VertexParallelism: 1, 
> TaskAttemptID:attempt_1433850314856_0019_1_01_00_0, 
> processorName=org.apache.tez.examples.OrderedWordCount$SumProcessor, 
> inputSpecListSize=1, 
> outputSpecListSize=1, inputSpecList=[{{ sourceVertexName=Tokenizer, 
> physicalEdgeCount=2, 
> inputClassName=org.apache.tez.runtime.library.input.OrderedGroupedKVInput }}, 
> ], outputSpecList=[{{ destinationVertexName=Sorter, physicalEdgeCount=1, 
> outputClassName=org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput
>  }}
> {noformat}
> The actual TaskSpec
> {noformat}
> DAGName : OrderedWordCount, VertexName: Summation, VertexParallelism: 1, 
> TaskAttemptID:attempt_1433850314856_0019_1_01_00_0, 
> processorName=org.apache.tez.examples.OrderedWordCount$SumProcessor, 
> inputSpecListSize=1, 
> outputSpecListSize=1, inputSpecList=[{{ sourceVertexName=Tokenizer, 
> physicalEdgeCount=1, 
> inputClassName=org.apache.tez.runtime.library.input.OrderedGroupedKVInput }}, 
> ], outputSpecList=[{{ destinationVertexName=Sorter, physicalEdgeCount=1, 
> outputClassName=org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput
>  }}
> {noformat}
> The expected physicalEdgeCount is 2 but actually it is 1, it happens when 
> dynamic parallelism estimation is enabled. 
> The cause is that Task is recovering but its vertex's source edge manager has 
> not been updated from ScatterGatherEdgeManager to CustomShuffleEdgeManager, 
> so will result in different physicalEdgeCount for InputSpec



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2477) Session stats should be recovered

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2477.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Session stats should be recovered 
> --
>
> Key: TEZ-2477
> URL: https://issues.apache.org/jira/browse/TEZ-2477
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jeff Zhang
>Assignee: Jeff Zhang
>Priority: Major
>
> {code}
>   diagResult.add("Session stats:"
>   + "submittedDAGs=" + submittedDAGs.get()
>   + ", successfulDAGs=" + successfulDAGs.get()
>   + ", failedDAGs=" + failedDAGs.get()
>   + ", killedDAGs=" + killedDAGs.get());
> {code}
> These data can be recovered from recovery summary file.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2456.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Refactor recovery event logging to ensure it meet the recovery event spec
> -
>
> Key: TEZ-2456
> URL: https://issues.apache.org/jira/browse/TEZ-2456
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jeff Zhang
>Assignee: Jeff Zhang
>Priority: Major
>  Labels: Recovery
>
> Currently we don't have spec for the recovery event logging. Recovery would 
> be fragile to code change. This jira try to define the spec and refactor the 
> recovery event logging to ensure it meet the spec. [~hitesh] Please help 
> review the following spec I drafted.
> *DAG*
> * DAGSubmitted/DAGInitializedEvent/DAGStartedEvent must been logged once, 
> Should not log it again when it’s recovered.
> * DAGFinishedEvent may be logged multiple times.  ( DAG move from SUCCEEDED 
> from ERROR ? Should we ignore this ? )
> * VertexFinishedEvent should be logged before DAGFinishedEvent
> *Vertex* 
> * RootInputDataInformation must be logged before VertexInitializedEvent
> * DataMovement must be logged before TaskFinishedEvent
> * TaskFinishedEvent must be logged before VertexFinishedEvent
> * VertexInitializedEvent / VertexStartedEvent should only be logged once, 
> should not log again when it’s recovered.
> * VertexFinishedEvent may be logged multiple times. (e.g. Vertex move from 
> SUCCEEDED to FAILED)
> * VertexParallelismUpdatedEvent must be logged before TaskStartedEvent
> * For VertexFinishedEvent (SUCCEEDED), before it there must be at least n 
> TaskFinishedEvent (SUCCEEDED)
> *Task*
> * If there’s no TaskStartedEvent, TaskFinishedEvent may still be logged (e.g. 
> Task is killed in NEW )  Current’s behavior is that TaskFinishedEvent won’t 
> be logged if there’s no TaskStartedEvent. 
> * TaskStartedEvent should only be logged once.  Should not log again when 
> it’s recovered.
> * TaskFinishedEvent may be logged multiple times (e.g. Task move from 
> SUCCEEDED to FAILED)
> * For TaskFinishedEvent (SUCCEEDED), before it there must be at least one 
> TaskAttemptFinishedEvent (SUCCEEDED)
>   
> *TaskAttempt*
> * If there’s no TaskAttemptStartedEvent, TaskAttemptFinishedEvent may still 
> be logged ( e.g. TaskAttempt is killed in NEW )  Current’s behavior is that 
> TaskAttemptFinishedEvent won’t be logged if there’s no TaskAttemptStartedEvent
> * TaskAttemptStartedEvent should only be logged once.  Should not log again 
> when it’s recovered.
> * TaskAttemptFinishedEvent may be logged multiple times. (e.g. TaskAttempt 
> move from SUCCEEDED to FAILED)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2418) TASK_ATTEMPT_FAILED_EVENT and TASK_COMPLETED_EVENT should move back to direct routing to attempt

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2418.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> TASK_ATTEMPT_FAILED_EVENT and TASK_COMPLETED_EVENT should move back to direct 
> routing to attempt
> 
>
> Key: TEZ-2418
> URL: https://issues.apache.org/jira/browse/TEZ-2418
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Bikas Saha
>Priority: Major
> Attachments: TEZ-2418.1.patch
>
>
> Due to recovery code path, they are currently double routed to the vertex 
> first and then the attempt.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2431) Recovery of task events (eg. datamovement events) should not depend on ordering of task attempt events

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2431.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Recovery of task events (eg. datamovement events) should not depend on 
> ordering of task attempt events
> --
>
> Key: TEZ-2431
> URL: https://issues.apache.org/jira/browse/TEZ-2431
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Bikas Saha
>Priority: Major
>
> Today, task attempt events need to go through verteximpl before reaching the 
> task in order to maintain ordering guarantees for recovery. This causes these 
> events to be routed twice through the dispatcher. This can cause overhead 
> delays in large jobs. Also, this makes assumptions about event ordering which 
> make the system fragile. Recovery should work independently of other system 
> interactions so that evolution of other components is not affected by 
> recovery unless it affects recovery logically. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-2107) Recovery failure in the case of Auto-reduce parallelism

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-2107.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Recovery failure in the case of Auto-reduce parallelism
> ---
>
> Key: TEZ-2107
> URL: https://issues.apache.org/jira/browse/TEZ-2107
> Project: Apache Tez
>  Issue Type: Sub-task
>Affects Versions: 0.6.0
>Reporter: Jeff Zhang
>Assignee: Jeff Zhang
>Priority: Major
>
> The following errors happens when recovering in the case of auto-reduce 
> parallelism.  The task number is reduced from 2 to 1. while the upstream 
> vertex's DataMovementEvent is still routed to task 2 which has been removed 
> when auto-reduce parallelism.
> {code}
> 2015-02-16 09:11:54,587 FATAL [Dispatcher thread: Central] 
> common.AsyncDispatcher: Error in dispatcher thread
> org.apache.tez.dag.api.TezUncheckedException: Unexpected null task. 
> sourceVertex=vertex_1424048826974_0002_1_00 [scope-47] srcTaskIndex = 0 
> destVertex=vertex_1424048826974_0002_1_01 [scope-50] destTaskIndex=1 
> destNumTasks=1 
> edgeManager=org.apache.tez.dag.app.dag.impl.ScatterGatherEdgeManager
> at 
> org.apache.tez.dag.app.dag.impl.Edge.sendDmEventOrIfEventToTasks(Edge.java:358)
> at 
> org.apache.tez.dag.app.dag.impl.Edge.sendTezEventToDestinationTasks(Edge.java:422)
> at 
> org.apache.tez.dag.app.dag.impl.Edge.handleCompositeDataMovementEvent(Edge.java:310)
> at 
> org.apache.tez.dag.app.dag.impl.Edge.sendTezEventToDestinationTasks(Edge.java:378)
> at 
> org.apache.tez.dag.app.dag.impl.VertexImpl.handleRoutedTezEvents(VertexImpl.java:3795)
> at 
> org.apache.tez.dag.app.dag.impl.VertexImpl.access$3600(VertexImpl.java:187)
> at 
> org.apache.tez.dag.app.dag.impl.VertexImpl$RouteEventTransition.transition(VertexImpl.java:3708)
> at 
> org.apache.tez.dag.app.dag.impl.VertexImpl$RouteEventTransition.transition(VertexImpl.java:3700)
> at 
> org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:385)
> at 
> org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:302)
> at 
> org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
> at 
> org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
> at 
> org.apache.tez.state.StateMachineTez.doTransition(StateMachineTez.java:57)
> at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:1575)
> at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:186)
> at 
> org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1802)
> at 
> org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1788)
> at 
> org.apache.tez.common.AsyncDispatcher.dispatch(AsyncDispatcher.java:184)
> at org.apache.tez.common.AsyncDispatcher$1.run(AsyncDispatcher.java:115)
> at java.lang.Thread.run(Thread.java:745)
> {code}
> The following exception will also happen sometimes
> {code}
> 2015-06-10 08:02:03,417 ERROR [Dispatcher thread: Central] impl.VertexImpl: 
> Exception in VertexManager, vertex:vertex_1433894507873_0001_1_01 [Summation]
> org.apache.tez.dag.app.dag.impl.AMUserCodeException: 
> org.apache.tez.dag.api.TezUncheckedException: Atleast 1 bipartite source 
> should exist, vertexName=Summation
>   at 
> org.apache.tez.dag.app.dag.impl.VertexManager$VertexManagerCallback.onFailure(VertexManager.java:516)
>   at com.google.common.util.concurrent.Futures$6.run(Futures.java:977)
>   at 
> com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:253)
>   at 
> com.google.common.util.concurrent.ExecutionList$RunnableExecutorPair.execute(ExecutionList.java:149)
>   at 
> com.google.common.util.concurrent.ExecutionList.execute(ExecutionList.java:134)
>   at 
> com.google.common.util.concurrent.ListenableFutureTask.done(ListenableFutureTask.java:86)
>   at java.util.concurrent.FutureTask.finishCompletion(FutureTask.java:380)
>   at java.util.concurrent.FutureTask.setException(FutureTask.java:247)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:267)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>   at java.lang.Thread.run(Thread.java:745)
> Caused by: org.apache.tez.dag.api.TezUncheckedException: Atleast 1 bipartite 
> source should exist, vertexName=Summation
>   at 
> org.apache.tez.dag.library.vertexmanager.ShuffleVertexManager.onVertexStarted(ShuffleVertexMa

[jira] [Resolved] (TEZ-1565) Add tests for checking custom vertex managers like auto-reduce parallelism in recovery

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-1565.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Add tests for checking custom vertex managers like auto-reduce parallelism in 
> recovery
> --
>
> Key: TEZ-1565
> URL: https://issues.apache.org/jira/browse/TEZ-1565
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Assignee: Jeff Zhang
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-1384) Move recovery related code into inner class

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-1384.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Move recovery related code into inner class
> ---
>
> Key: TEZ-1384
> URL: https://issues.apache.org/jira/browse/TEZ-1384
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jeff Zhang
>Assignee: Jeff Zhang
>Priority: Major
> Attachments: Tez-1384-2.patch, Tez-1384.patch
>
>
> Currently each entity (DAG, Vertex, Task, TaskAttempt) has some common 
> recovery code like log history event and restore from history event. These 
> are 2 opposite aspects of recovery. One for store status while the other is 
> for restore status. This jira is for putting these pieces of code together ( 
> in an inner class ). In this way, it is easy to maintain and cut down the 
> possibility that one field is not stored or restored.  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-1019) Re-factor routing of events to use common code path for normal and recovery flow.

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-1019.
-
Resolution: Won't Fix

VertexEventSourceVertexRecovered

> Re-factor routing of events to use common code path for normal and recovery 
> flow.
> -
>
> Key: TEZ-1019
> URL: https://issues.apache.org/jira/browse/TEZ-1019
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Assignee: Jeff Zhang
>Priority: Major
> Attachments: TEZ-1019-2.patch, TEZ-1019-3.patch, TEZ-1019-4.patch, 
> TEZ-1019-5.patch, Tez-1019.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-1156) setParallelism in recovery does not send event to downstream vertices

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-1156.
-
Resolution: Won't Fix

Closing recovery v1 bugs/features.

> setParallelism in recovery does not send event to downstream vertices
> -
>
> Key: TEZ-1156
> URL: https://issues.apache.org/jira/browse/TEZ-1156
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Bikas Saha
>Priority: Major
>
> In non-recovery case, downstream vertices are notified. If say, parallelism 
> is update and saved but the AM dies before downstream vertices get the event 
> and update their parallelism then the recovered AM will set the parallelism 
> of the recovered vertex but the downstream vertices may remain uninitialized.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-993) Remove application logic from RecoveryService

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-993.

Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Remove application logic from RecoveryService
> -
>
> Key: TEZ-993
> URL: https://issues.apache.org/jira/browse/TEZ-993
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Bikas Saha
>Assignee: Jeff Zhang
>Priority: Major
> Attachments: TEZ-993-3.patch, TEZ-993-4.patch, TEZ-993-5.patch, 
> Tez-993-2.patch, Tez-993.patch
>
>
> Currently RecoveryService storage logic knows a lot about the DAG like which 
> dag is pre-warm and does not need to be stored, which events needs special 
> treatment etc. This kind of logic couples the DAG and the storage more than 
> is probably necessary and can be a source of complications down the road. The 
> storage should ideally be simply storing a sequence of arbitrary records 
> delimited by a marker.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-861) Handle task re-schedules in recovery

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-861.

Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Handle task re-schedules in recovery
> 
>
> Key: TEZ-861
> URL: https://issues.apache.org/jira/browse/TEZ-861
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-992) Recovery data should not be written on AsyncDispatcher thread

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-992.

Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Recovery data should not be written on AsyncDispatcher thread
> -
>
> Key: TEZ-992
> URL: https://issues.apache.org/jira/browse/TEZ-992
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Bikas Saha
>Assignee: Jeff Zhang
>Priority: Major
> Attachments: DAG_FinishSaving.gv, DAG_FinishSaving_2.gv, 
> DAG_FinishSaving_2.pdf, TEZ-992.patch, Vertex_FinishSaving.gv, 
> Vertex_FinishSaving_2.gv, Vertex_FinishSaving_2.pdf
>
>
> This may block the DAG operations in case the recovery data needs to be 
> synchronously stored. The operations requiring this blocking operation should 
> change their state machines to wait for the store operation before moving 
> ahead. They will move ahead after they receive notification from the 
> RecoveryService that their operation has completed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-860) Handle restore of AMContainer and AMNode states on recovery

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-860.

Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Handle restore of AMContainer and AMNode states on recovery
> ---
>
> Key: TEZ-860
> URL: https://issues.apache.org/jira/browse/TEZ-860
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Priority: Major
>
> Required to handle retroactive failures or nodes going unhealthy.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-852) Implement more optimal flush/sync mechanism to HDFS

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-852.

Resolution: Won't Fix

Closing recovery v1 bugs/features.

> Implement more optimal flush/sync mechanism to HDFS
> ---
>
> Key: TEZ-852
> URL: https://issues.apache.org/jira/browse/TEZ-852
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-3421) cdh 5.5.2 cann't use tez 0.8.4

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-3421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331035#comment-17331035
 ] 

Jonathan Turner Eagles commented on TEZ-3421:
-

[~ft20082], Is this jira still relevant? Trying to close out old jiras.

> cdh 5.5.2 cann't use tez 0.8.4
> --
>
> Key: TEZ-3421
> URL: https://issues.apache.org/jira/browse/TEZ-3421
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.7.1, 0.8.4
> Environment: cdh 5.5.2, hive 1.1.0, tez 0.8.4
>Reporter: Jake.zhang
>Priority: Major
>
> following the install guide, and it don't work.
> {code}
> 2016-08-31 15:31:20,614 [INFO] [IPC Server handler 0 on 43797] 
> |app.DAGAppMaster|: Running DAG: 
> root_20160831153131_0c02cc1a-1447-4ce9-a4b1-f66c2b8e1354:2
> 2016-08-31 15:31:20,849 [INFO] [IPC Server handler 0 on 43797] 
> |history.HistoryEventHandler|: 
> [HISTORY][DAG:dag_1470895429219_0037_1][Event:DAG_SUBMITTED]: 
> dagID=dag_1470895429219_0037_1, submitTime=1472628680567
> 2016-08-31 15:31:20,886 [INFO] [IPC Server handler 0 on 43797] 
> |impl.VertexImpl|: Default container context for 
> vertex_1470895429219_0037_1_00 [Map 1]=LocalResources: [[ name=tezlib, 
> value=resource { scheme: "hdfs" host: "name2.test.kingnetdc.com" port: 8020 
> file: "/user/tez/tez-0.8.4.tar.gz" } size: 48523006 timestamp: 1472625394859 
> type: ARCHIVE visibility: PUBLIC],[ name=hbase-client.jar, value=resource { 
> scheme: "hdfs" host: "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hbase-client.jar"
>  } size: 1294765 timestamp: 1472625755599 type: FILE visibility: PRIVATE],[ 
> name=hbase-server.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hbase-server.jar"
>  } size: 4299872 timestamp: 1472625755903 type: FILE visibility: PRIVATE],[ 
> name=hive-hbase-handler-1.1.0-cdh5.8.0.jar, value=resource { scheme: "hdfs" 
> host: "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hive-hbase-handler-1.1.0-cdh5.8.0.jar"
>  } size: 118398 timestamp: 1472625755443 type: FILE visibility: PRIVATE],[ 
> name=postgresql-9.0-801.jdbc4.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/postgresql-9.0-801.jdbc4.jar"
>  } size: 539705 timestamp: 1472625756348 type: FILE visibility: PRIVATE],[ 
> name=hbase-common.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hbase-common.jar"
>  } size: 581623 timestamp: 1472625755634 type: FILE visibility: PRIVATE],[ 
> name=htrace-core-3.2.0-incubating.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/htrace-core-3.2.0-incubating.jar"
>  } size: 1483913 timestamp: 1472625756002 type: FILE visibility: PRIVATE],[ 
> name=hbase-hadoop-compat.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hbase-hadoop-compat.jar"
>  } size: 39226 timestamp: 1472625755671 type: FILE visibility: PRIVATE],[ 
> name=htrace-core4-4.0.1-incubating.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/htrace-core4-4.0.1-incubating.jar"
>  } size: 1485102 timestamp: 1472625756117 type: FILE visibility: PRIVATE],[ 
> name=hbase-protocol.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hbase-protocol.jar"
>  } size: 4378452 timestamp: 1472625755802 type: FILE visibility: PRIVATE],[ 
> name=htrace-core.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/htrace-core.jar"
>  } size: 1483913 timestamp: 1472625756052 type: FILE visibility: PRIVATE],[ 
> name=hbase-hadoop2-compat.jar, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/hbase-hadoop2-compat.jar"
>  } size: 97555 timestamp: 1472625755713 type: FILE visibility: PRIVATE],[ 
> name=tez-conf.pb, value=resource { scheme: "hdfs" host: 
> "name2.test.kingnetdc.com" port: 8020 file: 
> "/tmp/hive/root/_tez_session_dir/298b98e0-8658-4045-a0db-d8ca0b20f932/.tez/application_147089

[jira] [Resolved] (TEZ-1522) Scheduling can result in out of order execution and slowdown of upstream work

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-1522.
-
Resolution: Later

DagAwareYarnTaskScheduler addresses a large portion of this jira. We can close 
this jira and have customers configure that scheduler.

> Scheduling can result in out of order execution and slowdown of upstream work
> -
>
> Key: TEZ-1522
> URL: https://issues.apache.org/jira/browse/TEZ-1522
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Rajesh Balamohan
>Assignee: Rajesh Balamohan
>Priority: Critical
>  Labels: performance
> Attachments: TEZ-1522.1.wip.txt, TEZ-1522.2.wip.txt, 
> TEZ-1522.am.log.gz, task_runtime.svg
>
>
> M2 M7
> \  /
> (sg) \/
>R3/ (b)
> \   /
>  (b) \ /
>   \   /
> M5
> |
> R6 
> Plz refer to the attachment (task runtime SVG). In this case, M5 got 
> scheduled much earlier than R3 (green color in the diagram) and retained lots 
> of containers.
> R3 got less containers to work with. 
> Attaching the output from the status monitor when the job ran;  Map_5 has 
> taken up almost all of cluster resource, whereas Reducer_3 got fraction of 
> the capacity.
> Map_2: 1/1  Map_5: 0(+373)/1000 Map_7: 1/1  Reducer_3: 0/8000 
>   Reducer_6: 0/1
> Map_2: 1/1  Map_5: 0(+374)/1000 Map_7: 1/1  Reducer_3: 0/8000 
>   Reducer_6: 0/1
> Map_2: 1/1  Map_5: 0(+374)/1000 Map_7: 1/1  Reducer_3: 0(+1)/8000 
>   Reducer_6: 0/1
> 
> Map_2: 1/1  Map_5: 0(+374)/1000 Map_7: 1/1  Reducer_3: 
> 14(+7)/8000  Reducer_6: 0/1
> Map_2: 1/1  Map_5: 0(+374)/1000 Map_7: 1/1  Reducer_3: 
> 63(+14)/8000 Reducer_6: 0/1
> Map_2: 1/1  Map_5: 0(+374)/1000 Map_7: 1/1  Reducer_3: 
> 159(+22)/8000Reducer_6: 0/1
> Map_2: 1/1  Map_5: 0(+374)/1000 Map_7: 1/1  Reducer_3: 
> 308(+29)/8000Reducer_6: 0/1
> ...
> Creating this JIRA as a placeholder for scheduler enhancement. One 
> possibililty could be to
> schedule lesser number of tasks in downstream vertices, based on the 
> information available for the upstream vertex.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-3885) Tez timeline server integration error

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-3885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-3885.
-
Resolution: Duplicate

Some portion of this jira was fixed in another jira.

> Tez timeline server integration error
> -
>
> Key: TEZ-3885
> URL: https://issues.apache.org/jira/browse/TEZ-3885
> Project: Apache Tez
>  Issue Type: Bug
>  Components: UI
>Affects Versions: 0.9.0
> Environment: apache hive version - 3.0.0-SNAPSHOT
> apache hadoop version - 3.0.0
> tez version - 0.9.0
>Reporter: Yousun Jeong
>Priority: Critical
>  Labels: beginner
>
> If I configure below, I've got an error about launch hive error. I think it's 
> a library problem, but I'm not sure how to fix it.
> 
> 
> tez.lib.uris
> hdfs:///apps/tez-0.9.0/tez-0.9.0-minimal.tar.gz
> 
> 
> tez.use.cluster.hadoop-libs
> true
> 
> 
> Enable Tez to use the Timeline Server for History 
> Logging
> tez.history.logging.service.class
> 
> org.apache.tez.dag.history.logging.ats.ATSHistoryLoggingService
> 
> 
> URL for where the Tez UI is hosted
> tez.tez-ui.history-url.base
> http://HOST:8081/tez-ui-0.9.0/
> 
> 
> 2018-01-09 17:47:55,484 [INFO] [main] |app.DAGAppMaster|: Creating 
> DAGAppMaster for applicationId=application_1515483495537_0016, attemptNum=1, 
> AMContainerId=container_e40_1515483495537_0016_01_01, jvmPid=127852, 
> userFromEnv=hadoop, cliSessionOption=true, 
> pwd=/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01,
>  
> localDirs=/data1/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016,/data2/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016,/data3/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016,/data4/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016,/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016,/data6/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016,
>  
> logDirs=/data1/yarn/logs/application_1515483495537_0016/container_e40_1515483495537_0016_01_01
> 2018-01-09 17:47:55,588 [INFO] [main] |Configuration.deprecation|: 
> yarn.resourcemanager.system-metrics-publisher.enabled is deprecated. Instead, 
> use yarn.system-metrics-publisher.enabled
> 2018-01-09 17:47:55,727 [INFO] [main] |security.SecurityUtil|: Updating 
> Configuration
> 2018-01-09 17:47:55,867 [INFO] [main] |app.DAGAppMaster|: Created 
> DAGAppMaster for application appattempt_1515483495537_0016_01, 
> versionInfo=[ component=tez-dag, version=0.9.0, revision=${buildNumber}, 
> SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
> buildTime=2018-01-09T00:23:13Z ]
> 2018-01-09 17:47:55,897 [INFO] [main] |app.DAGAppMaster|: 
> /
> [system properties]
> os.name: Linux
> os.version: 2.6.32-573.7.1.el6.x86_64
> java.home: /usr/java/jdk1.8.0_121/jre
> java.runtime.version: 1.8.0_121-b13
> java.vendor: Oracle Corporation
> java.version: 1.8.0_121
> java.vm.name: Java HotSpot(TM) 64-Bit Server VM
> java.class.path: 
> /data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/hive-exec-3.0.0-SNAPSHOT-09b95fc8eaad07546c7d390259f6a8727a4be9b13ea05b2742a0dd4dbe78b7e7.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-tests-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-ext-service-tests-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/hadoop-shim-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-api-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-job-analyzer-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-common-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-yarn-timeline-history-with-acls-0.9.0.jar:/data5/yarn/local/usercache/hadoop/appcache/application_1515483495537_0016/container_e40_1515483495537_0016_01_01/tezlib/tez-dag-0.9.0.jar:/data5/yarn/local/userca

[jira] [Resolved] (TEZ-3273) app.TaskAttemptListenerImpTezDag: Attempt is not recognized for heartbeat in tez 0.5.2,cause job hang

2021-04-23 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-3273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-3273.
-
Resolution: Incomplete

A number of hangs have been fixed in 0.7/0.9/0.10 releases as well as a number 
of scheduler changes including a new DagAwareYarnTaskScheduler that has been 
stable now for a few years. Closing this jira as it has grown stale.

> app.TaskAttemptListenerImpTezDag: Attempt is not recognized for heartbeat in 
> tez 0.5.2,cause job hang
> -
>
> Key: TEZ-3273
> URL: https://issues.apache.org/jira/browse/TEZ-3273
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.5.2
> Environment: hive0.14 hadoop2.6
>Reporter: Feng Yuan
>Priority: Critical
> Attachments: app_logs.zip
>
>
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> Map 1: 145(+0,-1)/146 Reducer 2: 0/415
> stuck forever~



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4157) ShuffleHandler: upgrade to Netty4 and remove Netty3 dependency from tez

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17330837#comment-17330837
 ] 

Jonathan Turner Eagles commented on TEZ-4157:
-

Sorry. The comment didn't show up for me for some reason.

[~abstractdog], I worried the tez shuffle handler 
auxservices/ShuffleHandler.java will NOT work for branch-0.9 as nodemanager api 
is netty 3 in hadoop 2.10/2.8. Have you verified this branch-0.9 shuffle 
handler works on 2.10 with this change?

> ShuffleHandler: upgrade to Netty4 and remove Netty3 dependency from tez
> ---
>
> Key: TEZ-4157
> URL: https://issues.apache.org/jira/browse/TEZ-4157
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1, 0.9.3
>
> Attachments: TEZ-4157.01.patch, TEZ-4157.02.patch, TEZ-4157.03.patch, 
> TEZ-4157.04.patch, TEZ-4157.05.patch, TEZ-4157.06.patch, dep.log
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> -In the dependency tree, there are 2 occurrences of compile scope direct 
> netty dependencies, however, they're not used at all. I compiled locally 
> successfully without them. E.g. when investigating blackduck alerts 
> (complaining about netty deps for current 3.10.5.Final), it would be cleaner 
> to start from a dependency tree where Tez doesn't depend on netty directly in 
> order to eliminate its responsibility (and move the focus to underlying 
> hadoop for instance).-
> Tez depends on netty3 almost only in ShuffleHandler and some related classes. 
> We can eliminate netty3 by upgrading it, but this effort might involve some 
> testing due to fundamental [changes from 
> netty3->netty4|https://netty.io/wiki/new-and-noteworthy-in-4.0.html] + we 
> don't have a reference yet, as [hadoop's 
> ShuffleHandler|https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/ShuffleHandler.java]
>  is still on netty3.
> As per the netty documentation, we can also expect some performance 
> improvement (e.g. Pooled buffers).
> Background:
> netty4 migration guideline: 
> https://netty.io/wiki/new-and-noteworthy-in-4.0.html
> articles of possible performance improvement:
> https://blog.twitter.com/engineering/en_us/a/2013/netty-4-at-twitter-reduced-gc-overhead.html
> https://developer.squareup.com/blog/upgrading-a-reverse-proxy-from-netty-3-to-4/
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4157) ShuffleHandler: upgrade to Netty4 and remove Netty3 dependency from tez

2021-04-23 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17330833#comment-17330833
 ] 

Jonathan Turner Eagles commented on TEZ-4157:
-

[~abstractdog], I don't see a review on this code before it was merged. Did I 
miss the review?

> ShuffleHandler: upgrade to Netty4 and remove Netty3 dependency from tez
> ---
>
> Key: TEZ-4157
> URL: https://issues.apache.org/jira/browse/TEZ-4157
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1, 0.9.3
>
> Attachments: TEZ-4157.01.patch, TEZ-4157.02.patch, TEZ-4157.03.patch, 
> TEZ-4157.04.patch, TEZ-4157.05.patch, TEZ-4157.06.patch, dep.log
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> -In the dependency tree, there are 2 occurrences of compile scope direct 
> netty dependencies, however, they're not used at all. I compiled locally 
> successfully without them. E.g. when investigating blackduck alerts 
> (complaining about netty deps for current 3.10.5.Final), it would be cleaner 
> to start from a dependency tree where Tez doesn't depend on netty directly in 
> order to eliminate its responsibility (and move the focus to underlying 
> hadoop for instance).-
> Tez depends on netty3 almost only in ShuffleHandler and some related classes. 
> We can eliminate netty3 by upgrading it, but this effort might involve some 
> testing due to fundamental [changes from 
> netty3->netty4|https://netty.io/wiki/new-and-noteworthy-in-4.0.html] + we 
> don't have a reference yet, as [hadoop's 
> ShuffleHandler|https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/ShuffleHandler.java]
>  is still on netty3.
> As per the netty documentation, we can also expect some performance 
> improvement (e.g. Pooled buffers).
> Background:
> netty4 migration guideline: 
> https://netty.io/wiki/new-and-noteworthy-in-4.0.html
> articles of possible performance improvement:
> https://blog.twitter.com/engineering/en_us/a/2013/netty-4-at-twitter-reduced-gc-overhead.html
> https://developer.squareup.com/blog/upgrading-a-reverse-proxy-from-netty-3-to-4/
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-861) Handle task re-schedules in recovery

2021-04-19 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17325270#comment-17325270
 ] 

Jonathan Turner Eagles commented on TEZ-861:


This is already supported and later I will clean this and associated jiras up. 
[~balakuntala], are you looking for a way to contribute to this project? In 
that case, I could point you issues that may relate to how you would want to 
contribute.

> Handle task re-schedules in recovery
> 
>
> Key: TEZ-861
> URL: https://issues.apache.org/jira/browse/TEZ-861
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Hitesh Shah
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4283) Docker fails to build on master - upgrade to nodejs10

2021-04-09 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17318012#comment-17318012
 ] 

Jonathan Turner Eagles commented on TEZ-4283:
-

+1. This commit I believe has already gone in. [~abstractdog], Can we update 
the status of this jira to reflect current status?

> Docker fails to build on master - upgrade to nodejs10
> -
>
> Key: TEZ-4283
> URL: https://issues.apache.org/jira/browse/TEZ-4283
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Blocker
> Fix For: 0.10.1
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> https://ci-hadoop.apache.org/blue/organizations/jenkins/tez-multibranch/detail/PR-102/5/pipeline
> {code}
> [2021-02-08T10:48:03.916Z] WARN engine npm@7.5.2: wanted: {"node":">=10"} 
> (current: {"node":"8.10.0","npm":"3.5.2"})
> [2021-02-08T10:48:03.916Z] WARN engine npm@7.5.2: wanted: {"node":">=10"} 
> (current: {"node":"8.10.0","npm":"3.5.2"})
> [2021-02-08T10:48:14.281Z] /usr/local/lib
> [2021-02-08T10:48:14.281Z] `-- (empty)
> [2021-02-08T10:48:14.281Z] 
> [2021-02-08T10:48:14.281Z] npm ERR! Linux 4.15.0-112-generic
> [2021-02-08T10:48:14.281Z] npm ERR! argv "/usr/bin/node" "/usr/bin/npm" 
> "install" "npm@latest" "-g"
> [2021-02-08T10:48:14.281Z] npm ERR! node v8.10.0
> [2021-02-08T10:48:14.281Z] npm ERR! npm  v3.5.2
> [2021-02-08T10:48:14.281Z] npm ERR! path 
> /usr/local/lib/node_modules/.staging/@npmcli/ci-detect-c7bf9552
> [2021-02-08T10:48:14.281Z] npm ERR! code ENOENT
> [2021-02-08T10:48:14.281Z] npm ERR! errno -2
> [2021-02-08T10:48:14.281Z] npm ERR! syscall rename
> [2021-02-08T10:48:14.281Z] 
> [2021-02-08T10:48:14.281Z] npm ERR! enoent ENOENT: no such file or directory, 
> rename '/usr/local/lib/node_modules/.staging/@npmcli/ci-detect-c7bf9552' -> 
> '/usr/local/lib/node_modules/npm/node_modules/@npmcli/ci-detect'
> [2021-02-08T10:48:14.281Z] npm ERR! enoent ENOENT: no such file or directory, 
> rename '/usr/local/lib/node_modules/.staging/@npmcli/ci-detect-c7bf9552' -> 
> '/usr/local/lib/node_modules/npm/node_modules/@npmcli/ci-detect'
> [2021-02-08T10:48:14.281Z] npm ERR! enoent This is most likely not a problem 
> with npm itself
> [2021-02-08T10:48:14.281Z] npm ERR! enoent and is related to npm not being 
> able to find a file.
> [2021-02-08T10:48:14.281Z] npm ERR! enoent 
> [2021-02-08T10:48:14.281Z] 
> [2021-02-08T10:48:14.281Z] npm ERR! Please include the following file with 
> any support request:
> [2021-02-08T10:48:14.282Z] npm ERR! /root/npm-debug.log
> [2021-02-08T10:48:14.282Z] npm ERR! code 1
> [2021-02-08T10:48:14.806Z] The command '/bin/bash -o pipefail -c apt-get -q 
> update && apt-get install -y --no-install-recommends nodejs npm && 
> apt-get clean && rm -rf /var/lib/apt/lists/* && npm install 
> npm@latest -g && npm install -g jshint' returned a non-zero code: 1
> [2021-02-08T10:48:14.806Z] ERROR: Docker failed to build yetus/tez:474bb22ba.
> {code}
> https://github.com/npm/ci-detect



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4299) Default java opts cause jdk11 to fail

2021-03-26 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4299:

Parent: TEZ-4020
Issue Type: Sub-task  (was: Bug)

> Default java opts cause jdk11 to fail
> -
>
> Key: TEZ-4299
> URL: https://issues.apache.org/jira/browse/TEZ-4299
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>
> From TEZ-4298 PR request on jdk11
> {noformat}
> [INFO] Running org.apache.tez.auxservices.TestShuffleHandlerJobs
> [ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 
> 51.727 s <<< FAILURE! - in org.apache.tez.auxservices.TestShuffleHandlerJobs
> [ERROR] 
> org.apache.tez.auxservices.TestShuffleHandlerJobs.testOrderedWordCount  Time 
> elapsed: 4.583 s  <<< ERROR!
> org.apache.tez.dag.api.SessionNotRunning: 
> TezSession has already shutdown. Application application_1616693699949_0001 
> failed 2 times due to AM Container for appattempt_1616693699949_0001_02 
> exited with  exitCode: 1
> Failing this attempt.Diagnostics: [2021-03-25 17:35:06.864]Exception from 
> container-launch.
> Container id: container_1616693699949_0001_02_01
> Exit code: 1
> [2021-03-25 17:35:06.874]Container exited with a non-zero exit code 1. Error 
> file: prelaunch.err.
> Last 4096 bytes of prelaunch.err :
> Last 4096 bytes of stderr :
> Unrecognized VM option 'PrintGCTimeStamps'
> Error: Could not create the Java Virtual Machine.
> Error: A fatal exception has occurred. Program will exit.
> [2021-03-25 17:35:06.876]Container exited with a non-zero exit code 1. Error 
> file: prelaunch.err.
> Last 4096 bytes of prelaunch.err :
> Last 4096 bytes of stderr :
> Unrecognized VM option 'PrintGCTimeStamps'
> Error: Could not create the Java Virtual Machine.
> Error: A fatal exception has occurred. Program will exit.
> For more detailed output, check the application tracking page: 
> http://a5bb2fa1dfc2:34397/cluster/app/application_1616693699949_0001 Then 
> click on links to logs of each attempt.
> . Failing the application.
>   at 
> org.apache.tez.auxservices.TestShuffleHandlerJobs.testOrderedWordCount(TestShuffleHandlerJobs.java:129)
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4299) Default java opts cause jdk11 to fail

2021-03-26 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4299:

Description: 
>From TEZ-4298 PR request on jdk11
{noformat}
[INFO] Running org.apache.tez.auxservices.TestShuffleHandlerJobs
[ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 51.727 
s <<< FAILURE! - in org.apache.tez.auxservices.TestShuffleHandlerJobs
[ERROR] org.apache.tez.auxservices.TestShuffleHandlerJobs.testOrderedWordCount  
Time elapsed: 4.583 s  <<< ERROR!
org.apache.tez.dag.api.SessionNotRunning: 
TezSession has already shutdown. Application application_1616693699949_0001 
failed 2 times due to AM Container for appattempt_1616693699949_0001_02 
exited with  exitCode: 1
Failing this attempt.Diagnostics: [2021-03-25 17:35:06.864]Exception from 
container-launch.
Container id: container_1616693699949_0001_02_01
Exit code: 1

[2021-03-25 17:35:06.874]Container exited with a non-zero exit code 1. Error 
file: prelaunch.err.
Last 4096 bytes of prelaunch.err :
Last 4096 bytes of stderr :
Unrecognized VM option 'PrintGCTimeStamps'
Error: Could not create the Java Virtual Machine.
Error: A fatal exception has occurred. Program will exit.


[2021-03-25 17:35:06.876]Container exited with a non-zero exit code 1. Error 
file: prelaunch.err.
Last 4096 bytes of prelaunch.err :
Last 4096 bytes of stderr :
Unrecognized VM option 'PrintGCTimeStamps'
Error: Could not create the Java Virtual Machine.
Error: A fatal exception has occurred. Program will exit.


For more detailed output, check the application tracking page: 
http://a5bb2fa1dfc2:34397/cluster/app/application_1616693699949_0001 Then click 
on links to logs of each attempt.
. Failing the application.
at 
org.apache.tez.auxservices.TestShuffleHandlerJobs.testOrderedWordCount(TestShuffleHandlerJobs.java:129)
{noformat}

> Default java opts cause jdk11 to fail
> -
>
> Key: TEZ-4299
> URL: https://issues.apache.org/jira/browse/TEZ-4299
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
>
> From TEZ-4298 PR request on jdk11
> {noformat}
> [INFO] Running org.apache.tez.auxservices.TestShuffleHandlerJobs
> [ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 
> 51.727 s <<< FAILURE! - in org.apache.tez.auxservices.TestShuffleHandlerJobs
> [ERROR] 
> org.apache.tez.auxservices.TestShuffleHandlerJobs.testOrderedWordCount  Time 
> elapsed: 4.583 s  <<< ERROR!
> org.apache.tez.dag.api.SessionNotRunning: 
> TezSession has already shutdown. Application application_1616693699949_0001 
> failed 2 times due to AM Container for appattempt_1616693699949_0001_02 
> exited with  exitCode: 1
> Failing this attempt.Diagnostics: [2021-03-25 17:35:06.864]Exception from 
> container-launch.
> Container id: container_1616693699949_0001_02_01
> Exit code: 1
> [2021-03-25 17:35:06.874]Container exited with a non-zero exit code 1. Error 
> file: prelaunch.err.
> Last 4096 bytes of prelaunch.err :
> Last 4096 bytes of stderr :
> Unrecognized VM option 'PrintGCTimeStamps'
> Error: Could not create the Java Virtual Machine.
> Error: A fatal exception has occurred. Program will exit.
> [2021-03-25 17:35:06.876]Container exited with a non-zero exit code 1. Error 
> file: prelaunch.err.
> Last 4096 bytes of prelaunch.err :
> Last 4096 bytes of stderr :
> Unrecognized VM option 'PrintGCTimeStamps'
> Error: Could not create the Java Virtual Machine.
> Error: A fatal exception has occurred. Program will exit.
> For more detailed output, check the application tracking page: 
> http://a5bb2fa1dfc2:34397/cluster/app/application_1616693699949_0001 Then 
> click on links to logs of each attempt.
> . Failing the application.
>   at 
> org.apache.tez.auxservices.TestShuffleHandlerJobs.testOrderedWordCount(TestShuffleHandlerJobs.java:129)
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4299) Default java opts cause jdk11 to fail

2021-03-26 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4299:
---

 Summary: Default java opts cause jdk11 to fail
 Key: TEZ-4299
 URL: https://issues.apache.org/jira/browse/TEZ-4299
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4298) ShuffleHandler is not source compatible with Hadoop 3.3.0

2021-03-25 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4298:

Attachment: TEZ-4298.001.patch

> ShuffleHandler is not source compatible with Hadoop 3.3.0
> -
>
> Key: TEZ-4298
> URL: https://issues.apache.org/jira/browse/TEZ-4298
> Project: Apache Tez
>  Issue Type: Bug
>Reporter: Jonathan Turner Eagles
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Attachments: TEZ-4298.001.patch
>
>
> {noformat}
> mvn clean package -DskipTests -pl '!tez-ui' -Dhadoop.version=3.3.0
> [ERROR] COMPILATION ERROR : 
> [INFO] -
> [ERROR] 
> tez/tez-plugins/tez-aux-services/src/main/java/org/apache/tez/auxservices/ShuffleHandler.java:[764,45]
>  incompatible types: com.google.protobuf.ByteString cannot be converted to 
> org.apache.hadoop.thirdparty.protobuf.ByteString
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4298) ShuffleHandler is not source compatible with Hadoop 3.3.0

2021-03-25 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4298:
---

 Summary: ShuffleHandler is not source compatible with Hadoop 3.3.0
 Key: TEZ-4298
 URL: https://issues.apache.org/jira/browse/TEZ-4298
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles


{noformat}
mvn clean package -DskipTests -pl '!tez-ui' -Dhadoop.version=3.3.0
[ERROR] COMPILATION ERROR : 
[INFO] -
[ERROR] 
tez/tez-plugins/tez-aux-services/src/main/java/org/apache/tez/auxservices/ShuffleHandler.java:[764,45]
 incompatible types: com.google.protobuf.ByteString cannot be converted to 
org.apache.hadoop.thirdparty.protobuf.ByteString
{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TEZ-4294) Tez UI counters missing in settings view

2021-02-19 Thread Jonathan Turner Eagles (Jira)
Jonathan Turner Eagles created TEZ-4294:
---

 Summary: Tez UI counters missing in settings view
 Key: TEZ-4294
 URL: https://issues.apache.org/jira/browse/TEZ-4294
 Project: Apache Tez
  Issue Type: Bug
Reporter: Jonathan Turner Eagles
Assignee: Jonathan Turner Eagles


org.apache.tez.common.counters.TaskCounter.INPUT_SPLIT_LENGTH_BYTES is one 
missing example
Need to compare 
tez-api/src/main/java/org/apache/tez/common/counters/TaskCounter.java to 
tez-ui/src/main/webapp/config/default-app-conf.js to see if any others are 
missing and valuable.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4289) Remove Dependency on commons-math

2021-02-17 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4289.
-
Fix Version/s: 0.9.3
   0.10.1
   Resolution: Fixed

+1 Committed to master and branch-0.9

> Remove Dependency on commons-math
> -
>
> Key: TEZ-4289
> URL: https://issues.apache.org/jira/browse/TEZ-4289
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
> Fix For: 0.10.1, 0.9.3
>
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (TEZ-4288) Upgrade SLF4J to 1.7.30

2021-02-12 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles updated TEZ-4288:

Fix Version/s: 0.9.3

> Upgrade SLF4J to 1.7.30
> ---
>
> Key: TEZ-4288
> URL: https://issues.apache.org/jira/browse/TEZ-4288
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
> Fix For: 0.10.1, 0.9.3
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4288) Upgrade SLF4J to 1.7.30

2021-02-12 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4288.
-
Fix Version/s: 0.10.1
   Resolution: Fixed

+1. Committed to master branch via github. I will cherry-pick this to 
branch-0.9 as well.

> Upgrade SLF4J to 1.7.30
> ---
>
> Key: TEZ-4288
> URL: https://issues.apache.org/jira/browse/TEZ-4288
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
> Fix For: 0.10.1
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4241) UI: containerLogs testWrapper test fails

2021-02-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17283828#comment-17283828
 ] 

Jonathan Turner Eagles commented on TEZ-4241:
-

I have assigned this jira to myself to indicate contributor of 02 patch. Pushed 
to master and cherry-picked to branch-0.9.
Thanks to [~abstractdog] and [~euigeun_chung] for their help to get this work 
committed.

> UI: containerLogs testWrapper test fails
> 
>
> Key: TEZ-4241
> URL: https://issues.apache.org/jira/browse/TEZ-4241
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.9.2, 0.10.0
>Reporter: László Bodor
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.10.1
>
> Attachments: TEZ-4241.01.patch, TEZ-4241.02.patch, TEZ-4241.03.patch, 
> patch-unit-root.txt
>
>
> This causes failures in precommit for patches that are independent of tez ui.
> {code}
> not ok 750 PhantomJS 2.1 - Unit | Serializer | dag: containerLogs test
> ---
> actual: >
> null
> message: >
> Died on test #2 
> testWrapper@http://localhost:7357/assets/test-support.js:6240:16
> test@http://localhost:7357/assets/test-support.js:6254:44
> http://localhost:7357/assets/tests.js:19223:24
> http://localhost:7357/assets/vendor.js:477:34
> tryFinally@http://localhost:7357/assets/vendor.js:358:21
> requireModule@http://localhost:7357/assets/vendor.js:475:15
> require@http://localhost:7357/assets/test-loader.js:60:16
> loadModules@http://localhost:7357/assets/test-loader.js:51:25
> load@http://localhost:7357/assets/test-loader.js:82:35
> http://localhost:7357/assets/test-support.js:6037:20: undefined 
> is not an object (evaluating 'this.get')
> Log: |
> ...
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4241) UI: containerLogs testWrapper test fails

2021-02-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17283827#comment-17283827
 ] 

Jonathan Turner Eagles commented on TEZ-4241:
-

[~euigeun_chung], we're going to commit work-around patch above 02.patch to get 
the builds passing again. A proper fix in a follow-on jira. Please option a 
jira if you wish to pursue that.

> UI: containerLogs testWrapper test fails
> 
>
> Key: TEZ-4241
> URL: https://issues.apache.org/jira/browse/TEZ-4241
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.9.2, 0.10.0
>Reporter: László Bodor
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.10.1
>
> Attachments: TEZ-4241.01.patch, TEZ-4241.02.patch, TEZ-4241.03.patch, 
> patch-unit-root.txt
>
>
> This causes failures in precommit for patches that are independent of tez ui.
> {code}
> not ok 750 PhantomJS 2.1 - Unit | Serializer | dag: containerLogs test
> ---
> actual: >
> null
> message: >
> Died on test #2 
> testWrapper@http://localhost:7357/assets/test-support.js:6240:16
> test@http://localhost:7357/assets/test-support.js:6254:44
> http://localhost:7357/assets/tests.js:19223:24
> http://localhost:7357/assets/vendor.js:477:34
> tryFinally@http://localhost:7357/assets/vendor.js:358:21
> requireModule@http://localhost:7357/assets/vendor.js:475:15
> require@http://localhost:7357/assets/test-loader.js:60:16
> loadModules@http://localhost:7357/assets/test-loader.js:51:25
> load@http://localhost:7357/assets/test-loader.js:82:35
> http://localhost:7357/assets/test-support.js:6037:20: undefined 
> is not an object (evaluating 'this.get')
> Log: |
> ...
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (TEZ-4241) UI: containerLogs testWrapper test fails

2021-02-12 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles reassigned TEZ-4241:
---

Assignee: Jonathan Turner Eagles  (was: László Bodor)

> UI: containerLogs testWrapper test fails
> 
>
> Key: TEZ-4241
> URL: https://issues.apache.org/jira/browse/TEZ-4241
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.9.2, 0.10.0
>Reporter: László Bodor
>Assignee: Jonathan Turner Eagles
>Priority: Major
> Fix For: 0.10.1
>
> Attachments: TEZ-4241.01.patch, TEZ-4241.02.patch, TEZ-4241.03.patch, 
> patch-unit-root.txt
>
>
> This causes failures in precommit for patches that are independent of tez ui.
> {code}
> not ok 750 PhantomJS 2.1 - Unit | Serializer | dag: containerLogs test
> ---
> actual: >
> null
> message: >
> Died on test #2 
> testWrapper@http://localhost:7357/assets/test-support.js:6240:16
> test@http://localhost:7357/assets/test-support.js:6254:44
> http://localhost:7357/assets/tests.js:19223:24
> http://localhost:7357/assets/vendor.js:477:34
> tryFinally@http://localhost:7357/assets/vendor.js:358:21
> requireModule@http://localhost:7357/assets/vendor.js:475:15
> require@http://localhost:7357/assets/test-loader.js:60:16
> loadModules@http://localhost:7357/assets/test-loader.js:51:25
> load@http://localhost:7357/assets/test-loader.js:82:35
> http://localhost:7357/assets/test-support.js:6037:20: undefined 
> is not an object (evaluating 'this.get')
> Log: |
> ...
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4290) Dockerfile improvements

2021-02-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17283770#comment-17283770
 ] 

Jonathan Turner Eagles commented on TEZ-4290:
-

+1 On these changes.

> Dockerfile improvements
> ---
>
> Key: TEZ-4290
> URL: https://issues.apache.org/jira/browse/TEZ-4290
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1, 0.9.3
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Given these improvement ideas: 
> https://github.com/aw-was-here/tez/commit/bead3a21e68ecf4a75d64d8c08b9f8fa0f98cf51
> thanks  [~aw]
> Dockerfile changes could be easily tested, becuase it can be build locally 
> (as discovered on TEZ-4283):
> {code}
> docker build -f ./build-tools/docker/Dockerfile ./build-tools/docker/
> {code}
> Moreover, it's applied on PR level, so a full precommit test (including all 
> modules) could let us validate the correctness of the image changes.
> So, I'm suspecting that Dockerfile changes can come in advance, before 
> introducing Jenkinsfile, which is a bit harder to test (I guess).
> according to [~aw]'s comments:
> {code}
>   - Update to an image that won't be unsupported in a year
>   - Remove a bunch of stuff that Tez doesn't use
>   - Add support for a bunch of stuff that Yetus needs to test everything 
> (however, keeping FindBugs over SpotBugs)
>   - Make it Docker BuildKit compatible to make it build in parallel
>   - Support both OpenJDK 8 and OpenJDK 11
>   - Switch to python3 because python2 is effectively dead in January with pip 
> changes
>   - Remove the interactive bits of the Dockerfile since nothing appears to be 
> using them
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4241) UI: containerLogs testWrapper test fails

2021-02-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17283748#comment-17283748
 ] 

Jonathan Turner Eagles commented on TEZ-4241:
-

[~abstractdog], [~euigeun_chung], I propose we put patch 02 in as a temporary 
fix (under a different jira) until this test can be fixed properly so that 
builds can get healthy again. Anyone against this idea?

> UI: containerLogs testWrapper test fails
> 
>
> Key: TEZ-4241
> URL: https://issues.apache.org/jira/browse/TEZ-4241
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.9.2, 0.10.0
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1
>
> Attachments: TEZ-4241.01.patch, TEZ-4241.02.patch, TEZ-4241.03.patch, 
> patch-unit-root.txt
>
>
> This causes failures in precommit for patches that are independent of tez ui.
> {code}
> not ok 750 PhantomJS 2.1 - Unit | Serializer | dag: containerLogs test
> ---
> actual: >
> null
> message: >
> Died on test #2 
> testWrapper@http://localhost:7357/assets/test-support.js:6240:16
> test@http://localhost:7357/assets/test-support.js:6254:44
> http://localhost:7357/assets/tests.js:19223:24
> http://localhost:7357/assets/vendor.js:477:34
> tryFinally@http://localhost:7357/assets/vendor.js:358:21
> requireModule@http://localhost:7357/assets/vendor.js:475:15
> require@http://localhost:7357/assets/test-loader.js:60:16
> loadModules@http://localhost:7357/assets/test-loader.js:51:25
> load@http://localhost:7357/assets/test-loader.js:82:35
> http://localhost:7357/assets/test-support.js:6037:20: undefined 
> is not an object (evaluating 'this.get')
> Log: |
> ...
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4290) Dockerfile improvements

2021-02-12 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17283741#comment-17283741
 ] 

Jonathan Turner Eagles commented on TEZ-4290:
-

[~abstractdog], Is this jira ready to go in? Or is there more work still to do?

> Dockerfile improvements
> ---
>
> Key: TEZ-4290
> URL: https://issues.apache.org/jira/browse/TEZ-4290
> Project: Apache Tez
>  Issue Type: Sub-task
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1, 0.9.3
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Given these improvement ideas: 
> https://github.com/aw-was-here/tez/commit/bead3a21e68ecf4a75d64d8c08b9f8fa0f98cf51
> thanks  [~aw]
> Dockerfile changes could be easily tested, becuase it can be build locally 
> (as discovered on TEZ-4283):
> {code}
> docker build -f ./build-tools/docker/Dockerfile ./build-tools/docker/
> {code}
> Moreover, it's applied on PR level, so a full precommit test (including all 
> modules) could let us validate the correctness of the image changes.
> So, I'm suspecting that Dockerfile changes can come in advance, before 
> introducing Jenkinsfile, which is a bit harder to test (I guess).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4286) Consider sending fewer emails from github to d...@tez.apache.org

2021-02-10 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17282696#comment-17282696
 ] 

Jonathan Turner Eagles commented on TEZ-4286:
-

[~abstractdog], is git...@hive.apache.org a real mailing list? Also, if our 
goal is to mimic the JIRA/gitbox emails. Would issues and pullrequests point to 
issues@tez.apache.org to send PR comments and creation/closure of new PR? Agree 
that d...@tez.apache.org is not the correct place for those.

I'm trying to imagine what equivalence would be, but would be open to 
reimagining a new system that would better serve the community. 

> Consider sending fewer emails from github to d...@tez.apache.org
> ---
>
> Key: TEZ-4286
> URL: https://issues.apache.org/jira/browse/TEZ-4286
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> TEZ-4278 introduced .asf.yaml, where I haven't changed the defaults regarding 
> notifications:
> https://gitbox.apache.org/schemes.cgi?tez
> {code}
> commits:  comm...@tez.apache.org
> issues:   d...@tez.apache.org
> pullrequests: d...@tez.apache.org
> {code}
> now looking at my emails and settings in hive project, I think 
> d...@apache.tez.org should not be the recipient of so many mails from github, 
> because it will kill the original intention of a dev list (I mean, getting 
> relevant, useful emails from humans :) )
> for reference in hive there is:
> {code}
>   commits:  comm...@hive.apache.org
>   issues:   git...@hive.apache.org
>   pullrequests: git...@hive.apache.org
> {code}
> I think the *pullrequests* part which really spams dev@ at the moment



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-3966) Tez UI config couldn't be executed in browser

2021-02-10 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-3966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-3966.
-
Fix Version/s: 0.9.3
   0.10.1
   Resolution: Fixed

> Tez UI config couldn't be executed in browser 
> --
>
> Key: TEZ-3966
> URL: https://issues.apache.org/jira/browse/TEZ-3966
> Project: Apache Tez
>  Issue Type: Bug
>  Components: UI
>Reporter: Roman Lukin
>Priority: Major
> Fix For: 0.10.1, 0.9.3
>
> Attachments: Selection_043.png, Selection_044.png, 
> simple-cors-http-server.py
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> Current name convention applied to the ui configuration file, produce the 
> following error in web browser - mime type check fails (which based on file 
> extesion) and if  header `X-Content-Type-Options: nosniff` enabled config 
> wouldn't be executed and used. As a result we may have problems, for instance 
> links based on variable `yarnProtocol`.
> More about header - 
> [https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/X-Content-Type-Options]
> Possible solution - rename file `config.env` to `config.js` 
> [https://github.com/apache/tez/pull/26] 
> [https://github.com/rlukin/tez/commit/8db8c04faee04dc161c90c3961233ec5aa40518e]
>  
> Steps to reproduce:
>  * Enable aforementioned header on web server
>  * Try to load tez ui



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-3966) Tez UI config couldn't be executed in browser

2021-02-10 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-3966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17282665#comment-17282665
 ] 

Jonathan Turner Eagles commented on TEZ-3966:
-

+1. Disussion in github. Committing and cherry-picking to branch-0.9

> Tez UI config couldn't be executed in browser 
> --
>
> Key: TEZ-3966
> URL: https://issues.apache.org/jira/browse/TEZ-3966
> Project: Apache Tez
>  Issue Type: Bug
>  Components: UI
>Reporter: Roman Lukin
>Priority: Major
> Attachments: Selection_043.png, Selection_044.png, 
> simple-cors-http-server.py
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Current name convention applied to the ui configuration file, produce the 
> following error in web browser - mime type check fails (which based on file 
> extesion) and if  header `X-Content-Type-Options: nosniff` enabled config 
> wouldn't be executed and used. As a result we may have problems, for instance 
> links based on variable `yarnProtocol`.
> More about header - 
> [https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/X-Content-Type-Options]
> Possible solution - rename file `config.env` to `config.js` 
> [https://github.com/apache/tez/pull/26] 
> [https://github.com/rlukin/tez/commit/8db8c04faee04dc161c90c3961233ec5aa40518e]
>  
> Steps to reproduce:
>  * Enable aforementioned header on web server
>  * Try to load tez ui



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4278) Show PR link in Jira

2021-02-09 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4278.
-
Fix Version/s: 0.10.1
   Resolution: Fixed

+1. Committed to master via github. [~abstractdog], does this need to be 
cherry-picked to branch-0.9? I didn't think so based on the comments above. Let 
me know if needed.

> Show PR link in Jira
> 
>
> Key: TEZ-4278
> URL: https://issues.apache.org/jira/browse/TEZ-4278
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Fix For: 0.10.1
>
> Attachments: Screen Shot 2021-01-29 at 6.52.59 PM.png
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> In hive project, when a PR is created, link is automatically shown in Jira. I 
> got used to it, so it's very important :)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4130) Config for hard limiting the number of splits

2021-01-29 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17275353#comment-17275353
 ] 

Jonathan Turner Eagles commented on TEZ-4130:
-

[~abstractdog], now with TEZ-4271. Are we needing to increase the priority of 
this jira? Seems like unit test was needed, but perhaps some logic also.

> Config for hard limiting the number of splits
> -
>
> Key: TEZ-4130
> URL: https://issues.apache.org/jira/browse/TEZ-4130
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Attachments: TEZ-4130.01.patch, TEZ-4130.02.patch
>
>
> During the investigation of a customer issue, I found that tez generated a 
> dag plan containing >4k tasks. It failed for hive because of bucket number 
> limitations (4k). It can be configured properly, e.g. bigger splits 
> (tez.grouping.min-size), but maybe it would be more convenient for users to 
> config a hard limit for the number of splits.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TEZ-4266) Make Tez Inherit From Apache POM

2021-01-29 Thread Jonathan Turner Eagles (Jira)


 [ 
https://issues.apache.org/jira/browse/TEZ-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Turner Eagles resolved TEZ-4266.
-
Fix Version/s: 0.9.3
   0.10.1
   Resolution: Fixed

merged to master from github and cherry-picked to branch-0.9. Thanks, 
[~belugabehr] for this patch.

> Make Tez Inherit From Apache POM
> 
>
> Key: TEZ-4266
> URL: https://issues.apache.org/jira/browse/TEZ-4266
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
> Fix For: 0.10.1, 0.9.3
>
>  Time Spent: 2h 50m
>  Remaining Estimate: 0h
>
> This allows for inheriting all of the Apache Maven Plugins and makes plugins 
> easier to manage since all the versions are tested together.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4278) Show PR link in Jira

2021-01-29 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17275296#comment-17275296
 ] 

Jonathan Turner Eagles commented on TEZ-4278:
-

This is some feedback and patch I received after the first round of github 
integration from yetus maintainer. I haven't had time to analyze the changes 
and try them out yet.

https://github.com/aw-was-here/tez/commit/bead3a21e68ecf4a75d64d8c08b9f8fa0f98cf51

> Show PR link in Jira
> 
>
> Key: TEZ-4278
> URL: https://issues.apache.org/jira/browse/TEZ-4278
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: László Bodor
>Assignee: László Bodor
>Priority: Major
> Attachments: Screen Shot 2021-01-29 at 6.52.59 PM.png
>
>
> In hive project, when a PR is created, link is automatically shown in Jira. I 
> got used to it, so it's very important :)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TEZ-4275) Use Google Guava Intern Facility

2021-01-28 Thread Jonathan Turner Eagles (Jira)


[ 
https://issues.apache.org/jira/browse/TEZ-4275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17274157#comment-17274157
 ] 

Jonathan Turner Eagles commented on TEZ-4275:
-

https://issues.apache.org/jira/browse/TEZ-2164 has details on shading guava and 
discussions

> Use Google Guava Intern Facility
> 
>
> Key: TEZ-4275
> URL: https://issues.apache.org/jira/browse/TEZ-4275
> Project: Apache Tez
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Google Guava has a pretty cool Interner facility.
>  
>  * More memory efficient than current offering.  Map contains a weak key and 
> a static dummy value (current implementation uses a weak value)
>  * Current implementation has a single lock around the entire data structure. 
>  Google segments their data structure into (default: 4) segments for better 
> concurrency
>  * All the other thoughtful stuff Google has added into this feature



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   3   4   5   >