[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Zili Chen (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16949065#comment-16949065
 ] 

Zili Chen commented on FLINK-14347:
---

[~trohrmann] PR submitted.

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Assignee: Zili Chen
>Priority: Critical
>  Labels: pull-request-available
> Fix For: 1.10.0, 1.9.1, 1.8.3
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Zili Chen (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16949064#comment-16949064
 ] 

Zili Chen commented on FLINK-14347:
---

Thanks for your information [~fly_in_gis]!

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Assignee: Zili Chen
>Priority: Critical
>  Labels: pull-request-available
> Fix For: 1.10.0, 1.9.1, 1.8.3
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Yang Wang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16949063#comment-16949063
 ] 

Yang Wang commented on FLINK-14347:
---

When there is `ApplicationAttemptNotFoundException` in AMRMClient, 
`onShutdownRequest` of the callback will be called. Usually it just because we 
kill the application through yarn client, so it is a expected exception. Also 
when there is some problem with Yarn in recover application meta info from 
zookeeper(lost some applications), `ApplicationAttemptNotFoundException` will 
also send to AMRMClient.

It is reasonable that we throw a fatal error in `onShutdownRequest`. Updating 
the check rules is ok for me.

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Assignee: Zili Chen
>Priority: Critical
>  Labels: pull-request-available
> Fix For: 1.10.0, 1.9.1, 1.8.3
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Till Rohrmann (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16948810#comment-16948810
 ] 

Till Rohrmann commented on FLINK-14347:
---

I'd be ok with this approach. Can you work on this issue [~tison]?

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Priority: Critical
> Fix For: 1.10.0, 1.9.1, 1.8.3
>
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Zili Chen (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16948773#comment-16948773
 ] 

Zili Chen commented on FLINK-14347:
---

Thanks for reporting this issue [~TsReaper] and your analysis [~trohrmann].

I notice that the instability depends on whether or not {{jobmanager.log}} has 
been dumped on the verification. Given that the "forbidden" string is actually 
expected[1] I propose we add the next line into whitelist. Locally verify when 
{{jobmanager.log}} dumped we find the "forbidden" string and filter out with 
the exclusion of expected Exception.

{{Received shutdown request from YARN ResourceManager}}

[1] Specifically, we call {{YarnClient.killApplication}} in 
{{YARNSessionFIFOITCase#runDetachedModeTest}} which always causes a shutdown 
request.

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Priority: Critical
> Fix For: 1.10.0, 1.9.1, 1.8.3
>
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Till Rohrmann (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16948728#comment-16948728
 ] 

Till Rohrmann commented on FLINK-14347:
---

I think this problem has been introduced with FLINK-14010. I suspect that the 
{{YarnResourceManager}} receives an {{onShutdownRequest}} during the clean up 
of the test. Since we are now calling the {{FatalExceptionHandler}}, the test 
process terminates which then fails the test.

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Priority: Critical
> Fix For: 1.10.0, 1.9.1, 1.8.3
>
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-10 Thread Till Rohrmann (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16948726#comment-16948726
 ] 

Till Rohrmann commented on FLINK-14347:
---

Another instance: https://api.travis-ci.org/v3/job/595082243/log.txt

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Deployment / YARN, Tests
>Affects Versions: 1.10.0, 1.9.1, 1.8.3
>Reporter: Caizhi Weng
>Priority: Critical
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-14347) YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with prohibited string

2019-10-09 Thread Chesnay Schepler (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16947627#comment-16947627
 ] 

Chesnay Schepler commented on FLINK-14347:
--

Please add the full excerpt to the JIRA description.

> YARNSessionFIFOITCase.checkForProhibitedLogContents found a log with 
> prohibited string
> --
>
> Key: FLINK-14347
> URL: https://issues.apache.org/jira/browse/FLINK-14347
> Project: Flink
>  Issue Type: Test
>  Components: Tests
>Affects Versions: 1.8.2
>Reporter: Caizhi Weng
>Priority: Major
>
> YARNSessionFIFOITCase.checkForProhibitedLogContents fails with the following 
> exception:
> {code:java}
> 14:55:27.643 [ERROR]   
> YARNSessionFIFOITCase.checkForProhibitedLogContents:77->YarnTestBase.ensureNoProhibitedStringInLogFiles:461
>  Found a file 
> /home/travis/build/apache/flink/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-1_0/application_1570546069180_0001/container_1570546069180_0001_01_01/jobmanager.log
>  with a prohibited string (one of [Exception, Started 
> SelectChannelConnector@0.0.0.0:8081]). Excerpts:23760[{code}
> Travis log link: [https://travis-ci.org/apache/flink/jobs/595082243]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)