[GitHub] flink issue #2696: [FLINK-3347] [akka] Add QuarantineMonitor which shuts a q...

2017-04-25 Thread tillrohrmann
Github user tillrohrmann commented on the issue: https://github.com/apache/flink/pull/2696 I propose to merge this PR into the master such that it is disabled by default. This entails that we don't require our own heartbeats from the TMs to the `JobManager`. This will make

[GitHub] flink issue #2696: [FLINK-3347] [akka] Add QuarantineMonitor which shuts a q...

2017-02-20 Thread tillrohrmann
Github user tillrohrmann commented on the issue: https://github.com/apache/flink/pull/2696 Rebased this PR onto the latest master.

[GitHub] flink issue #2696: [FLINK-3347] [akka] Add QuarantineMonitor which shuts a q...

2016-10-27 Thread tillrohrmann
Github user tillrohrmann commented on the issue: https://github.com/apache/flink/pull/2696 Thanks for the review @StephanEwen. Yes, let's do it as you've proposed. I've opened an [issue](https://issues.apache.org/jira/browse/FLINK-4944) for replacing Akka's death watch with our own hea

[GitHub] flink issue #2696: [FLINK-3347] [akka] Add QuarantineMonitor which shuts a q...

2016-10-27 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/2696 This means we should do a separate pull request which changes the TaskManager side of the failure detection from Akka watch to the TaskManager heartbeats. After that is in, we merge this

[GitHub] flink issue #2696: [FLINK-3347] [akka] Add QuarantineMonitor which shuts a q...

2016-10-27 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/2696 That's a very good addition; we need something like that. After an offline discussion with @tillrohrmann we came to the following conclusion: There is a tricky problem with that