@Milan, I agree this bug needs to be fixed, and the the hack I provided previously isn't complete as it doesn't work for any operator with on_kill method that requires transient task attributes (i.e. self.sp)... Unfortunately I don't think the new UI will help rid of the issue (as the bug is on the model layer, and running the cli command `clear` will result the same bug). I will try to set some time aside and look into this bug again to see how to preserve the context, but currently don't have have a good solution for it.
On Mon, Jan 15, 2018 at 12:40 AM, EKC (Erik Cederstrand) <e...@novozymes.com> wrote: > I'm not sure if it's related, but there's an additional issue with > attempting to kill processes, that doesn't always kill the process: > https://issues.apache.org/jira/browse/AIRFLOW-949 > > > Kind regards, > > Erik Cederstrand > > ________________________________ > From: Milan van der Meer <milan.vanderm...@realimpactanalytics.com> > Sent: Friday, January 12, 2018 9:23:24 PM > To: dev@airflow.incubator.apache.org > Subject: Re: Fix on_kill command for operators > > Currently the on_kill does not get triggered when you clear from the UI. > As you mention, adding the 'fix' mentioned in the first comment of the > issue, does no fix the problem as it does not trigger on the right > operators context. > > Im not sure what exact changes are planned for the next 1.10 release, but > if the whole UI change is planned, this could be a good opportunity to also > fix this bug. > > On Mon, Jan 8, 2018 at 2:55 PM, Ash Berlin-Taylor < > ash_airflowl...@firemirror.com> wrote: > > > Without this change does on_kill ever get triggered? It seems like this > > change is desired behaviour. > > > > As per the first comment https://emea01.safelinks. > protection.outlook.com/?url=https%3A%2F%2Fissues.apache. > org%2F&data=01%7C01%7CEKC%40novozymes.com%7C4d2e0189d7e44025c48608d559fa > 5913%7C43d5f49ee03a4d22a2285684196bb001%7C0&sdata=9%2BWOuDmUe9M1WK2% > 2FqSsOtsaNrZnTtbvX%2Fq20dOBXVTA%3D&reserved=0 > > jira/browse/AIRFLOW-1623?focusedCommentId=16171819& > > page=com.atlassian.jira.plugin.system.issuetabpanels: > > comment-tabpanel#comment-16171819 <https://emea01.safelinks. > protection.outlook.com/?url=https%3A%2F%2Fissues.apache. > org%2F&data=01%7C01%7CEKC%40novozymes.com%7C4d2e0189d7e44025c48608d559fa > 5913%7C43d5f49ee03a4d22a2285684196bb001%7C0&sdata=9%2BWOuDmUe9M1WK2% > 2FqSsOtsaNrZnTtbvX%2Fq20dOBXVTA%3D&reserved=0 > > jira/browse/AIRFLOW-1623?focusedCommentId=16171819& > > page=com.atlassian.jira.plugin.system.issuetabpanels: > > comment-tabpanel#comment-16171819> I'm not sure this is the right fix. > It > > also seems like this would end up running the on_kill in a different > > process to the rest of the operator. > > > > I wonder if somewhere a signal handler is missing somewhere in one of the > > `run --local` or `run --raw`. I tried to follow all the paths through > from > > ui to sig handlers but got stuck in a tiwsty maze of classes. (and was > > attempting to do it just from reading the code)? > > > > > > > On 8 Jan 2018, at 13:15, Driesprong, Fokko <fo...@driesprong.frl> > wrote: > > > > > > Yes, for Spark this should work. Depending on the operator and the > > > implementation: > > > https://emea01.safelinks.protection.outlook.com/?url= > https%3A%2F%2Fgithub.com%2Fapache%2Fincubator-airflow% > 2Fblob%2F&data=01%7C01%7CEKC%40novozymes.com% > 7C4d2e0189d7e44025c48608d559fa5913%7C43d5f49ee03a4d22a2285684196b > b001%7C0&sdata=z1p%2FFlZ2cgPo%2BVK9sQJoDDJbKBEIdslsxtGOP%2F% > 2B7g0M%3D&reserved=0 > > 3e6babe8ed8f8f281b67aa3f4e03bf3cfc1bcbaa/airflow/contrib/ > > hooks/spark_submit_hook.py#L412-L428 > > > > > > However this is a big change in behaviour. I'm curious about the > opinion > > of > > > others. > > > > > > Cheers, > > > Fokko > > > > > > > > > 2018-01-08 14:12 GMT+01:00 Milan van der Meer < > > > milan.vanderm...@realimpactanalytics.com>: > > > > > >> Any help? :) > > >> > > >> On Thu, Dec 14, 2017 at 8:12 PM, Milan van der Meer < > > >> milan.vanderm...@realimpactanalytics.com> wrote: > > >> > > >>> I recently openend the following PR: https://emea01.safelinks. > protection.outlook.com/?url=https%3A%2F%2Fgithub.com% > 2Fapache%2F&data=01%7C01%7CEKC%40novozymes.com% > 7C4d2e0189d7e44025c48608d559fa5913%7C43d5f49ee03a4d22a2285684196b > b001%7C0&sdata=7AHCTsBnf%2F9nW0IY4B5ef3zfeY%2FlrH86WXwCt9YPwiU%3D& > reserved=0 > > >>> incubator-airflow/pull/2877 > > >>> > > >>> The problem is that on_kill is not called for operators when you > clear > > a > > >>> task from the UI. > > >>> Thats problematic when working with ex. spark clusters as the jobs on > > the > > >>> cluster need to be killed. > > >>> > > >>> The issue is in the core code of Airflow and Im not familiar enough > > with > > >>> the inner workings there. So I could use some directions on this one > > from > > >>> people who are familiar. > > >>> > > >>> For more info, check out the PR. > > >>> > > >>> Kind regards, > > >>> Milan > > >>> > > >> > > >> > > >> > > >> -- > > >> > > >> *Milan van der Meer* > > >> > > >> *Real**Impact* Analytics *| *Big Data Consultant > > >> https://emea01.safelinks.protection.outlook.com/?url= > www.realimpactanalytics.com&data=01%7C01%7CEKC%40novozymes.com% > 7C4d2e0189d7e44025c48608d559fa5913%7C43d5f49ee03a4d22a2285684196b > b001%7C0&sdata=863cz5%2FF3LXZh52xM3pD0ORkew1PI8Q1Bt% > 2B2pRiayWI%3D&reserved=0 > > >> > > >> *BE *+32 498 45 96 22 <0032498459622>* | Skype *milan.vandermeer.ria > > >> > > > > > > > -- > > *Milan van der Meer* > > *Real**Impact* Analytics *| *Big Data Consultant > https://emea01.safelinks.protection.outlook.com/?url= > www.realimpactanalytics.com&data=01%7C01%7CEKC%40novozymes.com% > 7C4d2e0189d7e44025c48608d559fa5913%7C43d5f49ee03a4d22a2285684196b > b001%7C0&sdata=863cz5%2FF3LXZh52xM3pD0ORkew1PI8Q1Bt% > 2B2pRiayWI%3D&reserved=0 > > *BE *+32 498 45 96 22 <0032498459622>* | Skype *milan.vandermeer.ria >