[ 
https://issues.apache.org/jira/browse/GOBBLIN-2011?focusedWorklogId=907866&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-907866
 ]

ASF GitHub Bot logged work on GOBBLIN-2011:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Mar/24 23:37
            Start Date: 01/Mar/24 23:37
    Worklog Time Spent: 10m 
      Work Description: codecov-commenter commented on PR #3888:
URL: https://github.com/apache/gobblin/pull/3888#issuecomment-1974077899

   ## 
[Codecov](https://app.codecov.io/gh/apache/gobblin/pull/3888?src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache)
 Report
   All modified and coverable lines are covered by tests :white_check_mark:
   > Project coverage is 41.54%. Comparing base 
[(`a78ee54`)](https://app.codecov.io/gh/apache/gobblin/commit/a78ee5473dc9f24bf9acbf8432c9d838d25655c7?el=desc&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache)
 to head 
[(`74360c4`)](https://app.codecov.io/gh/apache/gobblin/pull/3888?src=pr&el=desc&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache).
   > Report is 1 commits behind head on master.
   
   
   <details><summary>Additional details and impacted files</summary>
   
   
   ```diff
   @@             Coverage Diff              @@
   ##             master    #3888      +/-   ##
   ============================================
   - Coverage     46.67%   41.54%   -5.13%     
   + Complexity    11154     2197    -8957     
   ============================================
     Files          2219      472    -1747     
     Lines         87657    20046   -67611     
     Branches       9621     2327    -7294     
   ============================================
   - Hits          40911     8328   -32583     
   + Misses        43055    10834   -32221     
   + Partials       3691      884    -2807     
   ```
   
   
   
   </details>
   
   [:umbrella: View full report in Codecov by 
Sentry](https://app.codecov.io/gh/apache/gobblin/pull/3888?src=pr&el=continue&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache).
   
   :loudspeaker: Have feedback on the report? [Share it 
here](https://about.codecov.io/codecov-pr-comment-feedback/?utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=apache).
   




Issue Time Tracking
-------------------

    Worklog Id:     (was: 907866)
    Time Spent: 20m  (was: 10m)

> Fix bug where concurrent flows can be kicked off depending on a jobstatus 
> race condition
> ----------------------------------------------------------------------------------------
>
>                 Key: GOBBLIN-2011
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-2011
>             Project: Apache Gobblin
>          Issue Type: Bug
>            Reporter: William Lo
>            Priority: Major
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> There's a bug that causes GaaS multileader to kick off unintended concurrent 
> flows which happens in the order described below:
> 1. Host A checks the latest flow execution status to ensure the prior flow is 
> not running, sees that the prior execution is still running.
> 2. Host A fails the flow pending execution as it cannot run concurrent flow, 
> this emits a FAILED event to GaaS which is ingested by the JobStatusMonitor.
> 3. Host B checks the latest flow execution status, sees the current flow 
> execution ID which is FAILED (considered a finished flow).
> 4. Host B kicks off the pending flow execution when it shouldn't be.
> To resolve this, we need to ensure that we are looking at the past 2 flow 
> executions, and follow the behavior:
> 1. If there is no prior execution, kick off the pending flow
> 2. If the prior execution is IN PROGRESS, we want to indicate that there is a 
> concurrent flow and block the pending execution.
> 3. If the prior execution is FINISHED, then we want to kick off the pending 
> execution (rely on the DagManager for deduplication of flows because we do 
> not know if the host managing this pending flow is running behind the other 
> hosts).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to