Jakub Dubovsky created SPARK-23282:
--------------------------------------

             Summary: Probable mistake in hasLaunchedTask condition
                 Key: SPARK-23282
                 URL: https://issues.apache.org/jira/browse/SPARK-23282
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.2.1
            Reporter: Jakub Dubovsky


In class {code}TaskSchedulerImpl{code} there is a 
[variable|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala#L97]
 tracking whether some task was already launched. It is then set to true 
[here|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala#L377]
 whenever some task is launched.

I believe the 
[condition|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala#L376]
 should be {code}tasks.map(_.size).sum > 0{code} instead of current 
{code}tasks.size > 0{code}

Current condition asks whether there are some resource offers while it should 
ask whether there were some launched tasks. In some cases it might happen that 
resources are offered but no tasks launched. In such a case no warning is 
issued.

Can you confirm this is valid?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to