If I perform a 'collect' action on the RDD, I can see a new stage being created in the Spark web UI (http://master:4040/stages/), but when I perform a 'first' action, I don't see any stage being created. However, on the console I see these lines:
14/02/19 10:51:31 INFO SparkContext: Starting job: first at xxx.scala:110
14/02/19 10:51:31 INFO DAGScheduler: Got job 110 (first at xxx.scala:110) with 1 output partitions (allowLocal=true)
14/02/19 10:51:31 INFO DAGScheduler: Final stage: Stage 2 (first at xxx.scala:110)
14/02/19 10:51:31 INFO DAGScheduler: Parents of final stage: List()
14/02/19 10:51:31 INFO DAGScheduler: Missing parents: List()

So why doesn't the web UI list the stages created when I run the 'first' action?
