[ 
https://issues.apache.org/jira/browse/SPARK-21755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16165678#comment-16165678
 ] 

Ankur edited comment on SPARK-21755 at 9/14/17 3:41 AM:
--------------------------------------------------------

Issue can also be reproduced on Spark 2.2 version on an EMR cluster with 
configurations: 

* EMR Release label:      emr-5.8.0
* Hadoop distribution:      Amazon 2.7.3





was (Author: ankurkasliwal26):
Issue can also be reproduced on Spark 2.2 version on an EMR cluster with same 
configurations: 

* EMR Release label:      emr-5.8.0
* Hadoop distribution:      Amazon 2.7.3




> Spark 2.1.1 UI page not displaying any dynamic updates on job progress after 
> showing progress for initial few minutes of job run.
> ---------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-21755
>                 URL: https://issues.apache.org/jira/browse/SPARK-21755
>             Project: Spark
>          Issue Type: Bug
>          Components: Web UI
>    Affects Versions: 2.1.1
>         Environment: Issue was produced on an EMR cluster with following 
> configurations:
> ### EMR Release label:      emr-5.6.0
> ### Hadoop distribution:      Amazon 2.7.3
> ### Applications installed:   Hive 2.1.1, Spark 2.1.1
>            Reporter: Ankur
>
> When a Spark SQL job is ran, Spark Application’s Web Console ( UI ) is 
> getting intermittently updated for initial few minutes ( ~ 10-15 minutes ) 
> and after that there are no updates on job progress ( even after job 
> execution completes).  As soon as "Spark SQL" session is terminated I can see 
> Spark UI got updated with the job summary.
> Issue was reproduced by using spark-sql on a data-set of around 1.2 TB size. 
> Here are the steps:
> Step 1> An EMR cluster is launched ( release emr-5.6.0 and applications as 
> Hive 2.1.1, Spark 2.1.1 )
> Step 2>> Following command is ran:
> spark-sql> CREATE TABLE total_flights USING com.databricks.spark.csv OPTIONS 
> (path "s3://bucket/test_web_UI/flight/", header "true", inferSchema "true");
> Data-set used : Flights history in CSV files provided by US Department of 
> Transportation, Bureau of Transportation Statistics - 
> https://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=236&DB_Short_Name=On-Time
> Step 3> There were no updates on Web UI after initial ~10 minutes. Web UI did 
> not got updated even after few hours when job was completed successfully. 
> Step 4> Once the spark-sql session is ended, Spark UI got updated with the 
> job summary correctly as expected. 
> I have verified that "spark.history.fs.update.interval" is set to default 
> value of 10 seconds as mentioned in this document 
> "https://spark.apache.org/docs/latest/monitoring.html ".  



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to