[ https://issues.apache.org/jira/browse/SPARK-16338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rohit Agarwal updated SPARK-16338:
----------------------------------
    Attachment: error

Attached the file with the driver log for a couple of batch durations.

> Streaming driver running on standalone cluster mode with supervise goes into
> bad state when application is killed from the UI
> -----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-16338
>                 URL: https://issues.apache.org/jira/browse/SPARK-16338
>             Project: Spark
>          Issue Type: Bug
>          Components: Deploy, Streaming, Web UI
>    Affects Versions: 1.6.1
>            Reporter: Rohit Agarwal
>         Attachments: error
>
>
> We are going to start using Spark Streaming in production and I was testing
> various failure scenarios. I noticed one case where the Spark Streaming
> driver got into a bad state.
> Steps to reproduce:
> 1. Create a Spark Streaming application with Direct Kafka Streams and
> checkpointing enabled.
> 2. Deploy the application to a Spark standalone cluster in cluster mode
> with --supervise.
> 3. Let it run for some time.
> 4. Kill the application (but not the driver) from the Spark Master UI.
> 5. The driver keeps running but doesn't restart the application. What's
> worse is that it keeps updating the checkpoint every batch duration, so when
> you do restart the driver, it starts at a later point and you have lost data.
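
The data loss described in step 5 can be illustrated with a toy model. This is a minimal Python sketch, not Spark code: the `ToyDriver` class, its fields, the `missing_offsets` helper, and the offset numbers are all hypothetical. It assumes only the behaviour reported above, namely that the checkpoint keeps advancing every batch duration while the application is dead.

```python
from dataclasses import dataclass, field


@dataclass
class ToyDriver:
    """Hypothetical stand-in for a streaming driver; not a Spark API."""
    checkpoint_offset: int = 0   # offset a restarted driver would resume from
    app_alive: bool = True       # False once the application has been killed
    processed: list = field(default_factory=list)

    def run_batch(self, batch_size: int) -> None:
        start = self.checkpoint_offset
        end = start + batch_size
        if self.app_alive:
            # Healthy case: the records of this batch are actually processed.
            self.processed.extend(range(start, end))
        # Assumed behaviour from the report: the checkpoint advances every
        # batch duration, whether or not the batch was processed.
        self.checkpoint_offset = end


def missing_offsets(driver: ToyDriver) -> list:
    """Offsets below the checkpoint that were never processed."""
    return sorted(set(range(driver.checkpoint_offset)) - set(driver.processed))


driver = ToyDriver()
driver.run_batch(10)        # offsets 0..9 are processed
driver.app_alive = False    # application killed; driver keeps running
driver.run_batch(10)        # offsets 10..19 skipped, checkpoint still moves
driver.run_batch(10)        # offsets 20..29 skipped, checkpoint still moves
driver.app_alive = True     # driver restarted; resumes from the checkpoint
driver.run_batch(10)        # offsets 30..39 are processed

lost = missing_offsets(driver)
print(f"resumed at offset 30, lost {len(lost)} records")
```

In this model the restarted driver resumes at offset 30, so offsets 10 through 29 are never processed, which matches the "starts at a later point" symptom in the report.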