[ 
https://issues.apache.org/jira/browse/SPARK-2009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14020057#comment-14020057
 ] 

Vadim Chekan commented on SPARK-2009:
-------------------------------------

Folks,

Patch is in progress of review here:
https://github.com/apache/spark/pull/961#issuecomment-45125185

> Key not found exception when slow receiver starts
> -------------------------------------------------
>
>                 Key: SPARK-2009
>                 URL: https://issues.apache.org/jira/browse/SPARK-2009
>             Project: Spark
>          Issue Type: Bug
>          Components: Streaming
>    Affects Versions: 1.0.0
>            Reporter: Vadim Chekan
>
> I got "java.util.NoSuchElementException: key not found: 1401756085000 ms" 
> exception when using kafka stream and 1 sec batchPeriod.
> Investigation showed that the reason is that ReceiverLauncher.startReceivers 
> is asynchronous (started in a thread).
> https://github.com/vchekan/spark/blob/master/streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceiverTracker.scala#L206
> In case of slow starting receiver, such as Kafka, it easily takes more than 
> 2sec to start. In result, no single "compute" will be called on 
> ReceiverInputDStream before first batch job is executed and receivedBlockInfo 
> remains empty (obviously). Batch job will cause 
> ReceiverInputDStream.getReceivedBlockInfo call and "key not found" exception.
> The patch makes getReceivedBlockInfo more robust by tolerating missing values.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to