Thanks a lot.

Sent from my iPad

> On Mar 7, 2015, at 8:26 AM, Sean Owen <so...@cloudera.com> wrote:
> 
>> On Sat, Mar 7, 2015 at 4:17 PM, Jean-Pascal Billaud <j...@tellapart.com> 
>> wrote:
>> So given this let's go a bit further. Imagine my static factory provides a 
>> stats collector that my various map() code would use to export some metrics 
>> while mapping tuples. This stats collector comes with a timer that flush the 
>> stats buffer once every 5mn for instance. Given that executor JVM should not 
>> be reinitialized over the life of the DStream in practice, it is totally 
>> reasonable to assume that this timer will be able to do his job. Right?
> 
> Ignoring failures, the executors should not be restarted over the life
> of the streaming app, yes. You will have, potentially, many executors
> at once of course.
> 
>> Given what you said earlier that totally makes sense. In general is there 
>> any spark architecture documentation other than the code that gives a good 
>> overview of the thing we talked about?
> 
> http://spark.apache.org/docs/latest/cluster-overview.html
> 
> This is entirely applicable to streaming too.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to