RDD storage in spark steaming

2015-03-23 Thread abhi
HI,
i have a simple question about creating RDD . Whenever RDD is created in
spark streaming for the particular time window .When does the RDD gets
stored .

1. Does it get stored at the Driver machine ? or it gets stored on all the
machines in the cluster ?
2. Does the data gets stored in memory by default ? Can it store at the
memory and disk ? How can it configured ?


Thanks,
Abhi


Re: RDD storage in spark steaming

2015-03-23 Thread Jeffrey Jedele
Hey Abhi,
many of StreamingContext's methods to create input streams take a
StorageLevel parameter to configure this behavior. RDD partitions are
generally stored in the in-memory cache of worker nodes I think. You can
also configure replication and spilling to disk if needed.

Regards,
Jeff

2015-03-23 15:26 GMT+01:00 abhi abhishek...@gmail.com:

 HI,
 i have a simple question about creating RDD . Whenever RDD is created in
 spark streaming for the particular time window .When does the RDD gets
 stored .

 1. Does it get stored at the Driver machine ? or it gets stored on all the
 machines in the cluster ?
 2. Does the data gets stored in memory by default ? Can it store at the
 memory and disk ? How can it configured ?


 Thanks,
 Abhi