Re: sampling operation for DStream

2016-08-01 Thread Cody Koeninger
ng further processing. The dstream .transform method will let you do any filtering / sampling you could have done on an rdd. On Fri, Jul 29, 2016 at 9:57 AM, Martin Le <martin.leq...@gmail.com> wrote:
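
A minimal sketch of the transform suggestion above, written against the PySpark calls `DStream.transform(func)` and `RDD.sample(withReplacement, fraction, seed)`. The helper names `sample_rdd` and `sample_stream` and the 10% fraction are illustrative assumptions, not something from this thread. The helpers only rely on the objects having `transform` and `sample` methods, so nothing here imports Spark.

```python
def sample_rdd(rdd, fraction, seed=None):
    """Return a random subset of one RDD, sampled without replacement.

    `fraction` is the probability of keeping each element, so the size of
    the result is only approximately fraction * rdd.count().
    """
    return rdd.sample(False, fraction, seed)


def sample_stream(stream, fraction, seed=None):
    """Apply sample_rdd to every RDD that the DStream produces.

    DStream.transform runs the given function once per batch (or once per
    window, if `stream` is already a windowed DStream), so the sampling
    happens before any further processing sees the data.
    Note: passing a fixed seed reuses the same seed for every batch.
    """
    return stream.transform(lambda rdd: sample_rdd(rdd, fraction, seed))
```

For per-window sampling, one option would be to window first and sample second, e.g. `sample_stream(lines.window(30, 10), 0.1)`, where `lines` is a hypothetical input DStream and the durations are in seconds.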

Re: sampling operation for DStream

2016-08-01 Thread Martin Le
29, 2016 at 9:57 AM, Martin Le <martin.leq...@gmail.com> wrote: Hi all, I have to handle high-speed rate data stream. To reduce the heavy load, I want to use sampling technique

Re: sampling operation for DStream

2016-08-01 Thread Cody Koeninger
in.leq...@gmail.com> wrote: Hi all, I have to handle high-speed rate data stream. To reduce the heavy load, I want to use sampling techniques for each stream window. It means that I want to pro

Re: sampling operation for DStream

2016-08-01 Thread Martin Le
s a subset of data instead of whole window data. I saw Spark support sampling operations for RDD, but for DStream, Spark supports sampling operation as well? If not, could you please give me a suggestion how to implement it? Thanks, Martin

Re: sampling operation for DStream

2016-07-29 Thread Cody Koeninger
ream. To reduce the heavy load, I want to use sampling techniques for each stream window. It means that I want to process a subset of data instead of whole window data. I saw Spark support sampling operations for RDD, but for DStream, Spark supports sampling operation as well? If not

sampling operation for DStream

2016-07-29 Thread Martin Le
sampling operation as well? If not, could you please give me a suggestion how to implement it? Thanks, Martin