Re: Spark Streaming S3 Performance Implications

2015-04-01 Thread Mike Trienis
ch > interval). > > this goes for any spark streaming implementation - not just Kinesis. > > lemme know if that works for you. > > thanks! > > -Chris > _ > From: Mike Trienis > Sent: Wednesday, March 18, 2015 2:45 PM > Subject: S

Re: Spark Streaming S3 Performance Implications

2015-03-21 Thread Ted Yu
for any spark streaming implementation - not just Kinesis. > > lemme know if that works for you. > > thanks! > > -Chris > _____ > From: Mike Trienis > Sent: Wednesday, March 18, 2015 2:45 PM > Subject: Spark Streaming S3 Performance Implications

Re: Spark Streaming S3 Performance Implications

2015-03-21 Thread Chris Fregly
is a multiple of the batch interval). this goes for any spark streaming implementation - not just Kinesis. lemme know if that works for you. thanks! -Chris  _ From: Mike Trienis Sent: Wednesday, March 18, 2015 2:45 PM Subject: Spark Streaming S3 Performance Implicatio

Spark Streaming S3 Performance Implications

2015-03-18 Thread Mike Trienis
Hi All, I am pushing data from Kinesis stream to S3 using Spark Streaming and noticed that during testing (i.e. master=local[2]) the batches (1 second intervals) were falling behind the incoming data stream at about 5-10 events / second. It seems that the rdd.saveAsTextFile(s3n://...) is taking at