Long pauses after writing to sequence files

2015-01-30 Thread Corey Nolet
We have a series of spark jobs which run in succession over various cached datasets, do small groups and transforms, and then call saveAsSequenceFile() on them. Each call to save as a sequence file appears to have done its work, the task says it completed in xxx.x seconds but then it pauses

Re: Long pauses after writing to sequence files

2015-01-30 Thread Akhil Das
Not quiet sure, but it could be the GC Pause, if you are holding too much objects in memory. You can check this tuning http://spark.apache.org/docs/1.2.0/tuning.html part if you haven't already been through it. Thanks Best Regards On Sat, Jan 31, 2015 at 7:22 AM, Corey Nolet cjno...@gmail.com