Re: java.io.IOException Error in task deserialization

2014-10-10 Thread Sung Hwan Chung
I haven't seen this at all since switching to HttpBroadcast. It seems TorrentBroadcast might have some issues? On Thu, Oct 9, 2014 at 4:28 PM, Sung Hwan Chung coded...@cs.stanford.edu wrote: I don't think that I saw any other error message. This is all I saw. I'm currently experimenting to

Re: java.io.IOException Error in task deserialization

2014-10-10 Thread Davies Liu
Maybe. TorrentBroadcast is more complicated than HttpBroadcast. Could you tell us how to reproduce this issue? That would help us a lot in improving TorrentBroadcast. Thanks! On Fri, Oct 10, 2014 at 8:46 AM, Sung Hwan Chung coded...@cs.stanford.edu wrote: I haven't seen this at all since switching

Re: java.io.IOException Error in task deserialization

2014-10-09 Thread Davies Liu
This exception should be caused by another one; could you paste all of them here? Also, it would be great if you could provide a script to reproduce this problem. Thanks! On Fri, Sep 26, 2014 at 6:11 AM, Arun Ahuja aahuj...@gmail.com wrote: Has anyone else seen this error in task

Re: java.io.IOException Error in task deserialization

2014-10-09 Thread Davies Liu
Could you provide a script to reproduce this problem? Thanks! On Wed, Oct 8, 2014 at 9:13 PM, Sung Hwan Chung coded...@cs.stanford.edu wrote: This is also happening to me on a regular basis, when the job is large with relatively large serialized objects used in each RDD lineage. A bad thing

Re: java.io.IOException Error in task deserialization

2014-10-09 Thread Sung Hwan Chung
I don't think that I saw any other error message. This is all I saw. I'm currently experimenting to see if this can be alleviated by using HttpBroadcastFactory instead of TorrentBroadcast. So far, with HttpBroadcast, I haven't seen this recurring as of yet. I'll keep you posted. On Thu, Oct 9,

Re: java.io.IOException Error in task deserialization

2014-10-08 Thread Sung Hwan Chung
This is also happening to me on a regular basis, when the job is large with relatively large serialized objects used in each RDD lineage. A bad thing about this is that this exception always stops the whole job. On Fri, Sep 26, 2014 at 11:17 AM, Brad Miller bmill...@eecs.berkeley.edu wrote:

java.io.IOException Error in task deserialization

2014-09-26 Thread Arun Ahuja
Has anyone else seen this error in task deserialization? The task is processing a small amount of data and doesn't seem to have much data hanging on to the closure. I've only seen this with Spark 1.1. Job aborted due to stage failure: Task 975 in stage 8.0 failed 4 times, most recent failure: Lost

Re: java.io.IOException Error in task deserialization

2014-09-26 Thread Brad Miller
I've had multiple jobs crash due to java.io.IOException: unexpected exception type; I've been running the 1.1 branch for some time and am now running the 1.1 release binaries. Note that I only use PySpark. I haven't kept detailed notes or the tracebacks around since there are other problems that

Re: java.io.IOException Error in task deserialization

2014-09-26 Thread Arun Ahuja
No, for me as well it is non-deterministic. It happens in a piece of code that does many filters and counts on a small set of records (~1k-10k). The original set is persisted in memory and we have a Kryo serializer set for it. The task itself takes in just a few filtering parameters. This with

Re: java.io.IOException Error in task deserialization

2014-09-26 Thread Brad Miller
FWIW I suspect that each count operation is an opportunity for you to trigger the bug, and each filter operation increases the likelihood of setting up the bug. I normally don't come across this error until my job has been running for an hour or two and had a chance to build up longer lineages
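
For readers who want to try the workaround discussed in this thread (switching from TorrentBroadcast to HttpBroadcast), here is a minimal PySpark sketch. It assumes a Spark 1.x installation where the `spark.broadcast.factory` setting is available; the helper names `broadcast_settings` and `make_conf` are hypothetical and do not come from the thread. Nobody in the thread confirmed this as a fix, only that the error had not recurred so far after the switch.

```python
# Sketch only: select the broadcast implementation through SparkConf.
# Assumption: Spark 1.x, where spark.broadcast.factory is a recognized setting.

HTTP_FACTORY = "org.apache.spark.broadcast.HttpBroadcastFactory"
TORRENT_FACTORY = "org.apache.spark.broadcast.TorrentBroadcastFactory"


def broadcast_settings(factory=HTTP_FACTORY):
    """Return the configuration entry that picks a broadcast factory (hypothetical helper)."""
    return {"spark.broadcast.factory": factory}


def make_conf(settings):
    """Copy the settings into a SparkConf (hypothetical helper).

    pyspark is imported lazily so the settings helper above can be
    inspected on a machine without Spark installed.
    """
    from pyspark import SparkConf

    conf = SparkConf()
    for key, value in settings.items():
        conf.set(key, value)
    return conf


if __name__ == "__main__":
    # Print the setting that would be applied.
    for key, value in broadcast_settings().items():
        print(key + "=" + value)
```

In a job, the result of `make_conf(broadcast_settings())` would be passed as `SparkContext(conf=...)`. Passing `TORRENT_FACTORY` instead switches back, which may help when trying to reproduce the error.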