RE: spark job failure - akka error Association with remote system has failed

2016-01-13 Thread vivek.meghanathan
Identified the problem - the Cassandra seed ip we use was down! From: Vivek Meghanathan (WT01 - NEP) Sent: 13 January 2016 13:06 To: 'user@spark.apache.org' <user@spark.apache.org> Subject: RE: spark job failure - akka error Association with remote system has failed I have used master_ip

Re: spark job failure - akka error Association with remote system has failed

2016-01-13 Thread vivek.meghanathan
:38 AM To: Vivek Meghanathan (WT01 - NEP); user@spark.apache.org Subject: RE: spark job failure - akka error Association with remote system has failed Check the entries in your /etc/hosts file. Also check what the hostname command returns. Mohammed From: vivek.meghanat...@wip

RE: spark job failure - akka error Association with remote system has failed

2016-01-13 Thread Mohammed Guller
@masternode1:36537] has failed, address is now gated for [5000] ms. Reason is: [Disassociated]. From: Vivek Meghanathan (WT01 - NEP) Sent: 13 January 2016 12:18 To: user@spark.apache.org<mailto:user@spark.apache.org> Subject: spark job failure - akka error Association with remote system has fail

RE: spark job failure - akka error Association with remote system has failed

2016-01-12 Thread vivek.meghanathan
, address is now gated for [5000] ms. Reason is: [Disassociated]. From: Vivek Meghanathan (WT01 - NEP) Sent: 13 January 2016 12:18 To: user@spark.apache.org Subject: spark job failure - akka error Association with remote system has failed Hi All, I am running spark 1.3.0 standalone cluster mode, we

spark job failure - akka error Association with remote system has failed

2016-01-12 Thread vivek.meghanathan
Hi All, I am running spark 1.3.0 standalone cluster mode, we have rebooted the cluster servers (system reboot). After that the spark jobs are failing by showing following error (it fails within 7-8 seconds). 2 of the jobs are running fine. All the jobs used to be stable before the system