Hello,

I run Hadoop jobs which read data from Cassandra 1.2.8 and write results back 
to another tables. One of my reduce tasks was killed 2 times by job tracker, 
because it wasn't responding for more than 10 minutes, the 3rd attempt was 
succesfull.

The error message for killed reduce tasks is:

java.io.IOException: TimedOutException(acknowledged_by:0) 
at 
org.apache.cassandra.hadoop.cql3.CqlRecordWriter$RangeClient.run(CqlRecordWriter.java:245)
 
Caused by: TimedOutException(acknowledged_by:0) 
at 
org.apache.cassandra.thrift.Cassandra$execute_prepared_cql3_query_result.read(Cassandra.java:41884)
 
at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:78) 
at 
org.apache.cassandra.thrift.Cassandra$Client.recv_execute_prepared_cql3_query(Cassandra.java:1689)
 
at 
org.apache.cassandra.thrift.Cassandra$Client.execute_prepared_cql3_query(Cassandra.java:1674)
 
at 
org.apache.cassandra.hadoop.cql3.CqlRecordWriter$RangeClient.run(CqlRecordWriter.java:229)
 ,
Task attempt_201310081258_0006_r_000000_0 failed to report status for 600 
seconds. Killing!

I'm wondering how could it happen that task didn't report status for 600 
seconds and how it's related to the TimedOutException at the top of the 
stacktrace.  The write_request_timeout_in_ms is default 10000, so it should 
fail much earlier.


Thanks.

Reply via email to