Hi,

I am using cdh4.4-mr1 for my scenario with Jobtracker HA.
During a failover of Jobtracker,  the client is not retrying.
The root cause is:
The method:
org.apache.hadoop.mapred.JobSubmissionProtocol.getSystemDir() doesn't throw any 
exception and when client failover happens on this API call, the exception 
being thrown is:
java.lang.reflect.UndeclaredThrowableException which wraps 
java.net.ConnectException.

which is not one of the exception on which retry is allowed:
1) ConnectException
2) NoRouteToHostException
3) UnknownHostException
4) StandbyException (or Wrapped StandbyException)
5) ConnectTimeoutException

Ideally the code should check wrapped exception or the API should throw 
IOExceptions like others.

Is this bug being already known to community ??
Do we have a workaround for this??

Cheers,
Subroto Sanyal

Attachment: signature.asc
Description: Message signed with OpenPGP using GPGMail

Reply via email to