Hi,

I am using cdh4.4-mr1 with JobTracker HA. During a JobTracker failover, the client does not retry.

The root cause: the method org.apache.hadoop.mapred.JobSubmissionProtocol.getSystemDir() does not declare any exception. When the failover happens during this API call, the client gets a java.lang.reflect.UndeclaredThrowableException that wraps a java.net.ConnectException. The wrapper itself is not one of the exceptions on which retry is allowed:

1) ConnectException
2) NoRouteToHostException
3) UnknownHostException
4) StandbyException (or wrapped StandbyException)
5) ConnectTimeoutException

Ideally, the retry code should check the wrapped exception, or the API should declare IOException like the other methods do.

Is this bug already known to the community? Is there a workaround?

Cheers,
Subroto Sanyal
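
P.S. To illustrate the wrapping described above, here is a minimal, self-contained sketch using only the JDK (no Hadoop classes). DemoProtocol, failingProxy and isRetriable are hypothetical names made up for this example; they are not the Hadoop retry code, and the retriable list is trimmed to the JDK exceptions from the list above. The sketch assumes the client talks to the server through a java.lang.reflect.Proxy whose handler throws ConnectException.

```java
import java.io.IOException;
import java.lang.reflect.InvocationHandler;
import java.lang.reflect.Proxy;
import java.lang.reflect.UndeclaredThrowableException;
import java.net.ConnectException;
import java.net.NoRouteToHostException;
import java.net.UnknownHostException;

public class WrappedRetryCheck {

    // Hypothetical stand-in for the protocol interface:
    // one method declares no exception, the other declares IOException.
    public interface DemoProtocol {
        String getSystemDir();
        String getStagingDir() throws IOException;
    }

    // Builds a proxy whose every call fails with ConnectException,
    // simulating a call made while the active server is going away.
    public static DemoProtocol failingProxy() {
        InvocationHandler handler = (proxy, method, args) -> {
            throw new ConnectException("Connection refused (simulated)");
        };
        return (DemoProtocol) Proxy.newProxyInstance(
                DemoProtocol.class.getClassLoader(),
                new Class<?>[] {DemoProtocol.class},
                handler);
    }

    // Illustrative check only: unwrap UndeclaredThrowableException first,
    // then compare against a (shortened) list of retriable exceptions.
    public static boolean isRetriable(Throwable t) {
        if (t instanceof UndeclaredThrowableException && t.getCause() != null) {
            t = t.getCause();
        }
        return t instanceof ConnectException
                || t instanceof NoRouteToHostException
                || t instanceof UnknownHostException;
    }

    public static void main(String[] args) {
        DemoProtocol client = failingProxy();
        try {
            client.getSystemDir();      // no "throws" clause on this method
        } catch (Throwable t) {
            System.out.println("getSystemDir:  " + t.getClass().getName()
                    + ", cause=" + t.getCause()
                    + ", retriable after unwrap=" + isRetriable(t));
        }
        try {
            client.getStagingDir();     // declares IOException
        } catch (Throwable t) {
            System.out.println("getStagingDir: " + t.getClass().getName()
                    + ", retriable=" + isRetriable(t));
        }
    }
}
```

Because ConnectException is a checked exception that getSystemDir() does not declare, the proxy has to wrap it; the method that declares IOException lets it through unchanged. Either fix suggested above (unwrapping in the retry check, or declaring IOException on the method) would make the two calls behave the same.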