[ https://issues.apache.org/jira/browse/HADOOP-15593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16550762#comment-16550762 ]
Gabor Bota commented on HADOOP-15593: ------------------------------------- Thanks [~eyang] for the review! There is no difference between the v1 and v2 solution about how this is handled. Please also note the following comment in the IOException cache, where getNextTgtRenewalTime is used and RetryPolicy is defined: {noformat} // Use a dummy maxRetries to create the policy. The policy will // only be used to get next retry time with exponential back-off. // The final retry time will be later limited within the // tgt endTime in getNextTgtRenewalTime. {noformat} I think a solution for this would be to move this to the try block, instead of creating the RetryPolicy in the catch block, so all renewal time would be based on the RetryPolicy. As this issue is a blocker one, so really need to be finished asap (also only aims to target the NPE), I will create another issue for changing the retry behavior. > UserGroupInformation TGT renewer throws NPE > ------------------------------------------- > > Key: HADOOP-15593 > URL: https://issues.apache.org/jira/browse/HADOOP-15593 > Project: Hadoop Common > Issue Type: Bug > Components: security > Affects Versions: 3.0.0 > Reporter: Wei-Chiu Chuang > Assignee: Gabor Bota > Priority: Blocker > Attachments: HADOOP-15593.001.patch, HADOOP-15593.002.patch > > > Found the following NPE thrown in UGI tgt renewer. The NPE was thrown within > an exception handler so the original exception was hidden, though it's likely > caused by expired tgt. > {noformat} > 18/07/02 10:30:57 ERROR util.SparkUncaughtExceptionHandler: Uncaught > exception in thread Thread[TGT Renewer for f...@example.com,5,main] > java.lang.NullPointerException > at > javax.security.auth.kerberos.KerberosTicket.getEndTime(KerberosTicket.java:482) > at > org.apache.hadoop.security.UserGroupInformation$1.run(UserGroupInformation.java:894) > at java.lang.Thread.run(Thread.java:748){noformat} > Suspect it's related to [https://bugs.openjdk.java.net/browse/JDK-8154889]. > The relevant code was added in HADOOP-13590. File this jira to handle the > exception better. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org