[jira] [Commented] (YARN-4506) Application was killed by a resourcemanager, In the JobHistory Can't see the job detail

2015-12-30 Thread Junping Du (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075135#comment-15075135
 ] 

Junping Du commented on YARN-4506:
--

2.2 is too old. It is quite likely that this issue has already been fixed in 
recent releases. Please check whether the latest releases, 2.6.3 or 2.7.1, 
have the same issue. If not, let's resolve this JIRA as Cannot Reproduce.

> Application was killed by a resourcemanager, In the JobHistory Can't see the 
> job detail
> ---
>
> Key: YARN-4506
> URL: https://issues.apache.org/jira/browse/YARN-4506
> Project: Hadoop YARN
> Issue Type: Bug
> Affects Versions: 2.2.0
> Reporter: tangshangwen
> Attachments: am.rar
>
>
> 2015-12-15 03:08:54,073 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.v2.app.MRAppMaster: MRAppMaster received a 
> signal. Signaling RMCommunicator and JobHistoryEventHandler.
> 2015-12-15 03:08:54,073 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: RMCommunicator 
> notified that iSignalled is: true
> 2015-12-15 03:08:54,073 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.v2.app.MRAppMaster: Notify RMCommunicator 
> isAMLastRetry: true
> 2015-12-15 03:08:54,073 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: RMCommunicator 
> notified that shouldUnregistered is: true
> 2015-12-15 03:08:54,073 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.v2.app.MRAppMaster: Notify JHEH isAMLastRetry: 
> true
> 2015-12-15 03:08:54,073 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.jobhistory.JobHistoryEventHandler: 
> JobHistoryEventHandler notified that forceJobCompletion is true
> 2015-12-15 03:08:54,074 INFO [Thread-1] 
> org.apache.hadoop.mapreduce.jobhistory.JobHistoryEventHandler: Stopping 
> JobHistoryEventHandler. Size of the outstanding queue size is 0
> 2015-12-15 03:08:54,074 INFO [eventHandlingThread] 
> org.apache.hadoop.mapreduce.jobhistory.JobHistoryEventHandler: EventQueue 
> take interrupted. Returning
> 2015-12-15 03:08:54,078 WARN [Thread-1] 
> org.apache.hadoop.mapreduce.jobhistory.JobHistoryEventHandler: Found jobId 
> job_1449835724839_219910 to have not been closed. Will close



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (YARN-4506) Application was killed by a resourcemanager, In the JobHistory Can't see the job detail

2015-12-30 Thread tangshangwen (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075140#comment-15075140
 ] 

tangshangwen commented on YARN-4506:


OK, I'll try to fix it.


[jira] [Commented] (YARN-4506) Application was killed by a resourcemanager, In the JobHistory Can't see the job detail

2015-12-24 Thread tangshangwen (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071325#comment-15071325
 ] 

tangshangwen commented on YARN-4506:


From my am.log, I found that when the MRAppMaster received the signal, the 
thread did not copy job_ID.jhist to /user/history/done_intermediate.


[jira] [Commented] (YARN-4506) Application was killed by a resourcemanager, In the JobHistory Can't see the job detail

2015-12-24 Thread Daniel Templeton (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071106#comment-15071106
 ] 

Daniel Templeton commented on YARN-4506:


Are you sure you're on 2.2?  That's pretty darned old.  Any additional details 
you can provide would also be helpful.


[jira] [Commented] (YARN-4506) Application was killed by a resourcemanager, In the JobHistory Can't see the job detail

2015-12-24 Thread tangshangwen (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071119#comment-15071119
 ] 

tangshangwen commented on YARN-4506:


I'm sure it happened on 2.2, because I found that the AM was killed by the RM 
and I can't find the job in JobHistory.

2015-12-15 02:56:48,916 INFO [main] org.mortbay.log: Extract 
jar:file:/software/servers/hadoop-2.2.0/share/hadoop/yarn/hadoop-yarn-common-2.2.0.jar!/webapps/mapreduce
