[jira] [Updated] (YARN-1405) RM should crash and print permission error for nonwritable/readable local path in yarn.resourcemanager.fs.state-store.uri

2013-12-06 Thread Jian He (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jian He updated YARN-1405:
--

Attachment: YARN-1405.1.patch

upload a patch to throw exception directly instead of calling sys exit 

> RM should crash and print permission error for nonwritable/readable local 
> path in yarn.resourcemanager.fs.state-store.uri
> -
>
> Key: YARN-1405
> URL: https://issues.apache.org/jira/browse/YARN-1405
> Project: Hadoop YARN
>  Issue Type: Sub-task
>Affects Versions: 2.2.0
>Reporter: Yesha Vora
>Assignee: Jian He
> Attachments: YARN-1405.1.patch, rm-threaddump.out
>
>
> Enable yarn.resourcemanager.recovery.enabled=true and Pass a local path to 
> yarn.resourcemanager.fs.state-store.uri. such as "file:///tmp/MYTMP"
> if the directory  /tmp/MYTMP is not readable or writable, RM should crash and 
> should print "Permission denied Error"
> Currently, RM throws "java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist" Error. RM 
> returns Exiting status 1 but RM process does not shutdown. 
> Snapshot of Resource manager log:
> 2013-09-27 18:31:36,621 INFO  security.NMTokenSecretManagerInRM 
> (NMTokenSecretManagerInRM.java:rollMasterKey(97)) - Rolling master-key for 
> nm-tokens
> 2013-09-27 18:31:36,694 ERROR resourcemanager.ResourceManager 
> (ResourceManager.java:serviceStart(640)) - Failed to load/recover state
> java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist
> at 
> org.apache.hadoop.fs.RawLocalFileSystem.listStatus(RawLocalFileSystem.java:379)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1478)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1518)
> at 
> org.apache.hadoop.fs.ChecksumFileSystem.listStatus(ChecksumFileSystem.java:564)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadRMDTSecretManagerState(FileSystemRMStateStore.java:188)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadState(FileSystemRMStateStore.java:112)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:635)
> at 
> org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:855)
> 2013-09-27 18:31:36,697 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - 
> Exiting with status 1



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (YARN-1405) RM should crash and print permission error for nonwritable/readable local path in yarn.resourcemanager.fs.state-store.uri

2013-12-06 Thread Jian He (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jian He updated YARN-1405:
--

Attachment: rm-threaddump.out

upload rm thread dump

> RM should crash and print permission error for nonwritable/readable local 
> path in yarn.resourcemanager.fs.state-store.uri
> -
>
> Key: YARN-1405
> URL: https://issues.apache.org/jira/browse/YARN-1405
> Project: Hadoop YARN
>  Issue Type: Sub-task
>Affects Versions: 2.2.0
>Reporter: Yesha Vora
>Assignee: Jian He
> Attachments: rm-threaddump.out
>
>
> Enable yarn.resourcemanager.recovery.enabled=true and Pass a local path to 
> yarn.resourcemanager.fs.state-store.uri. such as "file:///tmp/MYTMP"
> if the directory  /tmp/MYTMP is not readable or writable, RM should crash and 
> should print "Permission denied Error"
> Currently, RM throws "java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist" Error. RM 
> returns Exiting status 1 but RM process does not shutdown. 
> Snapshot of Resource manager log:
> 2013-09-27 18:31:36,621 INFO  security.NMTokenSecretManagerInRM 
> (NMTokenSecretManagerInRM.java:rollMasterKey(97)) - Rolling master-key for 
> nm-tokens
> 2013-09-27 18:31:36,694 ERROR resourcemanager.ResourceManager 
> (ResourceManager.java:serviceStart(640)) - Failed to load/recover state
> java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist
> at 
> org.apache.hadoop.fs.RawLocalFileSystem.listStatus(RawLocalFileSystem.java:379)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1478)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1518)
> at 
> org.apache.hadoop.fs.ChecksumFileSystem.listStatus(ChecksumFileSystem.java:564)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadRMDTSecretManagerState(FileSystemRMStateStore.java:188)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadState(FileSystemRMStateStore.java:112)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:635)
> at 
> org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:855)
> 2013-09-27 18:31:36,697 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - 
> Exiting with status 1



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (YARN-1405) RM should crash and print permission error for nonwritable/readable local path in yarn.resourcemanager.fs.state-store.uri

2013-11-12 Thread Jian He (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jian He updated YARN-1405:
--

Issue Type: Sub-task  (was: Bug)
Parent: YARN-128

> RM should crash and print permission error for nonwritable/readable local 
> path in yarn.resourcemanager.fs.state-store.uri
> -
>
> Key: YARN-1405
> URL: https://issues.apache.org/jira/browse/YARN-1405
> Project: Hadoop YARN
>  Issue Type: Sub-task
>Affects Versions: 2.2.0
>Reporter: Yesha Vora
>
> Enable yarn.resourcemanager.recovery.enabled=true and Pass a local path to 
> yarn.resourcemanager.fs.state-store.uri. such as "file:///tmp/MYTMP"
> if the directory  /tmp/MYTMP is not readable or writable, RM should crash and 
> should print "Permission denied Error"
> Currently, RM throws "java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist" Error. RM 
> returns Exiting status 1 but RM process does not shutdown. 
> Snapshot of Resource manager log:
> 2013-09-27 18:31:36,621 INFO  security.NMTokenSecretManagerInRM 
> (NMTokenSecretManagerInRM.java:rollMasterKey(97)) - Rolling master-key for 
> nm-tokens
> 2013-09-27 18:31:36,694 ERROR resourcemanager.ResourceManager 
> (ResourceManager.java:serviceStart(640)) - Failed to load/recover state
> java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist
> at 
> org.apache.hadoop.fs.RawLocalFileSystem.listStatus(RawLocalFileSystem.java:379)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1478)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1518)
> at 
> org.apache.hadoop.fs.ChecksumFileSystem.listStatus(ChecksumFileSystem.java:564)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadRMDTSecretManagerState(FileSystemRMStateStore.java:188)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadState(FileSystemRMStateStore.java:112)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:635)
> at 
> org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:855)
> 2013-09-27 18:31:36,697 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - 
> Exiting with status 1



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (YARN-1405) RM should crash and print permission error for nonwritable/readable local path in yarn.resourcemanager.fs.state-store.uri

2013-11-12 Thread Yesha Vora (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yesha Vora updated YARN-1405:
-

Summary: RM should crash and print permission error for 
nonwritable/readable local path in yarn.resourcemanager.fs.state-store.uri  
(was: RM should crash and print permission error for nonwritable/readble local 
path in yarn.resourcemanager.fs.state-store.uri)

> RM should crash and print permission error for nonwritable/readable local 
> path in yarn.resourcemanager.fs.state-store.uri
> -
>
> Key: YARN-1405
> URL: https://issues.apache.org/jira/browse/YARN-1405
> Project: Hadoop YARN
>  Issue Type: Bug
>Affects Versions: 2.2.0
>Reporter: Yesha Vora
>
> Enable yarn.resourcemanager.recovery.enabled=true and Pass a local path to 
> yarn.resourcemanager.fs.state-store.uri. such as "file:///tmp/MYTMP"
> if the directory  /tmp/MYTMP is not readable or writable, RM should crash and 
> should print "Permission denied Error"
> Currently, RM throws "java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist" Error. RM 
> returns Exiting status 1 but RM process does not shutdown. 
> Snapshot of Resource manager log:
> 2013-09-27 18:31:36,621 INFO  security.NMTokenSecretManagerInRM 
> (NMTokenSecretManagerInRM.java:rollMasterKey(97)) - Rolling master-key for 
> nm-tokens
> 2013-09-27 18:31:36,694 ERROR resourcemanager.ResourceManager 
> (ResourceManager.java:serviceStart(640)) - Failed to load/recover state
> java.io.FileNotFoundException: File 
> file:/tmp/MYTMP/FSRMStateRoot/RMDTSecretManagerRoot does not exist
> at 
> org.apache.hadoop.fs.RawLocalFileSystem.listStatus(RawLocalFileSystem.java:379)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1478)
> at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1518)
> at 
> org.apache.hadoop.fs.ChecksumFileSystem.listStatus(ChecksumFileSystem.java:564)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadRMDTSecretManagerState(FileSystemRMStateStore.java:188)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore.loadState(FileSystemRMStateStore.java:112)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:635)
> at 
> org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
> at 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:855)
> 2013-09-27 18:31:36,697 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - 
> Exiting with status 1



--
This message was sent by Atlassian JIRA
(v6.1#6144)