[ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hemanth Yamijala updated HADOOP-2848:
-------------------------------------

    Attachment: 2848.patch

The attached patch modifies the behavior of the list, info and deallocate commands when the cluster directory is not found.

list: reports the cluster state as 'cluster state unknown' and still prints the cluster id and cluster directory. Previously, it printed nothing.

info: if the cluster directory was deleted, reports that the cluster directory is invalid and prints the cluster id.

deallocate: if the cluster directory was deleted, reports that the cluster directory is invalid and deallocates the cluster based on the cluster id, thus freeing up the nodes.

The patch also adds unit tests for the above conditions.

> [HOD] If a cluster directory is deleted, hod -o list must show it, and deallocate should work.
> ----------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2848
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2848
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/hod
>    Affects Versions: 0.16.0
>            Reporter: Hemanth Yamijala
>            Assignee: Hemanth Yamijala
>             Fix For: 0.17.0
>
>         Attachments: 2848.patch
>
>
> Currently if the cluster directory is deleted, all state about the cluster is
> lost. While this in itself is not a problem, at least recovery in the sense
> of being able to list the torque job id and deallocation to clear up the
> nodes should happen correctly.
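
For readers who want the three behaviors from the update in one place, here is a minimal Python sketch of what list, info and deallocate do when the cluster directory is missing. It is not taken from 2848.patch: the function names, the "alive" state label, the message formats and the release_nodes callback are all hypothetical, and only the 'cluster state unknown' wording comes from the update itself.

```python
import os

STATE_UNKNOWN = "cluster state unknown"  # wording quoted from the update


def list_entry(cluster_id, cluster_dir, live_state="alive"):
    """Build one row of list output (hypothetical format).

    A cluster whose directory is gone is still reported, with an
    unknown state, instead of being skipped.
    """
    state = live_state if os.path.isdir(cluster_dir) else STATE_UNKNOWN
    return (state, cluster_id, cluster_dir)


def info(cluster_id, cluster_dir):
    """Return the lines an info-style command might print (hypothetical format)."""
    if not os.path.isdir(cluster_dir):
        return ["cluster directory '%s' is invalid" % cluster_dir,
                "cluster id: %s" % cluster_id]
    return ["cluster id: %s" % cluster_id,
            "cluster directory: %s" % cluster_dir]


def deallocate(cluster_id, cluster_dir, release_nodes):
    """Free the nodes by cluster id even if the directory no longer exists.

    release_nodes is a hypothetical callback standing in for whatever
    actually releases the nodes for the given id.
    """
    messages = []
    if not os.path.isdir(cluster_dir):
        messages.append("cluster directory '%s' is invalid" % cluster_dir)
    release_nodes(cluster_id)  # runs in both cases, so nodes are always freed
    return messages
```

The point the sketch tries to capture is that the cluster id, not the directory, is what the recovery path relies on: a missing directory downgrades the output to a warning but does not stop deallocation.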