[ 
https://issues.apache.org/jira/browse/HADOOP-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated HADOOP-3153:
--------------------------------------------

    Attachment: HADOOP-3153.1

Attaching a second patch incorporating above suggestions.

    - Allocate works now, even if .hod directory does not exist.
    - Made changes to error messages.
    - Incorporated minor points and suggestions for changes in test-cases.
    - Now, deallocation happens when the except block of the last part of the 
allocate command is triggered. Tested this by putting in a hard-coded 
exception, and verified that cluster is deallocated.
    - To implement the above, re-factored code in hadoop.py a bit. Tested the 
re-factored code too(by giving an interrupt after ringmaster comes up).
    - Made changes to error codes so that 1 is returned when failing because of 
an invalid state file. ( <1><Configuration error><Incorrect configuration 
values specified in hodrc, or other errors related to HOD configuration. The 
error messages in this case must be sufficient to debug and fix the problem> )

> [HOD] Hod should deallocate cluster if there's a problem in writing 
> information to the state file
> -------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-3153
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3153
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/hod
>    Affects Versions: 0.16.0
>            Reporter: Hemanth Yamijala
>            Assignee: Vinod Kumar Vavilapalli
>             Fix For: 0.17.0
>
>         Attachments: HADOOP-3153, HADOOP-3153.1
>
>
> Consider a scenario where hod runs allocate successfully, but isn't able to 
> save teh allocated information to the clusters.state file. In such a case, it 
> gets an error and exits. But the cluster remains allocated, and unfortunately 
> the user cannot deallocate the cluster now unless he knows the cluster 
> directory.
> It is better if HOD can deallocate the cluster in such an error condition.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to