adoroszlai opened a new pull request, #7483:
URL: https://github.com/apache/ozone/pull/7483

   ## What changes were proposed in this pull request?
   
   Background:
   - CI checks collect problems that caused the failure into a report file 
(usually `summary.txt`).  This file is shown in the jobs' `Summary of failures` 
step.
   - Checks are marked as failed if the report file is not empty.
   - There may be failures that affect the check, but are not specific to its 
validation.  E.g. findbugs may fail due to compile error.  This is not 
reflected in the report file, so the check may be passing despite compile 
errors.
   
   On current `master`, Kubernetes check runs into error:
   
   ```
   RuntimeException: Failed to set directory permissions for /data: /data: 
Operation not permitted
        at 
org.apache.hadoop.hdds.server.ServerUtils.getDirectoryFromConfig(ServerUtils.java:188)
        at 
org.apache.hadoop.hdds.utils.db.DBDefinition.getDBLocation(DBDefinition.java:58)
        at 
org.apache.hadoop.hdds.utils.db.DBStoreBuilder.getDBDirPath(DBStoreBuilder.java:169)
        at 
org.apache.hadoop.ozone.container.metadata.WitnessedContainerMetadataStoreImpl.get(WitnessedContainerMetadataStoreImpl.java:45)
        at 
org.apache.hadoop.ozone.container.ozoneimpl.OzoneContainer.<init>(OzoneContainer.java:192)
        at 
org.apache.hadoop.ozone.container.common.statemachine.DatanodeStateMachine.<init>(DatanodeStateMachine.java:183)
        at 
org.apache.hadoop.ozone.HddsDatanodeService.start(HddsDatanodeService.java:297)
   ```
   
   which was introduced in HDDS-11650, but went unnoticed due to the false 
"passing" status.
   
   This change ensures that check status reflects errors:
   
   1. Add common post-processing step in check scripts to make sure 
`summary.txt` is not left empty if any failures are encountered.
   2. Remove `continue-on-error` from main step of checks.  Previously this was 
needed to make GitHub UI auto-open the `Summary of failures` step instead of 
`Execute tests`.
   
   It also fixes the permission problem in `kubernetes` check 
(e09ba8f228838f15f857b5fcf0e667abfc91d34b).
   
   https://issues.apache.org/jira/browse/HDDS-11718
   
   ## How was this patch tested?
   
   Verified that `kubernetes` check is failing before applying the fix for the 
permission problem:
   https://github.com/adoroszlai/ozone/actions/runs/12012294888/job/33484088312
   
   Temporarily introduced compile error, verified `findbugs` (as well as 
`build` and `checkstyle`) fails:
   https://github.com/adoroszlai/ozone/actions/runs/12012941631/job/33485335513
   
   Regular CI:
   https://github.com/adoroszlai/ozone/actions/runs/12012816442
   
   Kubernetes is passing (without failures):
   
   ```
   2024-11-25 15:19:38 INFO  SCMSafeModeManager:248 - SCM exiting safe mode.
   ...
   Basic Freon smoketest                                                 | PASS 
|
   ...
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to