ztskycn opened a new issue, #10578:
URL: https://github.com/apache/cloudstack/issues/10578

   Description:
   I'm experiencing a critical issue with CloudStack 4.20 where all API 
endpoints become unresponsive approximately every 10 days. The only temporary 
resolution is to restart the CloudStack management server.
   
   Observed Behavior:
   
   API requests timeout/fail completely after ~10 days of uptime
   
   No explicit ERROR messages in logs prior to outage
   
   Found an unusually large INFO-level log entry (3MB per line) that might be 
relevant
   
   Attached log file: [filename.log] (Please ensure you actually attach the 
file via GitHub interface)
   
   Environment:
   
   CloudStack Version: 4.20.0.0
   
   OS:Ubuntu 24.04
   
   
   Steps to Reproduce:
   
   Start CloudStack management server
   
   Operate normally for ~10 days
   
   API services become unavailable without obvious triggers
   
   Expected Behavior:
   API endpoints should remain available continuously without requiring manual 
restarts.
   
   Additional Context:
   
   The large INFO-level log entry repeats periodically (full content attached)
   
   No observed resource exhaustion (CPU/MEM) before outages
   
   Problem persists across multiple maintenance windows
   
   Troubleshooting Attempted:
   
   Reviewed standard error logs - no smoking gun
   
   Monitored system resources - no apparent bottlenecks
   
   Server restart temporarily resolves the issue
   
   Request:
   Please help investigate:
   
   Potential memory leaks or thread blocking in the 4.20 codebase
   
   Significance of the oversized INFO log entries
   
   Known issues matching this periodic outage pattern
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to