ztskycn opened a new issue, #10578: URL: https://github.com/apache/cloudstack/issues/10578
Description:
I'm experiencing a critical issue with CloudStack 4.20 where all API endpoints become unresponsive approximately every 10 days. The only temporary resolution is to restart the CloudStack management server.

Observed Behavior:
- API requests time out or fail completely after ~10 days of uptime
- No explicit ERROR messages in logs prior to the outage
- Found an unusually large INFO-level log entry (3 MB per line) that might be relevant

Attached log file: [filename.log] (Please ensure you actually attach the file via GitHub interface)

Environment:
- CloudStack Version: 4.20.0.0
- OS: Ubuntu 24.04

Steps to Reproduce:
1. Start the CloudStack management server
2. Operate normally for ~10 days
3. API services become unavailable without obvious triggers

Expected Behavior:
API endpoints should remain available continuously without requiring manual restarts.

Additional Context:
- The large INFO-level log entry repeats periodically (full content attached)
- No observed resource exhaustion (CPU/MEM) before outages
- Problem persists across multiple maintenance windows

Troubleshooting Attempted:
- Reviewed standard error logs: no smoking gun
- Monitored system resources: no apparent bottlenecks
- Server restart temporarily resolves the issue

Request:
Please help investigate:
- Potential memory leaks or thread blocking in the 4.20 codebase
- Significance of the oversized INFO log entries
- Known issues matching this periodic outage pattern
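For anyone trying to confirm whether their own logs contain similarly oversized entries, here is a minimal sketch that scans a log file and reports every line above a size threshold. It assumes the management server log is a plain-text file readable line by line; the function name `find_oversized_lines`, the 1 MB default threshold, and the preview length are illustrative choices, not part of CloudStack.

```python
from typing import Iterator, Tuple


def find_oversized_lines(
    path: str,
    threshold_bytes: int = 1_000_000,
    preview_chars: int = 120,
) -> Iterator[Tuple[int, int, str]]:
    """Yield (line_number, size_in_bytes, preview) for each oversized line.

    All names and defaults here are hypothetical; adjust them to the
    log file being inspected.
    """
    # Binary mode: sizes are true byte counts, and a malformed
    # character in the log cannot abort the scan.
    with open(path, "rb") as log:
        for line_number, raw in enumerate(log, start=1):
            size = len(raw)
            if size > threshold_bytes:
                # Decode only the start of the line, which normally holds
                # the timestamp, level, and logger name.
                preview = raw[:preview_chars].decode("utf-8", errors="replace").rstrip()
                yield line_number, size, preview
```

Iterating over the result of `find_oversized_lines(path_to_log)` and printing each tuple shows where the large entries occur and which logger emitted them, which may help correlate their timing with the onset of the API outage.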
