This repo is intended to be a quick documentation and report about the work done to implement an inactive resource optimization policy in CrownLabs.
- Inactive detection policy (initially based on Tenant last login, could be extended to instance metrics such as workload or ingress/proxy access)
- Tenant notification when an inactive instance is detected (email)
- Instance pause/deletion after max inactive time
- (opt.) Adjust pod resource limits based on the actual consumptions
- (opt.) Live migration for resource consolidation among worker nodes
31/03 - 06/04
: First architecture alternatives07/04 - 13/04
: Prometheus-stack installed; Achieved good capability to interact with PromQL, setting custom Alert rules;14/04 - 20/04
: Definition and tests of local development environment21/04 - 27/04
: Local deployment enviroment testing28/04 - 04/05
: Developed a demo implementation of theInstanceInactiveController
.05/05 - 11/05
: Refined logic for the termination mechanism12/05 - 18/05
: Implementation of Prometheus checks forInstanceInactiveController
, implementation of mail notification service19/05 - 25/05
: Implementation tests26/05 - 01/06
: Implementation tests26/05 - 01/06
: Implementation of new Grafana Dashboard for Crownlabs resource usage and consumption02/06 - 08/06
: Implementation bastion SSH checks09/06 - 15/06
: Fixes after Review comments, Refined Grafana Dashboard16/06 - 22/06
: Fixes after Review comments, Refined Grafana Dashboard