Design, implement & own end-to-end observability solutions using tools to ensure
comprehensive system visibility to improve reliability, architect highly resilience systems.
Advocate for observability best practices across engineering teams and integrate monitoring
into Infrastructure & applications.
Develop automation for infrastructure to reduce manual toil, ensure reliability and optimize resource utilization through performance analysis, AI abnormally detection and dynamic adjustments.
Mentor observability team and foster a culture of continuous improvement and innovation.
Work with technical partners, exploring tools/features PoC, manage licenses, and conducting training sessions.