**Job Title**

**Site Reliability Engineer**

**Your role:**

+ Design and scale **observability frameworks** (metrics, logs, traces, event streams) across cloud environments
+ Define and manage **SLIs/SLOs** to ensure high availability, performance, and reliability
+ Build **proactive, AI-driven monitoring systems** to detect anomalies and predict failures
+ Develop **automation and self-healing capabilities** to reduce manual intervention and improve system resilience
+ Enable **event-driven operations**, integrating with tools like **ServiceNow, PagerDuty, and Slack**
+ Collaborate with **engineering, SecOps, and FinOps teams** to improve reliability, security, and cost efficiency
+ Drive continuous improvement through **incident analysis, performance tuning, and reliability enhancements**

**You're the right fit if:**

+ **SRE & Cloud Experience:** 8+ years in SRE/Cloud/Platform Engineering with **AWS production environment experience**
+ **O...