Join a skilled team as a Senior Site Reliability Engineer, applying your expertise in Azure Kubernetes Service and observability tools such as Dynatrace and Splunk to improve system reliability and performance.

In this role, you will design observability-as-code solutions using Terraform to build monitoring pipelines and dashboards. You will deliver real-time performance insights, troubleshoot complex production incidents, and automate operational tasks to build resilient systems. You will also collaborate with cross-functional teams to maintain service reliability.

Key Responsibilities:
• Design observability-as-code solutions with Terraform
• Drive improvements using Dynatrace, ELK, and Splunk
• Instrument applications for comprehensive observability
• Troubleshoot complex incidents in production environments
• Lead incident response and blameless postmortems