Elevate reliability standards at Confluent as a Senior Site Reliability Engineer. Focus on proactive reliability improvements within a multi-cloud streaming platform while managing incident response practices.
In this senior role, you'll devote 75% of your time to engineering, improving tooling, analyzing failure patterns, and designing solutions. The remaining 25% involves teaching and coordinating incident response enhancements, coaching teams, and driving organizational changes in reliability practices. Your expertise will help minimize incidents across Confluent Cloud's dynamic environment.
Key Responsibilities: • Analyze failure patterns for proactive reliability design • Own configuration of Rootly and integrations with key tools • Define and maintain SLO/SLA frameworks • Edit customer-facing incident documents for quality • Develop training programs and coach teams through post-mortems