Administer and support AWS infrastructure, including EKS clusters, Elastic Beanstalk environments, EC2, S3, CloudFront, RDS, and IAM, across a multi-account environment
Support GCP workloads, primarily BigQuery pipelines and related data infrastructure
Maintain and troubleshoot self-hosted applications including Airflow, Grafana, and other internal tooling
Support and maintain CI/CD pipelines in GitHub Actions, including self-hosted runners
Participate in incident response processes: triage, root cause analysis, remediation, and post-incident documentation
Deploy, configure, and maintain logging, monitoring, and alerting tooling (Grafana stack: Loki, Tempo, Mimir, Alloy; CloudWatch; OneUptime)
Manage infrastructure-as-code using Terraform
Participate in the on-call rotation, providing support across US and...