What This Site Reliability Engineer Owns
- Own recurring reliability workflows tied to monitoring, alerting, deployment safety, and production hygiene.
- Investigate incidents, document root causes, and improve follow-up after outages or high-severity events.
- Support infrastructure changes with stronger change management and rollback discipline.
- Maintain internal documentation around environments, service dependencies, and operational playbooks.
- Bridge engineering, DevOps, and support teams when production issues affect customers.