Sr. Site Reliability Engineer (13346)
Key Duties and Responsibilities
- Collaborating with engineers and stakeholders to resolve complex infrastructure, build, and packaging challenges across cloud and on-prem environments.
- Partnering with development teams to design and implement automated pipelines for continuous delivery and deployment.
- Leading and advocating for SRE best practices, fostering a culture of reliability and operational excellence.
- Improving the predictability and reliability of software releases, workflows, and operational systems.
- Reducing complexity and streamlining delivery by developing and promoting reusable code, tooling, and solution patterns.
- Serving as an expert in incident response, quickly identifying and resolving system incidents to maintain high availability.
- Measuring, monitoring, and analyzing system metrics and alarms to ensure performance and reliability, and using that data to guide decisions.



