Sr. Site Reliability Engineer (13346)

Key Duties and Responsibilities

  • Collaborating with engineers and stakeholders to resolve complex infrastructure, build, and packaging challenges across cloud and on-prem environments.
  • Partnering with development teams to design and implement automated pipelines for continuous delivery and deployment.
  • Leading and advocating for SRE best practices, fostering a culture of reliability and operational excellence.
  • Improving the predictability and reliability of software releases, workflows, and operational systems.
  • Reducing complexity and streamlining delivery by developing and promoting reusable code, tooling, and solution patterns.
  • Serving as an expert in incident response, quickly identifying and resolving system incidents to maintain high availability.
  • Measuring, monitoring, and analyzing system metrics and alarms to ensure performance and reliability, using a data-driven approach for decision-making.
    0
    Your Backpack
    Your backpack is empty