✨ About The Role
- The Site Reliability Engineering Manager will lead and grow Zoox's Core Site Reliability Engineering team.
- Responsibilities include establishing and driving Service Level Objectives (SLOs) and monitoring strategies across core infrastructure.
- The role involves partnering with engineering teams to architect scalable solutions and improve system reliability.
- The manager will champion infrastructure automation and build robust observability solutions.
- Overseeing incident response and building resilient on-call processes to support business-critical services is a key responsibility.
âš¡ Requirements
- The ideal candidate will have over 5 years of experience in Site Reliability Engineering (SRE), DevOps, or similar technical roles.
- A strong background in distributed systems and cloud infrastructure is essential for success in this position.
- The candidate should possess at least 3 years of people management experience, demonstrating the ability to lead and mentor a team effectively.
- Familiarity with modern observability tools and practices is crucial for improving system reliability and performance.
- The successful candidate will have a proven track record of enhancing system reliability and performance in previous roles.