Senior Site Reliability Engineer
Job description
The Senior Site Reliability Engineer role at Google focuses on combining software and systems engineering to build and maintain large-scale, fault-tolerant systems. The team ensures that Google Cloud's services have the necessary reliability and performance while driving production excellence through sound SRE principles. Engineers in this role will collaborate with a global team to tackle complex challenges and support groundbreaking AI/ML tools. The culture emphasizes intellectual curiosity, problem-solving, and mentorship.
Requirements
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 5 years of experience with software development in one or more programming languages.
- 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems.
- 2 years of experience leading projects and providing technical leadership.
- 2 years of experience building and architecting production-quality machine learning systems.
Responsibilities
- Engage in and improve the whole lifecycle of services from inception and design to deployment, operation, and refinement.
- Support services before they go live through system design consulting, developing software platforms, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through automation and evolve systems by implementing changes that improve reliability and velocity.
- Practice sustainable incident response and conduct blameless postmortems.
Benefits
- Google employees are often offered benefits such as comprehensive health insurance, 401(k) matching, and flexible work arrangements.
