Software Engineering Manager, Site Reliability Engineering

Google Inc.
  • US
    Sunnyvale, California, United States

About

Qualifications
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
8 years of experience in software engineering, systems engineering, or site reliability engineering.
5 years of experience building and developing large‑scale infrastructure or distributed systems.
2 years of experience with people management.
Preferred qualifications
Master's degree in Computer Science or Engineering, or a related field.
About the job
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large‑scale, massively distributed, fault‑tolerant systems. The SRE team ensures that Google Cloud’s services, both internally critical and externally visible, stay reliable, meet uptime targets appropriate to customer needs, and improve continuously. SREs also monitor system capacity and performance.
In this role you will manage the complex challenges of scale unique to Google Cloud while applying expertise in coding, algorithms, complexity analysis, and large‑scale system design. You will lead a small team, foster a culture of curiosity and open problem‑solving, and enable the team to take ownership of meaningful projects.
Key responsibilities include:
Managing the scale and availability of next‑generation Workspace GenAI features in collaboration with the Workspace AI SRE team, ensuring model‑based features remain fast, reliable, and degrade gracefully under load.
Operating a newly partitioned Spanner storage topology, completing physical isolation of Spanner allocations per Editor and shard, and managing elastic resource capacity through autoscaling.
Orchestrating large‑scale, multi‑system restore operations for critical customer data, contributing to tools and playbooks that coordinate data recovery across dependencies, validate data correctness, and restore integrity after complex platform incidents.
Directing the resource headroom and efficiency roadmap for the Editors portfolio.
Compensation
US: $207,000 – $301,000 (USD) + 20% bonus target + equity + benefits.
Equal Employment Opportunity
Google is an equal opportunity and affirmative action employer. We are committed to building a workforce representative of the users we serve and to fostering a culture of belonging. Our hiring decisions are made without regard to race, creed, color, religion, gender, sexual orientation, gender identity or expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, criminal history, or any other basis protected by law. Please see our EEO and hiring policies for additional information.
Google is a global company and requires English proficiency for all roles unless otherwise specified.
To all recruiters: Google does not accept agency resumes. Please do not forward resumes for this position.

Languages

  • English