Sr Manager, Site Reliability Engineering
- Milan, Tennessee, United States
About
GROW WITH US:
Tandem Diabetes Care creates new possibilities for people living with diabetes, their loved ones, and their healthcare providers through a positively different experience. We'd love for you to team up with us to "innovate every day," put "people first," and take the "no-shortcuts" approach that has propelled us to become a leader in the diabetes technology industry.
STAY AWESOME:
Tandem Diabetes Care is proud to manufacture and sell the Tandem Mobi system and t:slim X2 insulin pump with Control-IQ+ technology — an advanced predictive algorithm that automates insulin delivery. But we're so much more than that. Our company's human-centered approach to design, development, and support delivers innovative products and services for people who use insulin. Because many of our own team members live with diabetes, or have a loved one impacted by diabetes, the work is personal, and we are committed to the cause. Learn more at
A DAY IN THE LIFE:
The Senior Manager, Site Reliability Engineering (SRE) is a hands-on leader who leads the SRE team and drives operational excellence across the cloud platform. The senior manager defines the vision and strategy for reliability and observability, builds and mentors a high-performing team, and partners closely with engineering, product, and cybersecurity teams to ensure our systems are reliable, scalable, performant, and secure. This role combines strategic vision with technical leadership, ensuring our cloud infrastructure can seamlessly support the organization's growth and innovation goals while strengthening our operations, observability, and incident management capabilities.
Vision & Strategy
- Defines and executes the roadmap for reliability, observability, and operational excellence across our cloud infrastructure.
Team Leadership and Development
- Inspires, coaches, and mentors a team of SREs and engineering managers, cultivating a culture that thrives on ownership, technical depth, and continuous improvement.
- Fosters knowledge sharing and continuous learning through reviews, retrospectives, and technical discussions.
- Drives day-to-day operations of the team, guiding the team through complex challenges, remaining actively engaged in the technical execution and ensuring work is completed in alignment with established standards and processes.
- Supports the short-term planning for the department including headcount, budgeting, training, and systems requirements.
- Participates in the selection, development, performance appraisal, merit recommendation, and promotion of staff.
- Ensures department staff is properly trained, per designated training plan, before assuming job responsibilities.
- Develops and manages schedules and performance requirements of staff.
Observability, Monitoring & Incident Management
- Establishes observability standards, KPIs, and SLOs/SLIs that enable proactive detection and rapid incident resolution.
- Establishes meaningful reliability metrics and dashboards for stakeholder visibility.
- Leads the on-call process, ensuring incidents are well-managed, lessons are captured, and service improvements are implemented to eliminate recurrence.
- Leads and standardizes troubleshooting practices across the team.
- Develops training, runbooks, and frameworks that enable consistent, efficient diagnosis and resolution of complex system issues.
- Drives automation of operational workflows, observability, and incident response processes to reduce toil and improve efficiency.
Production Operations & Reliability
- Oversees all aspects of production operations, ensuring stability, performance, and scalability across cloud environments.
- Leads application release support, partnering with development and release engineering teams to ensure smooth, low-risk deployments.
- Maintains operational integrity through OS patching, infrastructure version upgrades, and lifecycle management.
- Defines, tests, and evolves HA/DR strategies that ensure business continuity.
Security & Compliance
- Partners closely with the Cybersecurity team to safeguard production environments, maintain audit readiness, and meet regulatory standards.
- Provides documentation, evidence, and operational oversight for internal and external audits.
WHEN & WHERE YOU'LL WORK:
Remote: This position is fully remote and open to candidates within the United States. Equipment for the role will be provided and training will occur virtually.
WHAT YOU'LL NEED:
- Proven experience managing and scaling SRE or DevOps teams in cloud-native environments (Azure preferred, AWS, GCP), including hiring, mentoring and cross-functional collaboration.
- Demonstrated expertise in cloud platforms (Azure preferred, AWS, GCP) and infrastructure-as-code practices.
- Strong understanding of distributed systems, scalability, and high-availability architecture.
- Experience establishing and managing SLOs, error budgets, and reliability metrics.
- Proficiency with monitoring, logging, and observability tools (Prometheus, Grafana, New Relic, ELK, Datadog, or equivalent).
- Experience with containerization and orchestration technologies (Docker, Kubernetes).
- Strong incident response and post-mortem process expertise.
- Experience in regulated industries (fintech, healthcare, etc.).
- Experience supporting internal and external audits and maintaining operational readiness.
- Ability to translate business priorities into operational actions, with a continuous-improvement mindset.
- Knowledge of cost optimization and cloud financial management preferred.
EXTRA AWESOME:
- Bachelor's degree in computer science or a related technical field, or an equivalent combination of education and applicable job experience.
- Microsoft Azure certifications (e.g., Azure Administrator Associate, DevOps Engineer Expert) are a plus.
- 8+ years of experience in software, infrastructure, or reliability engineering.
- 3+ years in a manager role leading technical teams.
COMPENSATION & BENEFITS:
The starting base pay range for this position is $164,000 to $190,000 annually. Base pay will vary based on job-related knowledge, skills, and experience, and may also fluctuate depending on the candidate's location and the overall job market. In addition to base pay, Tandem offers a competitive compensation package that includes a bonus and a robust benefits package.
YOU SHOULD KNOW:
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable state and local Fair Chance laws and regulations. A conditional offer of employment from Tandem is contingent upon successful completion of a pre-employment screening process comprised of a drug test (excluding marijuana) and background check, which includes a review of criminal history information.
Tandem has good cause to conduct a review of criminal history information of candidates for this position, as this role may involve access to proprietary, sensitive and/or confidential information, including customer protected health information. This review is required to ensure that individuals in such roles uphold high standards of trust and integrity so as to protect the interests of our customers, employees, and stakeholders.
WHY YOU'LL LOVE WORKING HERE:
At Tandem, we believe joy fuels excellence. That's why we've built a workplace that celebrates your achievements and supports your well-being. Our team thrives on pushing boundaries and fostering growth, all while maintaining a spirit of fun and camaraderie. This is just one of the ways we stay awesome. Explore the benefits and reasons to love Tandem at
BE YOU, WITH US
We embrace the value that every single one of us brings to the table. But sometimes we forget that when we don't meet 100% of a job description's criteria – maybe you're feeling that way right now? We encourage you to apply anyway. Because we want you to be you, with us.
Tandem is firmly committed to being an equal opportunity employer and does not discriminate on the basis of age, disability, sex, race, religion or belief, gender identity or expression, marriage/civil partnership, pregnancy/maternity, or sexual orientation. We are an inclusive organization, and we welcome applications from a wide range of candidates. Selection for roles will be based on individual merit alone.
REFERRALS:
We love a good referral. If you know someone who would be a great fit for this position, please share
APPLICATION DEADLINE:
The position will be posted until a final candidate is selected for the requisition or the requisition has a sufficient number of applications.
Make a move that matters. Join Tandem Diabetes Care, where we're turning challenges into triumphs every day and where your talents will help shape a healthier, happier tomorrow.
Language skills
- English