Über
Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.
Complete the AWS multi-account migration: move production workloads to an isolated account with zero unplanned downtime.
Deliver SOC 2 Type I audit-ready infrastructure evidence package: own the technical controls implementation end-to-end.
Version and publish the Terraform module library: (30+ modules) to a private registry to eliminate ad hoc git consumption by product teams.
Implement automated deployment rollback for ECS and Lambda: gate production on integration test passage.
Stand up monthly cost reporting to leadership: budget anomaly detection, savings plan recommendations, spend by service/team/environment.
Requirements: 5+ years of production AWS infrastructure experience with deep Terraform expertise.
Hands-on experience building the SRE function from scratch and had complete ownership.
Experience with a multi-site company where PaaS or microservices are required.
CI/CD pipeline ownership in one or more previous roles.
PagerDuty experience and standing up an on-call rotation.
5+ years hands-on with AWS, Terraform, CI/CD pipeline ownership, and SRE tooling (OpenTelemetry, Grafana, PagerDuty or equivalent) in a production environment.
Benefits: Base salary is set according to market rates for the nearest major metro and varies based on Launch Potato’s Levels Framework.
Your compensation package includes a base salary, profit-sharing bonus, and competitive benefits.
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.