This job posting is no longer available
Senior Manager – RunOps
remoterocketship
- New York, New York, United States
About
- Execute the enterprise SRE strategy, including SLOs, SLIs, error budgets, and reliability roadmaps.
- Establish reliability standards and practices across the applications, backend services, APIs, data platforms, and AI workloads.
- Drive a culture of reliability-by-design and operational excellence across engineering teams.
- Lead adoption of AIOps capabilities for proactive issue detection, alert noise reduction, and predictive failure prevention.
- Partner with the AI Platform team to integrate LLMs and ML models into operational workflows (log summarization, anomaly detection, remediation).
- Own enterprise observability strategy across metrics, logs, traces, and user experience monitoring.
- Lead enterprise incident response, escalation, and post-incident learning (blameless postmortems).
Requirements:
- Bachelor's degree in Computer Science or equivalent work experience required; one additional year of experience is required for each year of college not attained.
- 10+ years of production support or service delivery experience.
- Experience working with a managed services vendor.
- ITIL qualification and expert knowledge of ITIL disciplines.
- Experience managing third parties and third-party-delivered services.
- Service management or support experience in a large-scale, diverse environment, covering incident management, escalation procedures, and related disciplines.
Benefits:
- Medical, dental, and vision coverage
- Paid time off
- Retirement savings options
- Wellness programs
Language skills
- English
Note for users
This job posting was published by one of our partners. You can view the original posting here.