Cette offre d'emploi n'est plus disponible
Site Reliability Engineer
- Waggaman, Louisiana, United States
- Waggaman, Louisiana, United States
À propos
Marathon Health is a leading provider of advanced primary care in the U.S., serving 2.5 million eligible patients through approximately 630 employer and union-sponsored clients. Our comprehensive services include advanced primary care, mental health, occupational health, musculoskeletal, and pharmacy services, delivered through our 680+ health centers across 41 states. We also offer virtual primary care and mental health services accessible in all 50 states. Transforming healthcare delivery with a patient-first approach, we prioritize convenient access to both in-person and virtual care, resulting in improved health outcomes and significant cost savings. Committed to inclusivity and collaboration, we foster a positive work environment and recruit exceptional talent to ensure expertise and compassion in healthcare delivery. Marathon has been recognized as a five-time Modern Healthcare Best Places to Work in Healthcare winner and a six-time Best in KLAS award winner for employer-sponsored healthcare services.
ABOUT THE JOB
As a Site Reliability Engineer (SRE) for our Ignite Platform, you'll combine software and systems engineering to solve operational challenges, focusing on automation, infrastructure optimization, and system reliability. You'll work closely with a collaborative team to lead projects that improve production stability and scalability. This role offers the opportunity to grow in a supportive environment while driving innovation in cloud operations and DevOps practices.
ESSENTIAL DUTIES & RESPONSIBILITIES
- Develop internal tools and automation to streamline development workflows and reduce product cycle time
- Standardize repositories and automated build/deployment pipelines for scalable cloud hosting
- Manage and secure AWS infrastructure, including EC2 instances and Kubernetes clusters
- Design, implement, and maintain CI/CD pipelines to support automated builds and deployments
- Administer multiple interconnected AWS accounts, networks, and VPCs to ensure secure and efficient operations
- Automate system provisioning and product release processes to enhance deployment reliability
- Create, test, and debug automation scripts across applications, systems, and infrastructure
- Lead incident response efforts, including root cause analysis and blameless post-mortems
- Collaborate with development teams throughout the software lifecycle to ensure reliable releases
- Analyze incident trends and usage data to proactively identify and mitigate potential issues
- Promote and implement self-healing and resiliency patterns across infrastructure
- Conduct performance testing, identified system bottlenecks, and recommended optimizations
- Participate in 24/7 on-call rotations to support production systems and ensure uptime
- Lead and participate in infrastructure and platform efforts that span multiple teams, aligning technical solutions with organizational goals and engineering priorities
- Define and monitor Service Level Objectives (SLOs) and implement observability tooling to proactively detect, diagnose, and mitigate performance or reliability issues
- Create and maintain SOPs, runbooks, and technical documentation. Mentor peers and support team enablement through knowledge-sharing and process clarity
- Collaborate with security and compliance teams to implement IAM policies, secrets management practices, and audit controls across environments, ensuring alignment with frameworks like HIPAA, SOC2, HiTrust, and AWS Best Practices
- Design internal tooling and developer-facing automation to improve productivity, reduce cognitive overhead, and support self-service infrastructure usage
QUALIFICATIONS
Bachelor's degree in systems engineering, Computer Science, or a related field and a minimum of 5 years' experience deploying cloud-based applications or developing in cloud environments, or equivalent combination of education and experience. AWS certification required (e.g., DevOps Engineer – Professional, Solutions Architect).
DESIRED ATTRIBUTES
- Familiarity with industry compliance and security frameworks such as HIPAA and SOC 2
- Skilled in developing automated tools, systems, and services across multiple technology domains
- Advanced knowledge of infrastructure components including networking, cloud services, orchestration tools, containerization, compute, and storage systems
- Proficient in implementing service-level changes and diagnosing system components
- Proven experience as a DevOps or Cloud Engineer, ideally in hybrid (public/private cloud) environments
- Strong understanding of AWS
Compétences linguistiques
- English
Cette offre a été publiée par l’un de nos partenaires. Vous pouvez consulter l’offre originale ici.