XX
Site Reliability EngineerMarathon HealthWaggaman, Louisiana, United States

This job offer is no longer available

XX

Site Reliability Engineer

Marathon Health
  • US
    Waggaman, Louisiana, United States
  • US
    Waggaman, Louisiana, United States

About

Marathon Health is a leading provider of advanced primary care in the U.S., serving 2.5 million eligible patients through approximately 630 employer and union-sponsored clients. Our comprehensive services include advanced primary care, mental health, occupational health, musculoskeletal, and pharmacy services, delivered through our 680+ health centers across 41 states. We also offer virtual primary care and mental health services accessible in all 50 states. Transforming healthcare delivery with a patient-first approach, we prioritize convenient access to both in-person and virtual care, resulting in improved health outcomes and significant cost savings. Committed to inclusivity and collaboration, we foster a positive work environment and recruit exceptional talent to ensure expertise and compassion in healthcare delivery. Marathon has been recognized as a five-time Modern Healthcare Best Places to Work in Healthcare winner and a six-time Best in KLAS award winner for employer-sponsored healthcare services.

ABOUT THE JOB

As a Site Reliability Engineer (SRE) for our Ignite Platform, you'll combine software and systems engineering to solve operational challenges, focusing on automation, infrastructure optimization, and system reliability. You'll work closely with a collaborative team to lead projects that improve production stability and scalability. This role offers the opportunity to grow in a supportive environment while driving innovation in cloud operations and DevOps practices.

ESSENTIAL DUTIES & RESPONSIBILITIES

  • Develop internal tools and automation to streamline development workflows and reduce product cycle time
  • Standardize repositories and automated build/deployment pipelines for scalable cloud hosting
  • Manage and secure AWS infrastructure, including EC2 instances and Kubernetes clusters
  • Design, implement, and maintain CI/CD pipelines to support automated builds and deployments
  • Administer multiple interconnected AWS accounts, networks, and VPCs to ensure secure and efficient operations
  • Automate system provisioning and product release processes to enhance deployment reliability
  • Create, test, and debug automation scripts across applications, systems, and infrastructure
  • Lead incident response efforts, including root cause analysis and blameless post-mortems
  • Collaborate with development teams throughout the software lifecycle to ensure reliable releases
  • Analyze incident trends and usage data to proactively identify and mitigate potential issues
  • Promote and implement self-healing and resiliency patterns across infrastructure
  • Conduct performance testing, identified system bottlenecks, and recommended optimizations
  • Participate in 24/7 on-call rotations to support production systems and ensure uptime
  • Lead and participate in infrastructure and platform efforts that span multiple teams, aligning technical solutions with organizational goals and engineering priorities
  • Define and monitor Service Level Objectives (SLOs) and implement observability tooling to proactively detect, diagnose, and mitigate performance or reliability issues
  • Create and maintain SOPs, runbooks, and technical documentation. Mentor peers and support team enablement through knowledge-sharing and process clarity
  • Collaborate with security and compliance teams to implement IAM policies, secrets management practices, and audit controls across environments, ensuring alignment with frameworks like HIPAA, SOC2, HiTrust, and AWS Best Practices
  • Design internal tooling and developer-facing automation to improve productivity, reduce cognitive overhead, and support self-service infrastructure usage

QUALIFICATIONS

Bachelor's degree in systems engineering, Computer Science, or a related field and a minimum of 5 years' experience deploying cloud-based applications or developing in cloud environments, or equivalent combination of education and experience. AWS certification required (e.g., DevOps Engineer – Professional, Solutions Architect).

DESIRED ATTRIBUTES

  • Familiarity with industry compliance and security frameworks such as HIPAA and SOC 2
  • Skilled in developing automated tools, systems, and services across multiple technology domains
  • Advanced knowledge of infrastructure components including networking, cloud services, orchestration tools, containerization, compute, and storage systems
  • Proficient in implementing service-level changes and diagnosing system components
  • Proven experience as a DevOps or Cloud Engineer, ideally in hybrid (public/private cloud) environments
  • Strong understanding of AWS
  • Waggaman, Louisiana, United States

Languages

  • English
Notice for Users

This job was posted by one of our partners. You can view the original job source here.