Senior Machine Learning Site Reliability Engineer • Prima Group • London, England, United Kingdom
About
To help fuel that growth, we need a Senior Machine Learning Site Reliability Engineer to join our Infrastructure team. This team is the beating heart of Prima. You'll be joining over 300 engineers across software development, infrastructure, operations and security. Fueled by curiosity, experimentation and collaboration, you'll help deliver scalable, impactful solutions that shape the future of insurance. Excited to make an impact? Here are the details.
What you'll do
Hands-on Reliability & System Engineering: Design, build, and operate reliable and scalable systems by defining and monitoring SLOs/SLIs, working directly on production infrastructure, and collaborating closely with software engineers on system design and reliability improvements
Automation, Operations & Incident Response: Actively develop automation for infrastructure and operational workflows to eliminate toil and reduce MTTR, participate in and lead incident response, and drive blameless post-incident reviews with concrete follow-ups implemented in code and tooling
Performance, Capacity & Security: Continuously analyze and optimize system performance and cost, provide data, insights, and recommendations to inform capacity planning, and support security best practices through hands-on vulnerability remediation and threat mitigation
What we're looking for
SRE & Cloud Engineering: Hands-on experience with SRE practices in production, strong AWS expertise, Kubernetes, networking, DNS, and Infrastructure as Code (Pulumi preferred, Terraform a plus)
Automation, Software Engineering and MLOps: Demonstrate strong software engineering fundamentals with an emphasis on code quality and maintainability. This includes solid Python proficiency and deep knowledge of the Python ecosystem (testing, debugging, packaging), hands-on experience with PySpark, and a consistent focus on writing clean, well-structured, and maintainable code. Familiarity with MLOps practices such as model registries, model versioning, retraining workflows, and end-to-end deployment lifecycles is also expected
Reliability, Data & Operations: Lead incident response and RCAs, improve system reliability, and engage stakeholders to propose solutions, share learnings, and mentor others
Nice to have
Regulated Environments & Security: Experience operating in highly regulated industries (e.g. Insurance, Banking, Healthcare), managing sensitive data, and supporting secure networking setups, including exposure to security technologies such as Cloudflare
Distributed Systems & Microservices: Strong understanding of microservices architectures, their principles and trade-offs, with the ability to troubleshoot and maintain distributed systems and supporting technologies (RabbitMQ, Kafka, PostgreSQL, Redis)
Observability & Platform Operations: Hands-on experience with Datadog for platform and application monitoring, performance optimisation, and solid fundamentals in database structures and operational troubleshooting, with exposure to systems built in languages such as Rust and Elixir
Grow with us: We may move fast at Prima, but we move together. Get access to learning resources, mentorship and a growth plan tailored to you.
Thrive and perform: Your best work begins when you feel your best. Enjoy private healthcare, gym discounts, wellbeing programs and mental health support.
Think you're a match?
Apply now.
At Prima, we celebrate uniqueness. If you don't meet every requirement but are passionate about this role, we still want to hear from you. Innovation thrives on diverse perspectives. Prima is proud to be an equal opportunity employer. Need accommodations during the process? Email us at . Let's build the future of insurance, together.
Language skills
- English
Note for users
This job posting comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their website.