Ubique Systems
Ubique Systems

System Reliability Engineer

  • +3
  • +4
  • FR
    France
Interesse zeigen
  • +3
  • +4
  • FR
    France

Über

Requirements -

  • Develop software to make infrastructure services self-managing and self-service
  • Deliver continuous service improvement by developing Infrastructure as Code
  • Eliminate manual, repetitive, automatable, tactical tasks that are devoid from value
  • Improve system performance, make effective use of resources, distribute load and reduce latency
  • Identify SLO's (Service Level Objectives) to meet availability and latency objectives
  • Develop pro-active monitoring solutions that alert on symptoms and not just on outages
  • Perform detailed root cause analysis (RCA's) on incidents and outages to prevent future
  • Partner with development teams to improve services via rigorous testing and release procedures
  • Identity technical debt and partner with application teams to build remediation plans
  • Develop standard operationa procedures and produce effective documentation
  • Analyse workloads and devise suitable cloud migration strategies where appropriate
  • Ensure all project / investment workloads are delivered according to plans and budget dafined
  • Liaise with infrastructure Control and IT Risk teams to satisfy internal and extemal audit requests
  • Deputise for team lead when required to do so and act-up accordingly
  • Identify cost saving and optimisation opportunities across the group
  • Build strong working relationships across the organisation
  • Adhere to the core values of the bank

Responsibilities -

  • Perform daily health and compliance checks for all systems as required
  • Ensure all systems are backed up successfully and any issues are promptly resolved
  • Validate monitoring alerts and batch job failures are detected promptly and satisfactorily resolved
  • Ensure sufficient capacity is available to accommodate drive growth
  • Respond to emails sent to the team distribution list / mailboxes in a timely manner
  • Handle incidents and requests with efficiency and a "customer first mindset
  • Maintain infrastructure in a highly available, reliable, secure and performant manner
  • General Server / Database / Virtualisation Administration maintenance activities
  • Provide technical support to application support and development teams
  • Provide consultancy to application support and development teams
  • Take part in On-Call & weekend work rotation; triaging and addressing production issues as they arise

Wünschenswerte Fähigkeiten

  • Root Cause Analysis
  • Project Management
  • Customer Service
  • Database Administration
  • France

Berufserfahrung

  • Build/Release
  • DevOps
  • Site Reliability (SRE)

Sprachkenntnisse

  • English