XX
Platform EngineerEpsilon Solutions Ltd.Toronto, Ontario, Canada

Cette offre d'emploi n'est plus disponible

XX

Platform Engineer

Epsilon Solutions Ltd.
  • CA
    Toronto, Ontario, Canada
  • CA
    Toronto, Ontario, Canada

À propos

Role: Platform Engineer
Location: Toronto Office / Hybrid
JD
Databricks, Snowflake, SRE, DevOps
The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications and infrastructure. This role blends cloud architecture expertise, operational excellence, and leadership in the dynamic and rapidly evolving cloud landscape.
Key Responsibilities
Design and implement cloud-native architectures to ensure high availability, reliability, and performance of mission-critical systems hosted in the cloud (AWS, Azure).
Drive automation initiatives by developing and implementing Infrastructure as Code (IaC) and automating routine operational tasks such as provisioning, scaling, and deployment processes.
Optimize cloud infrastructure for cost-efficiency, scalability, and performance. Use cloud-native tools to manage resources efficiently and recommend scaling strategies.
Implement and manage cloud monitoring, logging, and alerting solutions to provide actionable insights into system health. Develop dashboards and metrics that help proactively detect potential issues before they affect end-users.
Lead and mentor a team of SRE engineers. Work closely with development, product, and operations teams to ensure the reliability and performance of all cloud-based services.
Proactively plan for capacity management to ensure resources are provisioned optimally as the organization scales. Use monitoring data to make informed decisions on scaling strategies and performance tuning.
Ensure that all cloud systems adhere to security best practices, including proper access controls, encryption, and compliance with regulatory standards.
Work with development teams to implement and improve CI/CD pipelines, enabling rapid and safe deployment of new code changes to production environments. Advocate for DevOps and SRE best practices across the organization.
Monitor cloud usage, costs, and billing. Implement cost optimization strategies without sacrificing performance or reliability, and help teams understand cloud cost implications.
Qualifications
Educational Background:
Bachelor's degree in computer science, Information Technology, Data Science, or a related field.
Technical Skills
Hands-on experience with cloud platforms such as AWS/ Azure. Knowledge of cloud-native services (e.g., compute, storage, networking, security, monitoring).
Strong expertise in Infrastructure as Code (IaC) tools such as Terraform, AWS CloudFormation, or Azure Resource Manager.
Experience with monitoring tools. Ability to design and implement observability solutions to track application performance and infrastructure health.
Proficiency in scripting languages such as Python, Bash to automate infrastructure management.
Solid experience implementing and maintaining CI/CD pipelines with tools such as Jenkins, GitLab etc.
Proven experience leading incident management processes, performing root cause analysis, and driving continuous improvement initiatives.
Analytical Skills
Strong problem-solving skills with the ability to think analytically and automate processes are a must-have. Candidates will be expected to demonstrate these skills during the interview process.
Soft Skills
Excellent communication and teamwork skills, with the ability to collaborate effectively with cross-functional teams.
  • Toronto, Ontario, Canada

Compétences linguistiques

  • English
Avis aux utilisateurs

Cette offre a été publiée par l’un de nos partenaires. Vous pouvez consulter l’offre originale ici.