This job offer is no longer available
Senior Software Engineer
- Vancouver, British Columbia, Canada
About
IREN is a leading AI Cloud Service Provider, delivering large-scale GPU clusters for AI training and inference. IREN's vertically integrated platform is underpinned by its expansive portfolio of grid-connected land and data centers in renewable-rich regions across the U.S. and Canada.
We build, own and operate our data centers, which run on 100% renewable energy, and take pride in being at the forefront of sustainable solutions for the ever-evolving applications of high-performance compute. We believe that human progress is invaluable, but it should be pursued in the right way: responsibly, sustainably, and with a positive impact on the communities we operate in.
As a Senior Software Engineer, you will design, build, deploy, and operate production systems that power IREN's AI services and platform infrastructure.
This is a full-lifecycle engineering role. You will independently own services from API definition and backend implementation through Kubernetes deployment, observability, and production operations. You will work across service boundaries—sometimes writing application code, sometimes improving platform tooling, and sometimes debugging live systems in production.
This role is ideal for engineers who are comfortable operating across backend systems, distributed infrastructure, and cloud-native platforms, and who take responsibility for the reliability and performance of the systems they build.
Requirements
End-to-End Software Ownership
- Design, implement, and operate production software systems from initial design through long-term maintenance.
- Own services across environments (local, pre-prod, and production), including deployment via GitOps workflows (e.g., ArgoCD).
- Make pragmatic architectural decisions that balance performance, reliability, and maintainability.
- Take responsibility for production health, on-call participation, and continuous improvement of owned systems.
Backend & Service Engineering
- Build high-performance backend services and APIs supporting multi-tenant, distributed workloads.
- Design and evolve API-first interfaces (REST and/or gRPC), including schema design, versioning, and client integration.
- Implement reliable inter-service communication patterns, including retries, backpressure, idempotency, and message processing.
- Integrate with data stores, message queues, and streaming systems as required by application workflows.
Platform & Cloud-Native Deployment
- Deploy and operate services on Kubernetes using modern cloud-native practices.
- Define and maintain infrastructure using Infrastructure-as-Code (Terraform or equivalent).
- Work within CI/CD pipelines to enable safe, repeatable builds and deployments.
Reliability, Performance & Observability
- Instrument services with metrics, logs, and tracing to ensure operational visibility.
- Diagnose and resolve issues across application, infrastructure, and networking layers.
- Participate in incident response, root-cause analysis, and post-incident improvements.
- Continuously improve system performance, scalability, and resource efficiency.
Languages
- English