Staff Software Engineer I - Confluent Compute PlatformIBM • Lowell, Massachusetts, United States
Dieses Stellenangebot ist nicht mehr verfügbar
Staff Software Engineer I - Confluent Compute Platform
IBM
- Lowell, Massachusetts, United States
- Lowell, Massachusetts, United States
Über
Your Role And Responsibilities As a Software Engineer on the Compute Platform Team, you will be a Key Technical Leader in building and evolving our next-generation, multi-tenant, cloud-native compute substrate that powers all of Confluent Cloud’s diverse workloads. Our Platform orchestrates workloads across thousands of Kubernetes clusters globally across all cloud service providers, providing a unified abstraction layer for scheduling, lifecycle management, and operational excellence. You'll work on critical systems including:
Multi-Cluster Workload Orchestration: Build the control plane that manages workload placement, lifecycle, and state across multiple Kubernetes clusters per region.
Platform APIs & Abstractions: Design and evolve APIs that provide clean abstractions for polyglot workload management across diverse compute needs.
Cloud Platform Integration: Build and optimize deep integrations with the broader Confluent Cloud platform for seamless end-to-end operations.
Multi-Tenancy & Security: Implement and enhance workload isolation, network policies, and secure execution environments.
Observability & Operations: Drive operational excellence through monitoring integration, automated health checks, and self‑healing capabilities.
What You Will Do
Drive the overall technical charter for the Compute Platform, including multi-cluster orchestration, workload placement, and security architecture.
Design and implement platform APIs and Kubernetes operators using Go to support evolving workload requirements.
Work closely with product management and engineering leadership to build and drive the roadmap for Confluent’s Compute Platform, enabling new business opportunities.
Deliver high-impact initiatives in areas such as workload scheduling, disruption management, network isolation, rolling‑updating strategies, and cross‑cluster resource management.
Lead technical design reviews and drive architectural decisions across organizational boundaries.
Mentor and grow other engineers on the team through code reviews, pairing, and technical guidance.
Own operational aspects including availability, reliability, performance monitoring, emergency response, and disaster recovery for our global compute infrastructure.
This job can be performed from anywhere in the U.S.
Preferred Education Master’s Degree
Required Technical And Professional Expertise
8+ years of experience delivering scalable software solutions.
Proven track record of leading the delivery of large‑scale, highly available, low‑latency systems.
Deep expertise in Kubernetes, including controller development, operator patterns, and multi‑cluster architectures.
Strong proficiency in Go with experience building production‑grade distributed systems.
Experience with multi‑tenant platform architectures and security isolation patterns.
Preferred Technical And Professional Experience
Familiarity with gRPC, Protobuf, and API design for internal platform services.
Experience with observability tools and operational excellence practices.
Experience with multi‑cloud environments (AWS, GCP, Azure) and cloud‑provider integrations.
Track record of providing technical leadership and mentorship.
Track record of working collaboratively across teams including product management, SRE, and other engineering teams.
A smart, humble, and empathetic attitude with a strong sense of teamwork.
Drive and excitement about the challenges of a fast‑paced, innovative software environment.
#J-18808-Ljbffr
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.