Zurück zur Stellenangebote
XX
Senior Technical Lead - DevOps, Python, KubernetesHCLTechUnited States

Dieses Stellenangebot ist nicht mehr verfügbar

XX

Senior Technical Lead - DevOps, Python, Kubernetes

HCLTech
  • US
    United States
  • US
    United States

Über

Senior Technical Lead - DevOps, Python, Kubernetes
Santa Clara, California Job Summary
We are seeking an experienced Data Services Lead Engineer to own the technical direction, architecture, and operational excellence of our data platform. This role requires deep expertise in Cassandra, ZooKeeper, and Consul operations, strong leadership skills, and a passion for building robust, scalable distributed data systems. You will guide the team on best practices, lead complex technical projects, and act as the primary escalation point for data-platform-related issues. The team is also responsible for ZooKeeper, Consul, LDAP, PostgreSQL, and Qpid. Key Responsibilities
Lead the design, architecture, and implementation of highly available, scalable, and performant distributed data stores (including Cassandra and PostgreSQL) across cloud and OnPrem environments. Define and drive the technical roadmap and strategy for the persistence services layer within Apigee Edge Data Services. Lead incident response and management with clear communication. Lead comprehensive post-mortem analyses for production incidents to identify root causes, document findings, and drive the implementation of preventative measures across the data platform. Lead vulnerability management initiatives, including the execution of regular version and security upgrades for all supported data services. Establish and enforce best practices for distributed systems data modeling, capacity planning, performance tuning, security, and disaster recovery. Develop and improve automation for cluster provisioning, configuration management, and upgrades. Serve as the primary technical escalation point for complex production issues, including root cause analysis. Mentor and provide technical guidance to other engineers across the organization. Collaborate with Engineering, SRE, and Support teams to align the data layer with platform requirements. Drive continuous improvement initiatives to enhance reliability and maintainability. Participate in the team's on-call rotation for production support. Skill Requirements
7+ years of experience managing large-scale, mission-critical distributed data systems (e.g., Cassandra, ZooKeeper) in a production environment. Understanding of Consul for service discovery and configuration management. Deep understanding of distributed system architectures, data modeling, internals, and performance tuning. Proficiency in Linux environments and scripting languages (e.g., Python, Bash). Experience with infrastructure-as-code tools (e.g., Terraform). Experience with monitoring and alerting systems (e.g., Prometheus, Grafana, Cloud Monitoring). Experience working in cloud environments (GCP, AWS, etc.) Compensation and Benefits
A candidate's pay within the range will depend on their skills, experience, education, and other factors permitted by law. This role may also be eligible for performance-based bonuses subject to company policies. In addition, this role is eligible for the following benefits subject to company policies: medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance; employee assistance program; 401(k) retirement plan; 10 days of paid time off per year (some positions are eligible for need-based leave with no designated number of leave days per year); and 10 paid holidays per year.
  • United States

Sprachkenntnisse

  • English
Hinweis für Nutzer

Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.