Senior DevOps Engineer
BabylonChain
- New York, New York, United States
- New York, New York, United States
About
What You’ll Do You’ll be wearing many hats, doing a fusion of DevOps, Site Reliability and Platform engineering. Your responsibilities include:
Own the onboarding process of services to production infrastructure, including:
Producing Proof of Concepts for the deployment of blockchain networks and dApps
Identifying hosting needs and rightsizing infrastructure for high availability
Planning and executing performance benchmarking strategies and load-testing to identify bottlenecks
Owning CDN caching, DNS and security configuration to achieve low latency
Strive for high uptime and swift incident resolution for live services:
Set up fine-grained monitoring and alerting rules
Enforce thoroughly tested disaster recovery procedures
Lead incident response procedures by being part of the DevOps on‑call rotation scheme
Operate scalable Kubernetes clusters and several in‑house platform offerings, including:
Bitcoin, Cosmos SDK, ETH and ZK blockchain infrastructure
Databases (Redis, MongoDB, MariaDB) and queuing systems (RabbitMQ)
Observability stacks (Prometheus, Grafana, Grafana Loki, Promtail, Sentry)
Collaborate closely with other teams to drive bug identification and resolution, as well as advocate for the adoption of production‑ready practices.
Requirements
Bachelor’s degree in Computer Science, Computer Engineering, or a related field.
2+ years of experience in operating secure Cosmos SDK and Bitcoin infrastructure.
It's a big plus for candidates to have production‑grade experience on Ethereum, Solidity smart contracts and Zero Knowledge systems.
2+ years of experience in operating highly available containerized web application systems on Kubernetes (web‑3 native systems are a plus).
Strong Top‑3 cloud management experience (AWS preferred).
Experience in Kubernetes application packaging (Helm Charts) and deployment (fluxcd, GitHub Actions).
Proficiency in Linux management and scripting.
Experience in Terraform scripting.
Experience in operating open‑source observability tooling (Prometheus, Grafana, Loki).
Ability to manage competing priorities and respond swiftly to incidents.
Fluent oral and written communication in English.
#J-18808-Ljbffr
Languages
- English
Notice for Users
This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.