Senior Site Reliability Engineer / Senior DevOps EngineerDjinni • New York, New York, United States
Senior Site Reliability Engineer / Senior DevOps Engineer
Djinni
- New York, New York, United States
- New York, New York, United States
Über
|
|| --- | --- || Linux | 5 years || RHEL | 5 years || DevOps | 5 years || Kubernetes | 4 years || Docker | 4 years ||
|
|| --- | --- || Terraform | 3 years || CI/CD | 4 years || On-Premise Infrastructure | 4 years |## Required languages|
|
|| --- | --- || English | C1 - Advanced || Ukrainian | Native |Published 6 February15 views·1 applicationTo apply for this and other jobs on Djinnior.We are seeking a **Senior DevOps Engineer** to join the Release Management team. Release Management is the backbone of the product delivery, responsible for the design, installation, upgrade, and L3/L4 support of our entire product line, including Amelia (RPM & Cloud/K8s) and Autonomics.In this role, you will not just be a body in a seat; we are looking for "brilliant brains" to help us scale. You will adopt our "1Click" philosophy—if a task needs to be done more than twice, you will automate it.**Your future tasks:*** **Infrastructure & Cloud Management:**
+ Manage and support installations across hybrid environments, including DSaaS (Dedicated SaaS), On-Premise, and Public Cloud (AWS, GCP, Azure, OCI).
+ Administer and maintain Kubernetes clusters (EKS, GKE, AKS) and Docker-based deployments.
+ Perform L3/L4 System Administration on Linux environments (Scientific Linux, RHEL 7/8/9), ensuring OS patching, security, and upgrades.* **Automation & CI/CD:**
+ Develop and maintain Ansible playbooks and Terraform scripts to automate the spin-up of test infrastructure and product installation.
+ Manage CI/CD pipelines using Bamboo and Bitbucket to execute automated "1Click" upgrades and installations.
+ Script and automate release management processes, ensuring code upgrades are passed smoothly from R&D to production.* **Database & Application Support:**
+ Manage and support backend technologies including Percona (MySQL v8), Redis, OpenSearch, RabbitMQ, and HAProxy.
+ Oversee the deployment and maintenance of monitoring stacks, specifically ELK (Elasticsearch, Logstash, Kibana), Grafana, Prometheus, and Zabbix.
+ Support specialized telephony infrastructure components like Jambonz (open-source voice platform) and Freeswitch.* **Release Management & Reliability:**
+ Execute Release Management (RM) processes, creating client-specific git repositories for inventory configurations, certificates, and overrides.
+ Oversee automated backup and restore procedures (using S3, Minio, etc.) and ensure Disaster Recovery readiness.
+ Monitor upgrade success/failure rates via Jira and Slack integrations, intervening immediately to remediate exceptions.* **Client Success & Documentation:**
+ Provide expert-level "White Glove" support during partner installs and upgrades, offering real-time troubleshooting.
+ Create and maintain easily consumable documentation in Confluence for both internal teams and external partners.**What we expect from you:*** **Linux Expertise:**Expert-level knowledge (L3/L4) of Linux administration (RHEL/CentOS family).* **Automation Skills:**Proven experience with Ansible (playbooks) and Terraform for Infrastructure as Code.* **Container Orchestration:**Strong experience with Kubernetes (K8s) and Docker in production environments.* **CI/CD Tools:**Proficiency with Bamboo, Git, and Bitbucket for version control and deployment pipelines.* **Database Management:**Experience supporting MySQL (Percona XtraDB Cluster), Redis, and familiarity with replication strategies.* **Web & Proxy:**Experience configuring and managing Nginx, Apache, and HAProxy.* **Scripting:**Proficiency in Shell scripting (Bash) and familiarity with Python or Java.**Prefered qualifications:*** Experience with Voice/Telephony technologies (SIP, Freeswitch, Jambonz).* Familiarity with ELK Stack and Zabbix for monitoring and logging.* Experience in a "Hybrid" software environment (supporting both SaaS and On-Premise installations).* A mindset of "Don't break my stuff"—prioritizing stability and proactive testing (Eddie load testing) before deployment.* You believe that "Today's latest-and-greatest is often tomorrow's floppy disk," and you are constantly re-evaluating technology stacks (e.g., migrating from CentOS to RHEL 9).* You communicate effectively, capable of working with Delivery teams, R&D, and external Partners.**We offer:*** Remote-first work environment;* Collaborative and motivated team;* Impactful work improving patient treatment workflows;* Professional growth with modern technologies;* Autonomy and ownership of your work;* Competitive compensation;* Opportunity to contribute to future product phases.* **Only
from
5 years of experience*** **Full Remote*** **United States**
Countries where we consider candidates* + **English C1 - Advanced**
+ **Ukrainian Native** #J-18808-Ljbffr
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.