Dieses Stellenangebot ist nicht mehr verfügbar
DevOps Engineer IV- 4P/702
4P Consulting Inc
- Atlanta, Georgia, United States
- Atlanta, Georgia, United States
Über
Client- Southern Company Services.
Job Summary We are seeking an experienced DevOps Engineer IV / Site Reliability Engineer (SRE) with strong hands‑on experience in observability, telemetry, monitoring, and service reliability . The ideal candidate will have deep knowledge of Grafana, OpenTelemetry (OTEL), PromQL, and application/system instrumentation .
This role will partner with engineering, operations, and application teams to improve service reliability, telemetry quality, alerting maturity, and operational visibility across complex environments.
Key Responsibilities
Design, implement, and support monitoring and observability solutions.
Build dashboards, alerts, and telemetry solutions using Grafana and related tools.
Implement OpenTelemetry standards for application and system instrumentation.
Write and optimize PromQL queries for monitoring and reliability insights.
Improve alerting quality, reduce noise, and create actionable alerts.
Troubleshoot application and infrastructure issues using logs, metrics, and traces.
Support incident response, root cause analysis, and reliability improvements.
Collaborate with engineering, operations, and application teams.
Required Qualifications
Strong experience as a DevOps Engineer, SRE, Observability Engineer, or similar role.
Hands‑on experience with Grafana, OpenTelemetry, and PromQL.
Experience with application and system instrumentation.
Strong understanding of logs, metrics, traces, alerting, and service reliability.
Ability to design monitoring solutions across complex environments.
Strong troubleshooting, analytical, communication, and collaboration skills.
Preferred Qualifications
Experience with Prometheus, Loki, Tempo, Kubernetes, containers, cloud platforms, or microservices.
Familiarity with CI/CD, automation, infrastructure‑as‑code, incident response, SLIs, SLOs, and reliability metrics.
Key Skills DevOps, SRE, Observability, Grafana, OpenTelemetry, OTEL, PromQL, Prometheus, Monitoring, Alerting, Logs, Metrics, Traces, Instrumentation, Incident Response, Root Cause Analysis.
#J-18808-Ljbffr
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.