Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Analytics Observability Engineer
HPC Observability Engineer
EIT Professionals CorpNew York22 hours ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Direct message the job poster from EIT Professionals Corp Role: HPC Observability Engineer
Principal, Cloud Engineer - Observability
Fidelity InvestmentsTrophy ClubJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsWestlakeJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsIrvingJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsMerrimackJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsFlower MoundJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsFort WorthJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsPelhamJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsEulessJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsHudsonJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsDentonJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Principal, Cloud Engineer - Observability
Fidelity InvestmentsHighland VillageJob Description:Note: Fidelity will not provide immigration sponsorship for this position.OverviewDo you want to work on cutting-edge cloud technologies that power the next generation of enterprise-sc
Observability: Software Engineer ( W2 Contract)
MCS Group - USA | Your Specialist Recruitment FirmNew YorkObservability: Software Engineer (W2 Contract) Get AI-powered advice on this job and more exclusive features.MCS Group - USA | Your Specialist Recruitment Firm This pay range is provided by MCS Group
Remote Lead OpenTelemetry Engineer — Observability
Everest TechnologiesNew YorkEverest Technologies is seeking a highly skilled Lead OpenTelemetry Developer for a remote position focused on developing and maintaining OpenTelemetry-based solutions. The role includes tasks like im
Cloud Software Engineer - Observability Platform
ClickhouseNew YorkCloud Software Engineer - Observability Platform United States (remote)About ClickHouse ClickHouse is a private cloud company focusing on real-time analytics, data warehousing, observability, and AI w
Sr. Network Engineer- Network Observability
VisaUnited StatesAbout Us Visa is a world leader in payments technology, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territ
Sr. Specialty Solutions Engineer - Observability
AHEADUnited StatesJob DescriptionJob DescriptionAHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises de
Senior Observability Engineer – NRQL & SRE Strategy
EPAM Systems, Inc.New YorkEPAM Systems, Inc. is seeking a Senior Observability Engineer to resolve technical monitoring issues in New Relic and drive observability practices across platforms. This strategic role will develop c
Senior Detection Engineer (SIEM / Security Observability)
Keeper Security, Inc.Saint PaulSenior Detection Engineer (SIEM / Security Observability)Remote, US Description Keeper Security is seeking a Senior Detection Engineer to advance detection engineering, SIEM operations, and security t
Senior Solutions Engineer - Observability & AI-Native
DynatraceSan FranciscoDynatrace in San Francisco is seeking a Lead Solution Engineer who will play a pivotal role in supporting the sales team with advanced technical expertise. This role demands excellent communication sk
Backend Engineer - AI Observability & Real-Time Data
FeedinkooUnited StatesFeedinkoo is seeking a backend engineer to help build infrastructure for cutting-edge AI development tools. You will design features that deliver insights into LLM usage and develop data pipelines for
Observability Engineer — Remote (Prometheus/Grafana/Datadog)
Bright-Vision-TechnologiesNew YorkBright-Vision-Technologies is seeking an Observability Engineer to design and operate observability platforms fully remote. The ideal candidate has strong experience with Prometheus, Grafana, and Data
Backend Engineer - AI Observability & Real-Time Data
FeedinkooRichmondFeedinkoo is seeking a backend engineer to help build infrastructure for cutting-edge AI development tools. You will design features that deliver insights into LLM usage and develop data pipelines for
Frontend Engineer Media Infra Systems Observability L4
NetflixUnited StatesAt Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dre
Senior Software Engineer- Observability and Reliability Platform Engineering (REMOTE)
GEICOColorado SpringsSenior Software Engineer - Observability and Reliability Platform Engineering (REMOTE)Join to apply for theSenior Software Engineer - Observability and Reliability Platform Engineering (REMOTE)role at
HPC Observability Engineer
- New York, New York, United States
- New York, New York, United States
Über
Location: Remote Contract Description:
The client has Grafana and InfluxDB services running on K8S in-house on-premises. Telegraf is used to ingest data from a GPU HPC cluster into InfluxDB. This engineer will help collect and visualize data for the “Terra” platform. The HPC Observability Engineer should have experience in: Setting up and maintaining Grafana dashboards for HPC environments Creating drill-down dashboards for servers, including metrics like memory, network, and CPU utilization Exploring and utilizing out-of-the-box metrics from InfluxDB Writing Python scripts for data ingestion into InfluxDB with examples Developing a proof of concept with a simple Python script to monitor load Ingesting Infiniband packet data Monitoring LSF jobs in various states Visualizing server-specific and cluster-wide metrics in Grafana Optional: Integrating third-party plugins like DDN’s Lustre, Mellanox fabric, etc. Qualifications and Skills:
B.Tech, MS, or PhD in Computer Science or related field 5-8 years of experience with Grafana, InfluxDB, and Telegraf Experience in Python and Bash scripting is a plus Knowledge of Docker and Google Cloud Platform is advantageous HPC operations experience is beneficial Strong communication skills and ability to work independently Proficiency in requirements analysis and automated testing Ability to write efficient, secure, and well-documented Python code Experience with Git and pipeline development Awareness of modern security and development practices Responsibilities:
Develop and leverage Grafana dashboards and Telegraf configurations Create dashboards for server and cluster metrics Develop Python scripts for data ingestion and documentation Visualize non-native resources in Grafana Optional: Integrate third-party plugins Maintain high-quality code and documentation Collaborate with teams to troubleshoot and optimize pipelines Desired Skills:
Python (good to have) Bash scripting (good to have) Docker (must) HPC operations and LSF (good to have) Experience with DDN Lustre, Mellanox fabric (good to have) Google Cloud Platform (good to have) Knowledge of Git (must) Seniority level:
Mid-Senior level Employment type:
Contract Job function:
Engineering and Information Technology Industries:
IT Services and IT Consulting This job is active and accepting applications.
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.