About
three-tier NVIDIA switch fabric, including:
- 5400 Series: 400G/800G core fabric
- 3400 Series: 10G management tier
- 2200 Series: 1G distribution tier

- Provide configuration and operational support for Intervision-provided 2200 Series switches.
- Troubleshoot and resolve authentication/access issues on network switches and integrate them into the network fabric.
- Ensure proper physical connectivity and cabling to designated rack zones and cabinet infrastructure.
- Integrate switch telemetry and monitoring with OpenTelemetry (OTEL) and the existing observability platform.
- Support the migration of the observability system from the Development Cluster to the Infrastructure Cluster.
- Monitor network performance, availability, and fabric visibility.

Required Skills
- Strong experience with data center networking and switch configuration
- Experience with NVIDIA networking switches / Spectrum / Cumulus or similar
- Knowledge of high-speed Ethernet networks (10G–800G)
- Troubleshooting of switch access, authentication, and connectivity issues
- Experience with network monitoring, telemetry, and OpenTelemetry
- Familiarity with data center rack architecture and cabling

Preferred Qualifications
- Experience in AI/HPC infrastructure networking
- Knowledge of observability platforms and telemetry pipelines
- Hands-on experience with large-scale fabric deployments

Language Skills
- English