Back to Jobs
XX
GPU Fleet Reliability EngineerfalNew York, New York, United States
XX

GPU Fleet Reliability Engineer

fal
  • US
    New York, New York, United States
  • US
    New York, New York, United States

About

fal is seeking an Operations Engineer to join their team in the United States. This hands-on role involves provisioning and managing GPU nodes in the fleet. You will troubleshoot issues, monitor health, and write documentation to improve operations. The ideal candidate has experience with Linux systems, GPU troubleshooting, and observability tools like Grafana. If you're up for challenges and enjoy automating processes, this position could be for you.
#J-18808-Ljbffr
  • New York, New York, United States

Languages

  • English
Notice for Users

This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.