High-Performance Computing (HPC) Systems Administrator
Massachusetts General Hospital • United States
This job posting is no longer available.
About
Education: Bachelor's degree in a related field of study required.
Licenses and Credentials: Class D Passenger Vehicle Driver's License [State License] - Generic - HR Only preferred.
Experience: Experience in systems/applications administration, 2-3 years required.
Key Responsibilities
Cluster Management:
Oversee the day-to-day operations, maintenance, and optimization of the Martinos Center's HPC cluster, ensuring high availability, reliability, and performance. Perform hardware and software upgrades, patching, and troubleshooting of HPC nodes, storage, and networking.
User Support:
Provide technical support and guidance to researchers and staff using the HPC cluster for computational tasks, such as neuroimaging, machine learning, and data analysis. Assist users with job scheduling, resource allocation, and troubleshooting.
System Monitoring and Performance Optimization:
Develop and implement robust monitoring tools to track resource utilization and identify performance bottlenecks. Analyze workloads and provide recommendations for optimization of computational workflows.
Collaboration and Training:
Collaborate with researchers to understand their computational needs and assist in designing tailored HPC solutions for their projects. Develop training materials and lead workshops to educate researchers on best practices for using the cluster.
Qualifications
Required:
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
- 3+ years of experience in HPC systems administration or equivalent.
- Strong expertise in Linux systems administration (e.g., CentOS, RHEL, Ubuntu) in an HPC environment.
- Experience with job scheduling using Slurm.
- Proficiency in HPC-related programming and scripting languages (e.g., Bash, Python, Perl).
- Familiarity with parallel computing, distributed systems, and scientific computing frameworks.
- Hands-on experience with storage systems, networking, and security in an HPC environment.
- Excellent interpersonal and communication skills to interact with researchers and non-technical staff, and previous experience working with researchers.
- Demonstrated ability to adapt to changing technologies, workflows, and priorities in a dynamic research environment.
- Strong organizational and time-management skills to efficiently manage multiple concurrent projects and tasks.
Preferred:
- Advanced degree in Computer Science, Engineering, or a related field.
- Knowledge of biomedical or neuroimaging applications and related software (e.g., FreeSurfer, FSL, SPM, ANTs, MATLAB).
- Experience with machine learning workflows and GPU-based computing (e.g., PyTorch, CUDA, TensorFlow).
- Familiarity with data-intensive workflows and large-scale storage systems.
The General Hospital Corporation is an Equal Opportunity Employer. By embracing diverse skills, perspectives and ideas, we choose to lead. All qualified applicants will receive consideration for employment without regard to race, color, religious creed, national origin, sex, age, gender identity, disability, sexual orientation, military service, genetic information, and/or other status protected under law. We will ensure that all individuals with a disability are provided a reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment.
Language skills
- English