XX
HPC Solution ArchitectDellHopkinton, Iowa, United States

This job offer is no longer available

XX

HPC Solution Architect

Dell
  • US
    Hopkinton, Iowa, United States
  • US
    Hopkinton, Iowa, United States

About

Senior Principal Software Engineer The Software Engineering team delivers next-generation software application enhancements and new products for a changing world. Working at the cutting edge, we design and develop software for platforms, peripherals, applications and diagnostics — all with the most advanced technologies, tools, software engineering methodologies and the collaboration of internal and external partners.
Join us to do the best work of your career and make a profound social impact as a
Senior Principal Software Engineer
on our
Software Engineering
Team in
Austin, Texas or Hopkinton, Massachusetts.
As a Senior Software Principal Engineer, you will be responsible for developing sophisticated systems and software basis the customer’s business goals, needs and general business environment creating software solutions.
We are hiring a
Senior HPC Solution Architect to design, deploy, and support large‑scale HPC and AI clusters for enterprise, research, and hyperscale customers. This is a hands‑on, customer‑facing Individual Contributor role that blends
Linux systems engineering, cluster lifecycle automation, provisioning frameworks (Omnia/OpenCHAMI), Slurm/Kubernetes
, and deep troubleshooting of production environments. Ideal for strong technical engineers who enjoy solving complex customer problems, contributing to open‑source, and shaping modern HPC deployment practices.
Lead customer architecture & design, translating HPC/AI workload requirements into scalable cluster architectures (compute, schedulers, storage, interconnects) Build and maintain provisioning workflows (OpenCHAMI‑based or equivalent) covering PXE/iPXE boot, cloud‑init, security, and identity/cert operations Serve as Tier‑3 engineering escalation, troubleshooting complex provisioning, scheduling, GPU, networking, and performance issues; perform RCAs and drive permanent fixes Contribute to open source and customer enablement through code contributions, documentation, workshops, runbooks, templates, and field readiness materials
Linux & Automation:
Deep experience with RHEL/Rocky/Ubuntu; hands‑on cluster deployments using open‑source toolchains, Omnia, and OpenCHAMI (composable provisioning, cloud‑init, microservices) proficient with Docker/Podman, OpenTelemetry pipelines, and telemetry instrumentation Networking, Fabrics & Streaming:
Scripting, Monitoring & Customer Engagement:
Strong skills in Ansible, Python, Bash; If you’re looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we’re looking for you.
Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. Read the full Equal Employment Opportunity Policy
here
.
  • Hopkinton, Iowa, United States

Languages

  • English
Notice for Users

This job was posted by one of our partners. You can view the original job source here.