Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Staff Machine Learning Engineer - ML Training Infrastructure
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesStaff ML EngineerThe Role: We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. In
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsWashingtonThe Role We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands‑on technical work. In this role, you wi
Staff Machine Learning Engineer, ML Infrastructure
Unity TechnologiesUnited StatesBellevue, WA, USAStaff Machine Learning Engineer, ML InfrastructureLocationBellevue, WA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2615904Role descriptionThe opportunityUnity Vector build
Staff Machine Learning Engineer, ML Infrastructure
Unity TechnologiesUnited StatesSan Francisco, CA, USAStaff Machine Learning Engineer, ML InfrastructureLocationSan Francisco, CA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2615904Role descriptionThe opportunityUnity Ve
Staff Machine Learning Engineer, ML Infrastructure
Venturefizz Product Management CommunityUnited StatesPrincipal DevOps EngineerWe're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here
Staff Software Engineer, Machine Learning Infrastructure
Work180CaliforniaA home is the biggest investment most people make, and yet, it doesn’t come with a manual. That's why we’re building the only app homeowners need to effortlessly manage their homes — knowing what to d
Machine Learning Infrastructure Engineer
Garuda VenturesPalo AltoLocation Palo Alto Employment Type Full time Location Type On-site Department Software Engineering We’re hiring Machine Learning Infrastructure Engineers to build the systems that make large-scale mod
Machine Learning Infrastructure Engineer
Mind RoboticsUnited StatesMachine Learning Infrastructure EngineerAt Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and reasoning-intensive work in real-world industrial e
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsDoverJob OverviewSenior ML Engineer – ML Training Infrastructure, General Motors. We are seeking an experienced, technical‑oriented, impact‑delivering expert in ML training infrastructure to design and bui
Machine Learning Infrastructure Engineer
Astera LabsUnited StatesAstera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizati
Machine Learning Infrastructure Engineer
Virtual Vocations IncUnited StatesSeeking a fully remote Machine Learning & AI Infrastructure Engineer, the role focuses on designing, deploying, and supporting advanced AI, machine learning, and high-performance computing environment
Senior Machine Learning Infrastructure Engineer
PlusAI, Inc.United StatesPlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus w
Senior Machine Learning Infrastructure Engineer
Morph Inc.United StatesJob TitleGoal: 99.99% uptime We serve custom inference stacks that have irregular GPU load. We're looking for people that have done genuinely amazing work in infrastructure who are interested in a cha
Machine Learning Infrastructure Engineer- Model Inference
AbridgeUnited StatesML Infrastructure Engineer, Model InferenceAs an ML Infrastructure Engineer, Model Inference at Abridge, you'll play a pivotal role in building and optimizing the core inference infrastructure that po
Machine Learning Solutions Engineer (ML + Infrastructure Focus)
Lightning AIUnited StatesMachine Learning Solutions Engineer (ML + Infrastructure Focus)New York, New York, United States; San Francisco, California, United States; Seattle, Washington, United States Who We AreLightning AI is
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
B CapitalSeattleDescription About Slack AI Slack AI's mission is to transform how people work by making Slack an AI-powered operating system. We're tackling significant challenges like unlocking collective knowledge
Sr Machine Learning Engineer- ML Infrastructure & Data Platforms
Dormont Manufacturing CoSan JoseWe’re looking for a Senior Machine Learning Engineer to join our Applied Science Data Frameworks team. In this role, you’ll build the infrastructure that powers large-scale, multimodal AI training and
Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)
Unity TechnologiesUnited StatesBellevue, WA, USAMachine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)LocationBellevue, WA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2616004Role descriptionThe oppor
Staff ML Platform Engineer: Data Infrastructure & Tools
Dormont Manufacturing CoNew YorkDormont Manufacturing Co in New York is looking for a candidate to work on their Data Platform, focusing on building platforms that support the data science development lifecycle. You'll collaborate w
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AINew YorkAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Staff Artificial Intelligence Machine Learning Engineer
General MotorsAustinJob Description General Motors is seeking a Staff AI/ML Engineer for the Vehicle Mechatronic Embedded Controls (VMEC) Analytics team. The team delivers production AI/ML solutions for high‑impact diagn
Staff Machine Learning Engineer - VLM/LLM Evaluation
WaymoUnited StatesWaymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building
Staff Machine Learning Compiler Engineer
RivianUnited StatesAbout RivianRivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract.As
Staff Machine Learning Engineer, AI
SentryUnited StatesAbout SentrySoftware runs the world and the pace is faster than ever. Sentry helps developers fix errors and performance issues before users notice, so teams can spend less time firefighting and more
Staff Machine Learning Engineer - ML Training Infrastructure
- United States
- United States
Über
The Role: We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. In this role, you will be responsible for defining the technical direction and driving the design and development of scalable, reliable, and high-performance AI/ML platform infrastructure that enables advanced AI research and model development at scale. As a Staff ML Engineer, you will operate as a technical leader across initiatives, partnering closely with machine learning engineers, research scientists, and platform teams to shape architecture, drive major technical decisions, and deliver state-of-the-art AI infrastructure that enables the future of intelligent driving technologies across General Motors vehicles. What You'll Do: Define and drive the architecture, design, and development of scalable, reliable, and high-performance ML frameworks and platform capabilities to support model training at scale. Lead model training performance analysis and optimization efforts across distributed training workflows, improving scalability, efficiency, and cost across heterogeneous hardware environments. Raise the bar on system observability, debuggability, operational excellence, and developer experience across the ML training stack. Own large, ambiguous, cross-functional technical initiatives from strategy through execution, including technical roadmap definition, tradeoff analysis, and delivery. Influence platform direction by identifying long-term infrastructure investments, setting engineering standards, and driving adoption of best practices across teams. Collaborate across organizational boundaries to align requirements, resolve technical disagreements, and integrate new capabilities into the platform ecosystem. Mentor engineers through design reviews, technical guidance, and hands-on partnership, while elevating engineering quality across the team. Your Skills & Abilities (Required Qualifications) Bachelor's degree or higher in Computer Science or a related field, or equivalent practical experience. 7+ years of professional software engineering experience. 5+ years of specialized experience in AI/ML infrastructure, such as enabling distributed training for large-scale ML models. Strong programming skills in Python, with deep proficiency in frameworks such as PyTorch (preferred), TensorFlow, or similar ML systems. Proven experience designing and operating distributed systems for ML training, including distributed computing, GPU computing, and cloud environments (AWS, GCP, Azure). Demonstrated track record of leading technically ambiguous, cross-team infrastructure initiatives and driving them to measurable impact. Strong architectural judgment and ability to make sound technical tradeoffs across performance, reliability, usability, and cost. Willingness to travel to Sunnyvale, CA as needed. Comfortable operating in highly ambiguous and dynamic environments. What Will Give You a Competitive Edge (preferred qualifications): Deep expertise in PyTorch 2.x+ and distributed training frameworks. Experience designing and developing training platforms that support FSDP, pipeline parallelism, and other scalable solutions for training large foundational models. Experience profiling, analyzing, debugging, and optimizing training and data loading performance at scale. Strong record of technical leadership through architecture reviews, roadmap influence, and cross-team execution. Excellent communication skills, with the ability to build consensus, navigate controversial decisions, communicate risks clearly, and provide constructive technical feedback. Self-motivated, execution-oriented, and motivated by delivering broad organizational impact. Compensation: The salary range for this role is $185,000 to $335,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position. Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance. Relocation: This job may be eligible for relocation benefits. Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more. Company Vehicle: Upon successful completion of a motor vehicle report review, you will be eligible to participate in a company vehicle evaluation program, through which you will be assigned a General Motors vehicle to drive and evaluate. Note: program participants are required to purchase/lease a qualifying GM vehicle every four years unless one of a limited number of exceptions applies. This role is categorized as remote. This means the selected candidate may be based anywhere in the country of work and is not expected to report to a GM worksite unless directed by their manager.
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.