Staff Machine Learning Engineer - ML Training Infrastructure
- United States
About
The Role:
We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. In this role, you will be responsible for defining the technical direction and driving the design and development of scalable, reliable, and high-performance AI/ML platform infrastructure that enables advanced AI research and model development at scale. As a Staff ML Engineer, you will operate as a technical leader across initiatives, partnering closely with machine learning engineers, research scientists, and platform teams to shape architecture, drive major technical decisions, and deliver state-of-the-art AI infrastructure that enables the future of intelligent driving technologies across General Motors vehicles.

What You'll Do:
- Define and drive the architecture, design, and development of scalable, reliable, and high-performance ML frameworks and platform capabilities to support model training at scale.
- Lead model training performance analysis and optimization efforts across distributed training workflows, improving scalability, efficiency, and cost across heterogeneous hardware environments.
- Raise the bar on system observability, debuggability, operational excellence, and developer experience across the ML training stack.
- Own large, ambiguous, cross-functional technical initiatives from strategy through execution, including technical roadmap definition, tradeoff analysis, and delivery.
- Influence platform direction by identifying long-term infrastructure investments, setting engineering standards, and driving adoption of best practices across teams.
- Collaborate across organizational boundaries to align requirements, resolve technical disagreements, and integrate new capabilities into the platform ecosystem.
- Mentor engineers through design reviews, technical guidance, and hands-on partnership, while elevating engineering quality across the team.

Your Skills & Abilities (Required Qualifications):
- Bachelor's degree or higher in Computer Science or a related field, or equivalent practical experience.
- 7+ years of professional software engineering experience.
- 5+ years of specialized experience in AI/ML infrastructure, such as enabling distributed training for large-scale ML models.
- Strong programming skills in Python, with deep proficiency in frameworks such as PyTorch (preferred), TensorFlow, or similar ML systems.
- Proven experience designing and operating distributed systems for ML training, including distributed computing, GPU computing, and cloud environments (AWS, GCP, Azure).
- Demonstrated track record of leading technically ambiguous, cross-team infrastructure initiatives and driving them to measurable impact.
- Strong architectural judgment and ability to make sound technical tradeoffs across performance, reliability, usability, and cost.
- Willingness to travel to Sunnyvale, CA as needed.
- Comfortable operating in highly ambiguous and dynamic environments.

What Will Give You a Competitive Edge (Preferred Qualifications):
- Deep expertise in PyTorch 2.x+ and distributed training frameworks.
- Experience designing and developing training platforms that support FSDP, pipeline parallelism, and other scalable solutions for training large foundational models.
- Experience profiling, analyzing, debugging, and optimizing training and data loading performance at scale.
- Strong record of technical leadership through architecture reviews, roadmap influence, and cross-team execution.
- Excellent communication skills, with the ability to build consensus, navigate controversial decisions, communicate risks clearly, and provide constructive technical feedback.
- Self-motivated, execution-oriented, and motivated by delivering broad organizational impact.

Compensation: The salary range for this role is $185,000 to $335,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.

Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.

Relocation: This job may be eligible for relocation benefits.

Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

Company Vehicle: Upon successful completion of a motor vehicle report review, you will be eligible to participate in a company vehicle evaluation program, through which you will be assigned a General Motors vehicle to drive and evaluate. Note: program participants are required to purchase/lease a qualifying GM vehicle every four years unless one of a limited number of exceptions applies.

This role is categorized as remote. This means the selected candidate may be based anywhere in the country of work and is not expected to report to a GM worksite unless directed by their manager.
Language skills
- English
This listing comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their site.