Staff Machine Learning Engineer

Waymo

London, England, United Kingdom

Apply Now

About

Waymo is an autonomous driving technology company with the mission to be the world’s most trusted driver. Since its start as the Google Self‑Driving Car Project in 2009, Waymo has focused on building the Waymo Driver, the World’s Most Experienced Driver™, to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride‑hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider‑only trips, enabled by its experience autonomously driving over 100 million miles on public roads across 15+ U.S. states and tens of billions of miles in simulation.
About the Team
The DUE ML Core London team builds and operates scalable machine‑learning systems, simulation workflows, and insight tools designed to improve the evaluation and developer onboarding journeys. By combining expert human judgment with advanced machine‑learning models, we deliver training and evaluation data for hundreds of metrics and components that comprise the Waymo Driver. We are looking for researchers and software engineers passionate about developing ML techniques for evaluation systems and driving performance improvements across our technology stack.
Responsibilities
Build scalable systems for training and fine‑tuning large‑scale generative models to produce and evaluate realistic driving behaviors.
Lead the implementation and iteration of novel Reinforcement Learning (RL) algorithms, reward functions, and training paradigms tailored for generating high‑fidelity driving behaviors.
Lead the development of cutting‑edge Deep Learning and Generative AI (LLM/VLM) solutions to enhance human‑led triaging, automate high‑volume workflows, and detect critical anomalies in driving behavior.
Oversee the production and optimization of ML models used to assess the performance of Waymo's fleet across millions of miles.
Monitor industry trends and Alphabet‑wide research to develop novel data collection and evaluation systems based on Reinforcement Learning from Human Feedback (RLHF).
Partner with Prediction, Planning, and Research teams, as well as senior leadership, to deliver on important strategic efforts.
Qualifications
M.S. or Ph.D. in Computer Science, Machine Learning, AI, or a related technical field (or equivalent practical experience).
7+ years of hands‑on experience applying Machine Learning models, with a specific focus on Reinforcement Learning.
Demonstrated expertise in deep learning, sequence modeling, and generative models.
A strong publication record or a history of impactful project delivery in RL or related areas.
Proficiency in Python and standard ML frameworks (e.g., JAX, TensorFlow).
Experience with large‑scale distributed training and data processing.
Preferred Experience
10+ years of relevant experience in ML/RL research and application.
Experience in autonomous vehicles, robotics, or complex simulation environments.
Familiarity with state‑of‑the‑art RL techniques, specifically for fine‑tuning large models (e.g., RLHF).
Experience integrating large‑scale simulation platforms with ML training workflows.
A track record of technical leadership and influencing senior stakeholders.
Salary Range: £150,000 – £162,000 GBP
Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Engineering and Information Technology
Industries: Technology, Information and Internet

Language skills

English

Note for users

This job posting comes from a TieTalent partner platform. Click “Apply Now” to submit your application directly on their website.
