Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Machine Learning Engineer, LLM Post-Training
Machine Learning Engineer, LLM Post-Training
GoTo MeetingMountain ViewAbout the Role We are looking for a hands‑on Machine Learning Engineer to drive the post‑training of our large language models, with a strong emphasis on reinforcement learning (RL). You will own the
Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training
NetflixUnited StatesResearch Scientist 4 - Machine Learning and Inference Research, LLM Post-TrainingNew York, New York, United States of America • Los Angeles, California, United States of America At Netflix, our missio
Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training
Netflix IncUnited StatesResearch Scientist 4 - Machine Learning and Inference Research, LLM Post-TrainingNew York, New York, United States of America • Los Angeles, California, United States of America At Netflix, our missio
AI / Machine Learning Engineer NLP / LLM
Pro Contract JobsManchesterAI / Machine Learning Engineer NLP / LLM - ContractManchester, UK Job type Full-time Job Description AI / Machine Learning Engineer NLP / LLM - Contract Contract Duration: 6 monthsRate: Up to £500 per
Machine Learning Research Scientist, Post-Training
Scale AISan FranciscoScale works with the industry's leading AI labs to provide high quality data and accelerate progress in GenAI research. We are looking for Research Scientists and Research Engineers with expertise in
Machine Learning Engineer: LLM Interpretability & Systems
CTGTUnited StatesAbout CTGT & The MissionDespite massive investment in commercial AI, organizations often find that demonstrated value is elusive, primarily due to the non-deterministic risk inherent to generative mod
Senior Machine Learning Engineer - VLM/LLM Evaluation
Neura MarketMountain ViewWaymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building
Senior Machine Learning Engineer - GenAI, LLM, RAG
Puritas GroupLondonSenior Machine Learning Engineer - GenAI, LLM, RAG - 12 month contractIm looking for Machine Learning Engineers (and Senior Data Scientists with strong engineering capability) who have real, hands-on
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AINew YorkAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale AIUnited StatesStaff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAISan Francisco, CA; New York, NY AI is becoming vitally important in every function of our society. At Scale, our mission
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AIUnited StatesAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Machine Learning Engineer Perception LLM/VLM (PhD, New Grad)
WaymoUnited StatesMachine Learning Engineer Perception LLM/VLM (PhD, New Grad)Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self
Remote Principal Machine Learning Engineer- LLM Fine-tuning and Optimization
GrabJobsNew YorkAirbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every co
Senior Staff Machine Learning Engineer, LLM/VLM Model Architecture & Optimization
WaymoUnited StatesSenior Staff Machine Learning Engineer, LLM/VLM Model Architecture & OptimizationWaymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its sta
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Machine Learning Engineer
Centaur LabsMountain ViewThe Opportunity Do you want to lead projects to build and deploy cutting‑edge AI technology to help people get unparalleled value from meetings and conversations? Join our core AI team responsible for
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsDoverJob OverviewSenior ML Engineer – ML Training Infrastructure, General Motors. We are seeking an experienced, technical‑oriented, impact‑delivering expert in ML training infrastructure to design and bui
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Machine Learning Engineer
Local InfusionNashvilleWe are Local Infusion. Local Infusion is the fastest growing infusion provider in the United States, with a mission to transform the specialty infusion industry, because patients deserve better. By pr
Senior Machine Learning Systems Engineer
redditNew YorkSenior Machine Learning Systems EngineerRemote - United States Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conv
Machine Learning Engineer (Generative AI)
Applied Materials, Inc.Santa ClaraWe are seeking a research scientist to develop and deploy generative AI models for scientific and materials science applications. Key ResponsibilitiesDevelop, pretrain, fine‑tune, and align large lang
Machine Learning Engineer, LLM Post-Training
- Mountain View, California, United States
- Mountain View, California, United States
Über
Responsibilities
Lead post‑training of our LLMs across the full pipeline:
continuous pre‑training, SFT, and reinforcement learning , with RL as the primary focus (e.g., RLHF, PPO, GRPO, DPO, and related methods).
Design, build, and curate the
data
that drives each training stage — instruction/SFT datasets, preference pairs, reward signals, on‑policy rollouts, and rejection‑sampled completions — and define data‑preparation strategies tailored to specific business needs.
Partner closely with
business and product stakeholders
to understand their scenarios, rapidly convert requirements into training plans, and deliver targeted model capabilities on tight timelines.
Run large‑scale training on
mid‑to‑large GPU clusters , applying distributed‑training techniques (data parallelism, FSDP, and where relevant tensor/pipeline parallelism) and tuning for throughput and stability.
Build and maintain
evaluation and reward/verifier pipelines
to measure model quality, prevent regressions, and ensure training‑serving consistency.
Stay current with post‑training research and turn promising techniques into working, production‑ready code.
Requirements
Hands‑on LLM post‑training experience.
You have personally run CPT, SFT, and RL training — with
demonstrated, practical RL experience
(RLHF / PPO / GRPO / DPO or similar), beyond just launching training scripts.
Strong data engineering for ML.
You can independently design data‑preparation plans for a given business scenario — sourcing, cleaning, filtering, labeling strategy, and synthetic/preference data generation — to meet specific product requirements.
Proven large‑scale GPU training ability.
You have trained LLMs on
mid‑to‑large GPU hardware
and are comfortable with distributed training and debugging at scale.
Strong
PyTorch
fundamentals; working familiarity with frameworks such as Hugging Face TRL/Accelerate, DeepSpeed or FSDP, and inference engines like vLLM.
Solid understanding of tokenization, attention, chat templates, and common failure modes in alignment/agent training.
A bias toward
fast iteration and business impact , with strong communication skills to work across research and product teams.
Preferred Qualifications
Experience designing
reward models or rule‑based verifiers
for RL.
Experience with
tool‑use / agentic
model training (function calling, multi‑step planning).
Publications or open‑source contributions in LLM post‑training or RL.
Benefits We offer a competitive benefits package:
Health, dental, and vision care for you and your family (100% coverage for employee)
Top‑tier 401(K) plan with company matching
Paid time off and paid holidays
FSA, HSA and commuter benefits programs
Team activity budget
The US base salary range for this full‑time position is listed below. Pay may vary based on a number of factors including job‑related skills, level, experience, geographic location and relevant education or training. At NewsBreak, we design our overall rewards package to attract top talents. Depending on the position, the role may also be eligible for discretionary bonus and options. Your recruiter can share more details during the hiring process.
Annual Base Pay Range
$150,000 — $230,000 USD
CPRA Privacy Notice for California Candidates
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.