Machine Learning Engineer - Distributed ML Systems
- United States
About
Pluralis Research carries out foundational research on Protocol Learning: multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of the model. The purpose of Protocol Learning is to facilitate the creation of community-trained and community-owned frontier models with self-sustaining economics.
We're looking for Senior/Staff engineers with 5+ years of experience in distributed systems and large-scale ML training. You'll be implementing a novel substrate for training distributed ML models over consumer-grade internet connections.
Distributed Training Architecture & Optimization
Design and implement large-scale distributed training systems optimized for heterogeneous hardware operating under low-bandwidth, high-latency conditions.
Develop and optimize parallel training strategies (data, tensor, and pipeline parallelism) with custom sharding techniques that minimize communication overhead.
Optimize GPU utilization, memory efficiency, and compute performance across distributed nodes.
Implement robust checkpointing, state synchronization, and recovery mechanisms for long-running, fault-prone training jobs.
Build monitoring and metrics systems to track training progress, model quality, and system bottlenecks.
Decentralized Networking & Resilience
Architect resilient training systems where nodes can fail, networks can partition, and participants can dynamically join or leave.
Design and optimize peer-to-peer topologies for decentralized coordination across non-co-located nodes.
Implement NAT traversal, peer discovery, dynamic routing, and connection lifecycle management.
Profile and optimize communication patterns to reduce latency and bandwidth overhead in multi-participant environments.
What You'll Bring
Strong experience building and operating distributed systems in production.
Hands-on expertise with distributed training frameworks (FSDP, DeepSpeed, Megatron, or similar).
Deep understanding of parallel training strategies (data, tensor, and pipeline parallelism).
Expert-level Python with production experience (concurrency, error handling, retry logic, clean architecture).
Strong networking fundamentals: P2P systems, gRPC, routing, NAT traversal, distributed coordination.
Experience optimizing GPU workloads, memory management, and large-scale compute efficiency.
What We Offer
Equity-heavy compensation with meaningful ownership in a mission-driven company
Competitive base salary for senior engineering roles in Australia
Visa sponsorship available for exceptional candidates
Remote-first with optional access to our Melbourne hub
World-class team: teammates were previously at Google, Amazon, Microsoft, and leading startups
Backed by Union Square Ventures and other tier-1 investors, we're a world-class, deeply technical team of ML researchers and engineers. Pluralis is unapologetically ideological. We believe the world will be a better place if we succeed in what we are attempting, and we see Protocol Learning as the only plausible approach to preventing a handful of massive corporations from monopolizing model development, access, and release, and from achieving massive economic capture. If this resonates, please apply.
Language skills
- English
This listing comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their site.