Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Machine Learning Engineer - Distributed ML Systems
Machine Learning Engineer - Distributed ML Systems
PluralisUnited StatesOverviewPluralis Research carries out foundational research onProtocol Learning : multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of th
Machine Learning Engineer - Distributed ML Systems
Pluralis ResearchUnited StatesSenior/Staff EngineerPluralis Research carries out foundational research on Protocol Learning: multi-participant training of foundation models where no single participant has, or can ever obtain, a fu
Machine Learning Systems Engineer: Distributed Training
Susquehanna International GroupUnited StatesOverviewWe're looking for a Machine Learning Systems Engineer to strengthen the performance and scalability of our distributed training infrastructure. In this role, you'll work closely with researche
Sr. Machine Learning Engineer (Recommendation Systems)
PhiloUnited StatesAt Philo, we're a group of technology and product people who set out to build the future of television, marrying the best in modern technology with the most compelling medium ever invented - in short,
Senior Machine Learning Systems Engineer
RedditUnited StatesSenior Machine Learning Systems EngineerRemote - United StatesReddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conve
Machine Learning Engineer, ML Systems and Infrastructure
AutodeskUnited StatesJob Requisition ID #26WD98119POSITION OVERVIEWThe work we do at Autodesk touches nearly every person on the planet. By creating software tools for making buildings,machines, and even the latest movies
Machine Learning Operations (ML Ops) Engineer
DraxBirminghamMachine Learning Operations (MLOps) EngineerFlexible Location – Ipswich, London or SelbyPermanent, full time Closing date - 2 July 2025 Who we are We’re not just talking about making a difference, we’
Machine Learning Operations (ML Ops) Engineer
DraxLiverpoolMachine Learning Operations (MLOps) EngineerFlexible Location – Ipswich, London or SelbyPermanent, full time Closing date - 2 July 2025 Who we are We’re not just talking about making a difference, we’
Machine Learning Operations (ML Ops) Engineer
DraxImminghamMachine Learning Operations (MLOps) EngineerFlexible Location – Ipswich, London or SelbyPermanent, full time Closing date - 2 July 2025 Who we are We’re not just talking about making a difference, we’
Machine Learning Operations (ML Ops) Engineer
DraxPortsmouthMachine Learning Operations (MLOps) EngineerFlexible Location – Ipswich, London or SelbyPermanent, full time Closing date - 2 July 2025 Who we are We’re not just talking about making a difference, we’
Machine Learning Operations (ML Ops) Engineer
DraxNorthamptonMachine Learning Operations (MLOps) EngineerFlexible Location – Ipswich, London or SelbyPermanent, full time Closing date - 2 July 2025 Who we are We’re not just talking about making a difference, we’
Senior Machine Learning Engineer - Content ML (AU remote)
CanvaUnited StatesSenior Machine Learning Engineer - Content ML (AU Remote)Join the team redefining how the world experiences design. Hey, g'day, mabuhay, kia ora, hallo, vítejte! Thanks for stopping by. We know job hu
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AIUnited StatesAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Senior Machine Learning Engineer / Tech Lead - AI & ML
Civo LtdUnited StatesAbout Civo:Civo is revolutionising the cloud industry by providing developers and businesses with cutting‑edge, developer‑friendly cloud solutions. With a focus on simplicity, performance, and reliabi
Lead Machine Learning Engineer - Merchandising AI (ML Ops)(Remote Or Hybrid)
TargetUnited StatesThe pay range is $132,000.00 - $238,000.00Pay is based on several factors which vary based on position. These include labor markets and in some instances may include education, work experience and cer
Locums Anesthesiologist Needed in California.
Odyssey LocumsCaliforniaLocum Anesthesiologists needed to start ASAP and work ongoing near Mission District, CA. CA license required. Providers can work either 4 or 5 days/week. Shifts are generally 8 hours long, but can b
Young Mothers Wanted - Earn Up to $120K+ as a Gestational Carrier
OWG SurrogacyCaliforniaWhy OWG Surrogacy?1. Competitive Base Compensation Earn $50,000 – $120,000+ depending on your experience, location, and medical history. 2. Comprehensive Bonus & Benefit Package ($8,000 – $12,000+)
Pre-K Teacher
Merryhill SchoolCaliforniaSpring Education Group’s Early Childhood Education Division includes nearly 150 schools offering services from infant care through Pre-K/K programs, as well as summer camp and after-school programs.
Preschool Teacher
Merryhill SchoolCaliforniaSpring Education Group’s K-12 Division includes nearly 75 schools, with programs spanning Preschool, Elementary, Middle School, and High School. Across all our K-12 schools, the common theme is our d
Preschool Teacher
Merryhill SchoolCaliforniaSpring Education Group’s Early Childhood Education Division includes nearly 150 schools offering services from infant care through Pre-K/K programs, as well as summer camp and after-school programs.
Communication Teacher
Merryhill SchoolCaliforniaSpring Education Group is a multi-brand education network of superior private school institutions spanning infant care through high school. The network (currently composed of approximately 220 schoo
Toddler Teacher
Merryhill SchoolCaliforniaSpring Education Group’s Early Childhood Education Division includes nearly 150 schools offering services from infant care through Pre-K/K programs, as well as summer camp and after-school programs.
Preschool Two's Teacher - $2,500 Sign On Bonus
Merryhill SchoolCaliforniaSpring Education Group’s Early Childhood Education Division includes nearly 150 schools offering services from infant care through Pre-K/K programs, as well as summer camp and after-school programs.
Fleet Technician
Primo BrandsCaliforniaOverview: Primo Brands is a leading branded beverage company in North America with a focus on healthy hydration. We are proud to offer an extensive and iconic portfolio of highly recognizable, sustain
Digital Prepress Technician
MCCCaliforniaBuild your Career with an Industry Leader Multi-Color Corporation is the global leader of premium printed label solutions, helping brands stand out in a competitive marketplace while inspiring pos
Über
Pluralis Research carries out foundational research on
Protocol Learning : multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of the model. The purpose of Protocol Learning is to facilitate the creation of community-trained and community-owned frontier models with self-sustaining economics.
We're looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large-scale training. You'll be implementing a novel substrate for training distributed ML models that work under consumer grade internet connection.
Responsibilities
Distributed Training Architecture & Optimization Design and implement large-scale distributed training systems optimized for heterogeneous hardware operating under low-bandwidth, high-latency conditions. Develop and optimize model-parallel training strategies (data, tensor, pipeline parallelism) with custom sharding techniques that minimize communication overhead. Optimize GPU utilization, memory efficiency, and compute performance across distributed nodes. Implement robust checkpointing, state synchronization, and recovery mechanisms for long-running, fault-prone training jobs. Build monitoring and metrics systems to track training progress, model quality, and system bottlenecks. Decentralized Networking & Resilience
Architect resilient training systems where nodes can fail, networks can partition, and participants can dynamically join or leave. Design and optimize peer-to-peer topologies for decentralized coordination across non-co-located nodes. Implement NAT traversal, peer discovery, dynamic routing, and connection lifecycle management. Profile and optimize communication patterns to reduce latency and bandwidth overhead in multi-participant environments. What You'll Bring
Strong experience building and operating distributed systems in production. Hands-on expertise with distributed training frameworks (FSDP, DeepSpeed, Megatron, or similar). Deep understanding of model parallelism (data, tensor, pipeline parallelism). Expert-level Python with production experience (concurrency, error handling, retry logic, clean architecture). Strong networking fundamentals: P2P systems, gRPC, routing, NAT traversal, distributed coordination. Experience optimizing GPU workloads, memory management, and large-scale compute efficiency. What We Offer
Equity-heavy compensation
with meaningful ownership in a mission-driven company Competitive base salary
for senior engineering roles in Australia Visa sponsorship
available for exceptional candidates Remote-first
with optional access to our Melbourne hub World-class team
- team mates were previously at at Google, Amazon, Microsoft, and leading startups
Backed by Union Square Ventures and other tier-1 investors, we're a world-class, deeply technical team of ML researchers and engineers. Pluralis is unapologetically ideological. We view the world as a better place if we are able to implement what we are attempting, and Protocol Learning as the only plausible approach to preventing a handful of massive corporations monopolising model development, access and release, and achieving massive economic capture. If this resonates, please apply.
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.