Offres d'emploi
Trouvez des postes près de chez vous, sur site, hybrides ou à distance.- Emplois similaires à : Machine Learning Engineer - Distributed ML Systems
Machine Learning Engineer - Distributed ML Systems
PluralisUnited StatesOverviewPluralis Research carries out foundational research onProtocol Learning : multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of th
Machine Learning Systems Engineer: Distributed Training
Susquehanna International GroupUnited StatesOverviewWe're looking for a Machine Learning Systems Engineer to strengthen the performance and scalability of our distributed training infrastructure. In this role, you'll work closely with researche
Research Engineer, Machine Learning Systems
DeepgramUnited StatesCompany OverviewDeepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building produ
Sr. Machine Learning Engineer (Recommendation Systems)
PhiloUnited StatesAt Philo, we're a group of technology and product people who set out to build the future of television, marrying the best in modern technology with the most compelling medium ever invented - in short,
Machine Learning Engineer, ML Systems and Infrastructure
AutodeskUnited StatesJob Requisition ID #26WD98119POSITION OVERVIEWThe work we do at Autodesk touches nearly every person on the planet. By creating software tools for making buildings,machines, and even the latest movies
Machine Learning (ML) Engineer
ViewsUnited StatesLOCATION: REMOTE UNITED STATES DEPARTMENT: ENGINEERING WORK STATUS: FULL-TIME OverviewAre you looking for a hybrid or remote work opportunity? Are you interested in a workplace that allows for flexibi
Principal ML Engineer, Machine Learning Platform and Systems Architecture
AutodeskUnited StatesPrincipal Machine Learning Engineer, ML Platform and Systems ArchitectureThe work we do at Autodesk touches nearly every person on the planet. By creating software tools for making buildings, machines
Senior Principal Machine Learning Engineer, ML Platform and Systems Architecture
AutodeskUnited StatesSenior Principal Machine Learning Engineer, ML Platform and Systems ArchitectureThe work we do at Autodesk touches nearly every person on the planet. By creating software tools for making buildings, m
Machine Learning Engineer - ML Training Platform
Pluralis ResearchUnited StatesML Training Platform EngineerPluralis Research is pioneering Protocol Learning —a fully decentralised way to train and deploy AI models that opens this layer to individuals rather than well resourced
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AIUnited StatesAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AIUnited StatesAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Senior Machine Learning Engineer - ML Planner
MotionalUnited StatesOn Behaviors , you'll have the opportunity to work with world-class ML engineers and research scientists to make self-driving vehicles a reality and create positive social impact. Our team works on th
Machine Learning Research Engineer, GenAI Applied ML
Scale AIUnited StatesAbout This Role Lead applied ML engineering on Scale's Applied ML team, powering data infrastructure for leading agentic LLMs (ChatGPT, Gemini, Llama). You will build scalable multi-agent systems to v
Machine Learning Research Engineer, GenAI Applied ML
Scale AIUnited StatesAbout This RoleLead applied ML engineering on Scale's Applied ML team, powering data infrastructure for leading agentic LLMs (ChatGPT, Gemini, Llama). You will build scalable multi-agent systems to va
Lead Machine Learning Engineer - Merchandising AI (ML Ops)(Remote Or Hybrid)
TargetUnited StatesThe pay range is $132,000.00 - $238,000.00Pay is based on several factors which vary based on position. These include labor markets and in some instances may include education, work experience and cer
Senior Oncology Account Manager - Northern New Jersey
Jazz PharmaceuticalsNewarkIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Licensed Practical Nurse LPN Full Time
Aveanna HealthcareFairviewPosition OverviewThe Licensed Practical Nurse is an essential part of the team responsible for providing and documenting skilled nursing care in accordance with the developed care plan and physicians’
Senior Oncology Account Manager - Northern New Jersey
Jazz PharmaceuticalsTrentonIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Home Health Registered Nurse Full Time
Aveanna HealthcareCouncil BluffsPosition OverviewThe Registered Nurse Admissions is responsible for providing and documenting skilled nursing care in accordance with the developed care plan and physicians’ orders for each individual
Home Health Registered Nurse RN Full Time 10K Bonus
Aveanna HealthcareWaupacaRegistered Nurse (Home Health) - Full-Time $5,000 Sign On BonusAt Aveanna, we believe the best care happens at home—and that great outcomes start with supporting the nurses who deliver that care. When
In Home Healthcare LVN-Weekday Shifts l No Weekends!
Aveanna HealthcareFloresvilleJoin a Company That Puts People First!Licensed Practical / Vocational Nurse – LPN/LVNOur local office is looking for a team of compassionate nurses to provide care for a very special client/patient. H
Home Health Registered Nurse RN Salaried Full Time
Aveanna HealthcareHardeevilleRegistered Nurse (Home Health)At Aveanna, we believe the best care happens at home—and that great outcomes start with supporting the nurses who deliver that care. When you join Aveanna’s Home Health t
Home Health Registered Nurse Case Manager Full Time
Aveanna HealthcareHillsboroRegistered Nurse (Home Health) Full TimeTerritory: Hillsboro, Elroy, Camp Douglas, MaustonAt Aveanna, we believe the best care happens at home—and that great outcomes start with supporting the nurses
Medical Science Liaison, Neuro-Oncology, Mid-Atlantic
Jazz PharmaceuticalsHartfordIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Medical Science Liaison, Neuro-Oncology - Central
Jazz PharmaceuticalsIndianapolisIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Machine Learning Engineer - Distributed ML Systems
- United States
- United States
À propos
Pluralis Research carries out foundational research on
Protocol Learning : multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of the model. The purpose of Protocol Learning is to facilitate the creation of community-trained and community-owned frontier models with self-sustaining economics.
We're looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large-scale training. You'll be implementing a novel substrate for training distributed ML models that work under consumer grade internet connection.
Responsibilities
Distributed Training Architecture & Optimization Design and implement large-scale distributed training systems optimized for heterogeneous hardware operating under low-bandwidth, high-latency conditions. Develop and optimize model-parallel training strategies (data, tensor, pipeline parallelism) with custom sharding techniques that minimize communication overhead. Optimize GPU utilization, memory efficiency, and compute performance across distributed nodes. Implement robust checkpointing, state synchronization, and recovery mechanisms for long-running, fault-prone training jobs. Build monitoring and metrics systems to track training progress, model quality, and system bottlenecks. Decentralized Networking & Resilience
Architect resilient training systems where nodes can fail, networks can partition, and participants can dynamically join or leave. Design and optimize peer-to-peer topologies for decentralized coordination across non-co-located nodes. Implement NAT traversal, peer discovery, dynamic routing, and connection lifecycle management. Profile and optimize communication patterns to reduce latency and bandwidth overhead in multi-participant environments. What You'll Bring
Strong experience building and operating distributed systems in production. Hands-on expertise with distributed training frameworks (FSDP, DeepSpeed, Megatron, or similar). Deep understanding of model parallelism (data, tensor, pipeline parallelism). Expert-level Python with production experience (concurrency, error handling, retry logic, clean architecture). Strong networking fundamentals: P2P systems, gRPC, routing, NAT traversal, distributed coordination. Experience optimizing GPU workloads, memory management, and large-scale compute efficiency. What We Offer
Equity-heavy compensation
with meaningful ownership in a mission-driven company Competitive base salary
for senior engineering roles in Australia Visa sponsorship
available for exceptional candidates Remote-first
with optional access to our Melbourne hub World-class team
- team mates were previously at at Google, Amazon, Microsoft, and leading startups
Backed by Union Square Ventures and other tier-1 investors, we're a world-class, deeply technical team of ML researchers and engineers. Pluralis is unapologetically ideological. We view the world as a better place if we are able to implement what we are attempting, and Protocol Learning as the only plausible approach to preventing a handful of massive corporations monopolising model development, access and release, and achieving massive economic capture. If this resonates, please apply.
Compétences linguistiques
- English
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.