Offres d'emploi
Trouvez des postes près de chez vous, sur site, hybrides ou à distance.- Emplois similaires à : Sr. Machine Learning Infrastructure Engineer, Creator Studio
Machine Learning Infrastructure Engineer
Astera LabsUnited StatesAstera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizati
Machine Learning Infrastructure Engineer
Virtual Vocations IncUnited StatesSeeking a fully remote Machine Learning & AI Infrastructure Engineer, the role focuses on designing, deploying, and supporting advanced AI, machine learning, and high-performance computing environment
Machine Learning Infrastructure Engineer
Garuda VenturesPalo AltoLocation Palo Alto Employment Type Full time Location Type On-site Department Software Engineering We’re hiring Machine Learning Infrastructure Engineers to build the systems that make large-scale mod
Machine Learning Infrastructure Engineer
Mind RoboticsUnited StatesMachine Learning Infrastructure EngineerAt Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and reasoning-intensive work in real-world industrial e
Machine Learning Infrastructure Engineer
TRM LabsCaliforniaBuild a Safer World.TRM Labs provides blockchain analytics and AI solutions to help law enforcement, national security agencies, financial institutions, and cryptocurrency businesses detect, investiga
Senior Machine Learning Infrastructure Engineer
PlusAI, Inc.United StatesPlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus w
Senior Machine Learning Infrastructure Engineer
Morph Inc.United StatesJob TitleGoal: 99.99% uptime We serve custom inference stacks that have irregular GPU load. We're looking for people that have done genuinely amazing work in infrastructure who are interested in a cha
Machine Learning Infrastructure Engineer Intern
PlusAI, Inc.Santa ClaraPlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus w
Software Engineer (Machine Learning Infrastructure)
WhatnotSan FranciscoJoin the Future of Commerce with Whatnot! Whatnot is the largest livestream shopping platform in North America and Europe to buy, sell, and discover the things you love. Whether it's trading cards, fa
Staff Machine Learning Engineer, ML Infrastructure
SimpliSafe Wireless Home SecurityUnited StatesAbout SimpliSafeWe're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here that car
Senior Machine Learning Infrastructure Engineer, Fintech
Optasia GroupBrooklynOptasia is a fully enabled B2B2X financial technology platform covering scoring, financial decisioning, disbursement and collection. We are committed to enabling financial inclusion for all. We are ch
Staff Machine Learning Engineer, ML Infrastructure
Unity TechnologiesUnited StatesSan Francisco, CA, USAStaff Machine Learning Engineer, ML InfrastructureLocationSan Francisco, CA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2615904Role descriptionThe opportunityUnity Ve
Staff Machine Learning Engineer, ML Infrastructure
Venturefizz Product Management CommunityUnited StatesPrincipal DevOps EngineerWe're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Machine Learning Solutions Engineer (ML + Infrastructure Focus)
Lightning AIUnited StatesMachine Learning Solutions Engineer (ML + Infrastructure Focus)New York, New York, United States; San Francisco, California, United States; Seattle, Washington, United States Who We AreLightning AI is
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Sr Machine Learning Engineer- ML Infrastructure & Data Platforms
Dormont Manufacturing CoSan JoseWe’re looking for a Senior Machine Learning Engineer to join our Applied Science Data Frameworks team. In this role, you’ll build the infrastructure that powers large-scale, multimodal AI training and
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
100 Salesforce, Inc.SeattleSoftware Engineer – ML Infrastructure We are looking for Software Engineers to join the ML Infrastructure focus area and help architect and operate the core systems that power AI at Slack. Responsibil
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
B CapitalSeattleDescription About Slack AI Slack AI's mission is to transform how people work by making Slack an AI-powered operating system. We're tackling significant challenges like unlocking collective knowledge
Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)
United States Digital Space LLCMountain ViewRegional Manager, Sales Engineering - Public Sector As a Regional Manager, Sales Engineering, you will lead a team of Sales Engineers and frontline leaders, driving technical execution, operational ex
Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)
Unity TechnologiesUnited StatesMountain View, CA, USAMachine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)LocationMountain View, CA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2616004Role descriptio
Sr. Computer Vision and Machine Learning Engineer, Creator Studio
AppleUnited StatesSr. Computer Vision and Machine Learning Engineer, Creator StudioWork Locations (2) Submit Resume At Apple, new ideas have a way of becoming phenomenal products, services, and customer experiences ver
Help Build a Family - Become a Surrogate
Patriot ConceptionsCulver CityPatriot Conceptions helps qualified women become gestational surrogates through a guided, supported process. Surrogates can earn up to $120k+ in total compensation and benefits. Journey-related items
À propos
Machine Learning Infrastructure Engineer
Location:
San Jose, CA Experience:
1-5 years Team:
Applied AI The role
We're hiring a Machine Learning Infrastructure Engineer to build the runtime, platform, and operational backbone for modern AI systems. This role is for someone who wants to work on the systems behind the systems: model access layers, routing, serving paths, telemetry, observability, evaluation infrastructure, and the controls needed to make fast-moving AI work reliable in practice.
This is a platform role, but not in the old sense. The work is tightly coupled to how modern AI systems are actually built and used: multiple model providers, agent runtimes, skill and tool layers, inference telemetry, cost-aware routing, AI spend visibility, and governance that is strong enough for real internal adoption.
What you'll do
Build and improve internal AI infrastructure for LLM applications, agents, retrieval systems, and model-backed engineering workflows. Own inference deployment paths across managed and self-serve environments, including access control, monitoring, and operational reliability. Build platform layers such as model gateways, routing, runtime integrations, telemetry, and controls for safe execution at scale. Develop AI Ops capabilities across evaluation, release readiness, observability, incident triage, regression detection, and cost monitoring. Build dashboards, tracing, logging, and alerting for production AI systems, including spend and usage visibility across tools and teams. Improve performance and unit economics through routing, caching, batching, failover, and latency/cost optimization. Create reusable APIs, SDKs, and platform abstractions that make AI systems easier to deploy, evaluate, govern, and operate. What we're looking for 1-5 years of experience in software engineering, ML infrastructure, MLOps, platform engineering, or related backend/infrastructure roles. Strong Python plus strong systems instincts. Experience with AWS or GCP and real production service ownership. Familiarity with inference deployments, model APIs, gateways, serving systems, or runtime infrastructure for LLM/ML workloads. Experience with observability, telemetry, reliability engineering, and incident response. Understanding of eval systems, release workflows, retrieval-backed systems, and debugging non-deterministic AI behavior. Ability to translate messy platform needs into scalable internal infrastructure. What strong candidates often look like
They have built or operated systems where latency, routing, cost, telemetry, and reliability actually matter. They understand that modern AI infrastructure is not just about getting a model endpoint running. It is about building the runtime, visibility, controls, and developer experience that let an applied AI team move fast without losing quality or trust.
Why this role is interesting
The team is building AI-ready infrastructure in the most literal sense: observability, access control, AI spend tracking, secure managed platforms, skill/tool infrastructure, and telemetry that spans requests, tools, models, and outcomes. If you want to work on the platform layer that makes modern agentic systems possible - and do it in a setting where the downstream users are serious engineers with high expectations - this is that role.
The base pay compensation range for this role is between $140,000 - $165,000
We know that creativity and innovation happen more often when teams include diverse ideas, backgrounds, and experiences, and we actively encourage everyone with relevant experience to apply, including people of color, LGBTQ+ and non-binary people, veterans, parents, and individuals with disabilities.
Compétences linguistiques
- English
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.