Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Staff Machine Learning Engineer, ML Infrastructure
Staff Machine Learning Engineer, ML Infrastructure
Venturefizz Product Management CommunityUnited StatesPrincipal DevOps EngineerWe're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here
Staff Machine Learning Engineer, ML Infrastructure
Unity TechnologiesUnited StatesSan Francisco, CA, USAStaff Machine Learning Engineer, ML InfrastructureLocationSan Francisco, CA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2615904Role descriptionThe opportunityUnity Ve
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Machine Learning Infrastructure Engineer
Garuda VenturesPalo AltoLocation Palo Alto Employment Type Full time Location Type On-site Department Software Engineering We’re hiring Machine Learning Infrastructure Engineers to build the systems that make large-scale mod
Machine Learning Infrastructure Engineer
TRM LabsCaliforniaBuild a Safer World.TRM Labs provides blockchain analytics and AI solutions to help law enforcement, national security agencies, financial institutions, and cryptocurrency businesses detect, investiga
Senior Machine Learning Infrastructure Engineer
PlusAI, Inc.United StatesPlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus w
Machine Learning Infrastructure Engineer- Model Inference
AbridgeUnited StatesML Infrastructure Engineer, Model InferenceAs an ML Infrastructure Engineer, Model Inference at Abridge, you'll play a pivotal role in building and optimizing the core inference infrastructure that po
Machine Learning Engineer-Model Serving Infrastructure
ByteDanceSeattleMachine Learning Engineer-Model Serving Infrastructure Machine Learning Engineer-Model Serving Infrastructure 2 weeks ago Be among the first 25 applicants Responsibilities The mission of our AML team
AIML - ML Engineer, Machine Learning Platform & Infrastructure
Career-MoverSeattleAIML - ML Engineer, Machine Learning Platform & InfrastructureSeattle, United States | Posted on 09/16/2023 At Apple, the Information Intelligence teams are at the forefront of developing groundbreaki
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technical oriented, impact delivering-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. I
Senior Machine Learning Engineer - ML Training Infrastructure
General MotorsDoverJob OverviewSenior ML Engineer – ML Training Infrastructure, General Motors. We are seeking an experienced, technical‑oriented, impact‑delivering expert in ML training infrastructure to design and bui
Sr. Machine Learning Infrastructure Engineer, Creator Studio
AppleCulver CitySr. Machine Learning Infrastructure Engineer, Creator Studio Culver City, California, United States Software and ServicesAt Apple, new ideas have a way of becoming phenomenal products, services, and c
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
SalesforceUnited StatesTo get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job Category Software EngineeringJob DetailsAbout Sal
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
100 Salesforce, Inc.SeattleSoftware Engineer – ML Infrastructure We are looking for Software Engineers to join the ML Infrastructure focus area and help architect and operate the core systems that power AI at Slack. Responsibil
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
Centaur LabsAustinSoftware EngineeringAbout Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzwo
Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack
Salesforce.Com IncUnited StatesTo get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Software EngineeringJob Details About S
Sr Machine Learning Engineer- ML Infrastructure & Data Platforms
AdobeUnited StatesSenior Machine Learning EngineerWe're looking for a Senior Machine Learning Engineer to join our Applied Science Data Frameworks team. In this role, you'll build the infrastructure that powers large-s
Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)
Unity TechnologiesUnited StatesMountain View, CA, USAMachine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad)LocationMountain View, CA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2616004Role descriptio
Staff ML Platform Engineer: Data Infrastructure & Tools
Dormont Manufacturing CoNew YorkDormont Manufacturing Co in New York is looking for a candidate to work on their Data Platform, focusing on building platforms that support the data science development lifecycle. You'll collaborate w
Senior Staff Machine Learning Engineer, (Machine Learning)
AffirmSalt Lake CityAffirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest.Join the team as a Senior St
Staff Machine Learning Engineer
XPENGUnited StatesStaff Machine Learning EngineerSanta Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles
Staff Machine Learning Engineer
AppFolioUnited StatesDescriptionHi, We're AppFolioWe're innovators, changemakers, and collaborators. We're more than just a software company - we're building the AI-native platform where the real estate industry comes to
Staff Machine Learning Engineer
XometryUnited StatesStaff Machine Learning EngineerWaltham, MA Xometry powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry's digita
Staff Machine Learning Engineer
XometryUnited StatesStaff Machine Learning EngineerNorth Bethesda, MD Xometry powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry's
Staff Machine Learning Engineer, ML Infrastructure
- United States
- United States
Über
We're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here that cares just as deeply about the career you're building. Ours is a no ego culture of collaboration and innovation where those seeking their next challenge can find big opportunities and make a huge impact on the lives of all those who we protect. We don't just want you to work here. We want you to grow and thrive here. We're embracing a hybrid work model that enables our teams to split their time between office and home. Hybrid for us means we expect our teams to come together in our state-of-the-art office on two core days, typically Tuesday, Wednesday, or Thursday – working together in person and choosing where they work for the remainder of the week. We all benefit from flexibility and get to use the best of both worlds to get our work done. We're growing and thriving. So, we need smart, talented, and humble people who share our values to join us as we disrupt the home security space and relentlessly pursue our mission of keeping Every Home Secure. We're looking for a Staff ML Engineer to join our Cloud ML team — the team that owns both the cloud-side ML infrastructure and the applied ML research that powers SimpliSafe's intelligent home security products. This is a senior individual contributor role focused on raising the bar for how we build, deploy, and operate ML systems at scale. You'll partner closely with other Staff and Principal engineers to drive architecture, mentor across the team, and set the technical direction for our ML platform. The work spans two of our most demanding workloads: real-time computer vision inference that processes video from cameras and doorbells across our customer base, and LLM/GenAI infrastructure that will power our future generation of intelligent applications. This role is for someone who has built ML infrastructure before, knows where the sharp edges are, and is energized by making other teams faster and more reliable. Set technical direction for ML infrastructure Drive architecture decisions for our Kubernetes-based ML platform — anchored on Ray for inference, alongside KServe, Triton, and vLLM — across real-time and batch workloads. Lead deep technical reviews on system design, capacity planning, and reliability for the highest-stakes ML systems at SimpliSafe. Identify and remove the systemic bottlenecks in our ML deployment infrastructure — whether that's serving reliability, deployment friction, observability gaps, scaling, or cost. Build and operate real-time CV inference at scale Own the design and evolution of cloud-side inference systems that process live video and events from SimpliSafe devices in real time. Drive throughput, latency, and cost improvements (batching strategies, GPU utilization, autoscaling, multi-model serving) for production CV models. Build the feedback loops between cloud inference, edge devices, and the data flywheel that improves model quality over time. Stand up LLM/GenAI serving infrastructure Help shape how SimpliSafe serves LLMs in production — model serving patterns, KV-cache and batching strategies, evaluation pipelines, guardrails, and cost controls. Partner with applied ML engineers to take new GenAI-powered product features from prototype to scaled deployment. Raise the engineering bar across Cloud ML Mentor engineers across the team through design reviews, code reviews, pairing, and written guidance — a meaningful uplift on everyone you work with. Establish and evangelize best practices for model lifecycle management (registry, deployment, monitoring, rollback, drift) and on-call. Write the documentation, runbooks, and architectural decision records that make the platform legible and durable. Own reliability and operational excellence Lead incident response and postmortems for critical ML systems; turn lessons learned into platform-level improvements. Define SLOs, observability standards, and on-call practices for ML services in production. Qualifications 8+ years of software/ML engineering experience, with a clear track record of building and operating production ML systems at scale. Deep expertise in cloud ML infrastructure on Kubernetes, with hands-on production experience with Ray (which powers our inference stack); experience with KServe, Triton, vLLM, Kubeflow, Argo, or similar is a strong plus. Strong production experience on AWS (EKS, S3, IAM, networking) and with Kafka, containerized deployments, CI/CD, and infrastructure-as-code. Demonstrated experience designing and operating high-throughput, low-latency inference systems — GPU-aware scheduling, batching, autoscaling, multi-tenancy. Solid grounding in ML fundamentals: how models are trained, evaluated, versioned, deployed, monitored, and rolled back in production. Proficiency in Python is required; experience with a systems language (Go, C++, Rust) for performance-sensitive components is a plus. Staff-level technical leadership: ability to drive ambiguous, cross-cutting initiatives, align senior stakeholders, and elevate the engineers around you without formal authority. Strong written and verbal communication — you can make complex technical tradeoffs legible to ML scientists, product, and other infra teams. Bonus Points Hands-on experience with LLM serving in production (vLLM, TGI, TensorRT-LLM, SGLang) — KV cache management, continuous batching, speculative decoding, quantization for serving. Experience building real-time video or streaming ML pipelines (Kafka, Kinesis, Flink, or similar) at scale. Background supporting CV workloads in production — model formats, GPU/accelerator tradeoffs, video codecs. Experience with model lifecycle tooling (MLflow, Weights & Biases, model registries, drift detection, shadow deployments). Open source contributions to the ML infrastructure ecosystem (Ray, KServe, Triton, vLLM, Kubeflow, etc.). Experience operating in environments with strong security and compliance requirements. The Cloud ML team owns the full surface area — infrastructure and applied research — which means your work as a Staff infra engineer directly shapes what's possible for the science. You'll have unusual leverage: the platform you build determines how fast SimpliSafe can ship intelligent features, and the features we ship directly impact whether someone's home is safer tonight than it was yesterday. What Values You'll Share Customer Obsessed – Building deep empathy for our customers, putting them at the core of our work, and developing strong, long-term relationships with them. Aim High – Always challenging ourselves and others to raise the bar. No Ego – Maintaining a "no job too small" attitude, and an open, inclusive and humble style. One Team – Taking a highly collaborative approach to achieving success. Lift As We Climb – Investing in developing others and helping others around us succeed. Lean & Nimble – Working with agility and efficiency to experiment in an often ambiguous environment. What We Offer A mission- and values-driven culture and a safe, inclusive environment where you can build, grow and thrive A comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families Free SimpliSafe system and professional monitoring for your home. Employee Resource Groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change. The target annual base pay range for this role is $183,500 to $244,600. This target annual base pay range represents our good-faith estimate of what we expect to pay for this role. We use a market-based compensation approach to set our target annual base pay ranges and make adjustments annually. We carefully tailor individual compensation packages, including base pay, taking into consideration employees' job-related skills, experience, qualifications, work location, and other relevant business factors. Beyond base pay, we offer a Total Rewards package that may include participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits. More details can be found here. We're committed to fair and equitable pay practices, as
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.