Founding Machine Learning Infrastructure Engineer
- Palo Alto, California, United States
About
Location: Onsite in Palo Alto
Compensation: Competitive Salary + Equity
About Model AI
Model AI is building the infrastructure and application stack for the next generation of agentic AI systems.
We believe token usage will grow exponentially over the coming years, but routing all inference through closed model providers will remain too expensive for many users and enterprises. Our thesis is that agentic applications require a vertically integrated stack: high-throughput, cost-efficient serving infrastructure paired with an application layer designed for long-running, agentic workloads.
Model AI is building the Agent Cloud, a serving and training infrastructure platform purpose-built for agentic workloads, long-context inference, and large-scale open-source model deployment. By combining infrastructure and application design, we aim to make open-source models significantly more performant, practical, and competitive.
About This Role
We are looking for an ML Systems Engineer to help build and optimize the core serving infrastructure behind Agent Cloud. This role focuses on high-performance inference across different accelerators.
You will work on model serving performance, accelerator utilization, long-context inference, batching, scheduling, KV cache management, runtime efficiency, and cost reduction. This is a deeply technical role at the intersection of ML systems, infrastructure, and product.
Direct TPU experience is a strong plus, but not required. We care most about strong ML systems fundamentals, performance intuition, and the ability to ship reliable systems quickly.
What You'll Do
Optimize large-scale LLM inference and serving systems.
Improve total tokens per second, decode tokens per second, latency, throughput, and cost efficiency.
Work on serving infrastructure for open-source models across different types of accelerators.
Improve batching, scheduling, KV cache management, memory usage, and accelerator utilization.
Support long-context inference, including workloads targeting context lengths of up to 1M tokens.
Debug performance bottlenecks across model execution, runtime, networking, and infrastructure.
Work with frameworks such as JAX/XLA, PyTorch, vLLM, SGLang, TensorRT-LLM, or related systems.
Collaborate closely with the application team to ensure infrastructure is optimized for agentic workloads, not just generic chatbot inference.
Help turn research prototypes into reliable, high-performance production systems.
Qualifications
Strong experience in ML systems, distributed systems, or high-performance computing.
Experience optimizing inference or training workloads for large models.
Familiarity with TPUs, GPUs, or other accelerators.
Experience with one or more of CUDA, Triton, NCCL, JAX/XLA, PyTorch internals, vLLM, SGLang, TensorRT-LLM, distributed inference, or distributed training.
Strong systems debugging skills.
Comfort working across model code, runtime, infrastructure, and product requirements.
High ownership and the ability to operate effectively in an early-stage startup environment.
Cultural Fit
Hands-on technical excellence and strong engineering judgment.
End-to-end ownership, from design to implementation to production outcomes.
Bias for action: ship quickly, learn from failures, and iterate.
High intensity during critical milestones, with a focus on real customer impact.
Ability to do deep, focused work and sustain execution.
Clear communication with teammates, customers, and stakeholders.
Comfort with ambiguity, rapid change, and wearing multiple hats.
Low ego, high integrity, high accountability, and strong collaboration.
Continuous learning and a belief that judgment, intelligence, and capability compound over time.
If you are excited to build the infrastructure and agent systems behind the next generation of AI applications, push open-source models to production-grade performance, and turn ambitious research ideas into real-world impact, Model AI is the place for you.
Language skills
- English
This listing comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their site.