Founding Machine Learning Infrastructure Engineer
- Palo Alto, California, United States
About
Compensation: Competitive Salary + Equity
About Model AI
Model AI is building the infrastructure and application stack for the next generation of agentic AI systems.
We believe token usage will grow exponentially over the coming years, but routing all inference through closed model providers will remain too expensive for many users and enterprises. Our thesis is that agentic applications require a vertically integrated stack: high-throughput, cost-efficient serving infrastructure paired with an application layer designed for long-running, agentic workloads.
Model AI is building the Agent Cloud, a serving and training infrastructure platform purpose-built for agentic workloads, long-context inference, and large-scale open-source model deployment. By combining infrastructure and application design, we aim to make open-source models significantly more performant, practical, and competitive.
About This Role
We are looking for an ML Systems Engineer to help build and optimize the core serving infrastructure behind Agent Cloud. This role focuses on high-performance inference across different accelerators.
You will work on model serving performance, accelerator utilization, long-context inference, batching, scheduling, KV cache management, runtime efficiency, and cost reduction. This is a deeply technical role at the intersection of ML systems, infrastructure, and product.
Direct TPU experience is a strong plus, but not required. We care most about strong ML systems fundamentals, performance intuition, and the ability to ship reliable systems quickly.
What You'll Do
Optimize large-scale LLM inference and serving systems.
Improve total tokens per second, decode tokens per second, latency, throughput, and cost efficiency.
Work on serving infrastructure for open-source models across different types of accelerators.
Improve batching, scheduling, KV cache management, memory usage, and accelerator utilization.
Support long-context inference, including workloads targeting context lengths of up to 1M tokens.
Debug performance bottlenecks across model execution, runtime, networking, and infrastructure.
Work with frameworks such as JAX/XLA, PyTorch, vLLM, SGLang, TensorRT-LLM, or related systems.
Collaborate closely with the application team to ensure infrastructure is optimized for agentic workloads, not just generic chatbot inference.
Help turn research prototypes into reliable, high-performance production systems.
Qualifications
Strong experience in ML systems, distributed systems, or high-performance computing.
Experience optimizing inference or training workloads for large models.
Familiarity with TPUs, GPUs, or other accelerators.
Experience with one or more of CUDA, Triton, NCCL, JAX/XLA, PyTorch internals, vLLM, SGLang, TensorRT-LLM, distributed inference, or distributed training.
Strong systems debugging skills.
Comfort working across model code, runtime, infrastructure, and product requirements.
High ownership and the ability to operate effectively in an early-stage startup environment.
Cultural Fit
Hands-on technical excellence and strong engineering judgment.
End-to-end ownership, from design to implementation to production outcomes.
Bias for action: ship quickly, learn from failures, and iterate.
High intensity during critical milestones, with a focus on real customer impact.
Ability to do deep, focused work and sustain execution.
Clear communication with teammates, customers, and stakeholders.
Comfort with ambiguity, rapid change, and wearing multiple hats.
Low ego, high integrity, high accountability, and strong collaboration.
Continuous learning and a belief that judgment, intelligence, and capability compound over time.
If you are excited to build the infrastructure and agent systems behind the next generation of AI applications, push open-source models to production-grade performance, and turn ambitious research ideas into real-world impact, Model AI is the place for you.
Language skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their website.