Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Product Engineer - Training Platform
Product Engineer - Training Platform
BasetenSan FranciscoAbout BasetenBaseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, fl
Platform Product Manager
Software Technology, Inc.San FranciscoPlatform Product Manager, StandardsAs a Platform Product Manager, Standards, you will be responsible for executing on a set of strategic priorities that uphold our client's community standards. You wi
Product Manager, Compute Platform
ColorwaveSan FranciscoProduct Manager Focused On Compute PlatformAnthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a w
Senior Data Engineer: Scalable Ads Data Platform
Activision BlizzardSan FranciscoActivision Blizzard, Inc. is looking for a Senior Staff Software Engineer (Data) to join their Xbox Advertising Engineering team. The role focuses on building high-scale backend systems for advertisin
Senior Backend Engineer, Ops Platform & AI Automations
WhatnotSan FranciscoWhatnot is seeking a Senior Backend Engineer for the Horizontal Ops Platform team in San Francisco. This role involves building world-class internal operations tooling to meet complex operational need
Senior Backend Engineer for Health AI Platform (Hybrid)
Onos HealthSan FranciscoOnos Health, based in San Francisco, is looking for a Senior Backend Engineer to shape the future of healthcare administration through innovative software solutions. You will architect workflows that
Data Platform Engineer Scale Real-Time Analytics & Infra
FairygodbossSan FranciscoDoorDash is seeking a Data Platform Engineer based in the Bay Area to lead the vision and strategy for a rapidly growing analytics framework. You will scale the platform for increasing data workloads
Senior Frontend Engineer, Payments Platform
Finix Payments IncSan FranciscoFinix Payments, Inc. is hiring a Senior Frontend Engineer in San Francisco to build sophisticated payment dashboards and user interfaces. The role involves leading technical design, optimizing perform
Senior Data Platform Engineer Remote ML/Analytics
Ellipsis HealthSan FranciscoEllipsis Health is seeking a Senior Data Platform Engineer to create AI/ML solutions addressing healthcare staffing challenges. This role offers the flexibility to work remotely anywhere in the U.S. a
Senior Platform Engineer AI-Driven Manufacturing Backend
GetcleraSan FranciscoGetclera is looking for a Senior/Staff Software Engineer (Platform) to design scalable backend services, data pipelines, and integrations with ERP and freight systems in San Francisco. The role is ess
Senior Backend Engineer, AI-Driven RCM Platform
MonographSan FranciscoCommure is looking for a Senior Backend Engineer to join our Revenue Cycle Management AI team. You'll be integral in building intelligent systems that revolutionize healthcare's financial workflows. T
Staff Backend Engineer, Genomics Platform
Radical Numerics Inc.San FranciscoRadical Numerics Inc. is looking for a Member of Technical Staff in Backend Engineering in San Francisco, California. In this role, you will design and build backend services that power the company's
Senior Backend Engineer - AI API Platform & Scale
Perplexity AISan FranciscoPerplexity AI in San Francisco is looking for a skilled backend engineer to design, implement, and scale high-traffic APIs. The ideal candidate will have over 5 years of experience with Python, Go, or
Platform Data Engineer II: BigQuery & Cloud Pipelines
Neon RedwoodSan FranciscoNeon Redwood, a data services consulting company in San Francisco, is seeking an experienced Data Engineer II to enhance their data infrastructure. The candidate should have at least 2 years of experi
Software Engineer II, Backend Full Stack - AI Platform
RipplingSan FranciscoSoftware Engineer II, Backend Full Stack - AI Platform About this position Rippling gives businesses one place to run HR, IT, and Finance. It brings together all of the workforce systems that are norm
Backend Engineer AI Voice Platform & Scalable Systems
Alumni VenturesSan FranciscoAlumni Ventures is seeking a Software Engineer, Backend to build core systems for its voice platform. You'll work on APIs and infrastructure that power dictation features across platforms, collaborati
Senior Backend & Infrastructure Engineer AI Platform
janitorAISan FranciscojanitorAI in San Francisco is seeking an engineer to build AI interactive entertainment. The role involves improving product features deeply wired into our AI stack and enhancing performance, reliabil
Frontend-Driven Product Engineer (Hybrid SF)
AI Talent NowSan FranciscoAI Talent Now is seeking a Full Stack Developer in San Francisco. This hybrid role focuses on building responsive interfaces with React and leading product endeavors. You will manage features end-to-e
Frontend Design Engineer AI UI & Product Experience
Happy RobotSan FranciscoHappyrobot Inc. is seeking a Design Engineer in San Francisco, California to create intuitive, user-friendly interfaces. This role combines product design with frontend engineering, overseeing the aes
Staff Software Engineer, Data Cloud & Analytics Platform
RipplingSan FranciscoRippling is seeking an experienced software developer to join their Data Cloud team in San Francisco. This role involves developing high-quality software across various tech stacks and building data p
UI Design Engineer for AI Platform React/TypeScript
Dedalus Labs, Inc.San FranciscoDedalus Labs, Inc. is seeking a Design Engineer to develop the UI for a pioneering AI platform. The ideal candidate is passionate about user experience and has deep expertise in React and TypeScript.
Senior Staff Backend Engineer - AI Finance Platform
United States Digital Space LLCSan FranciscoUnited States Digital Space LLC in San Francisco is looking for an experienced backend engineer to join their Codex for Finance team. This role involves designing and scaling systems to support AI inn
Software Engineer, Core Product Gaming & Social UX
United States Digital Space LLCSan FranciscoUnited States Digital Space LLC in San Francisco is seeking a Software Engineer to join their Core Product team. This role is crucial in shaping their gaming platform, providing excellent experiences
Staff Software Engineer, Data & Analytics Platform (Remote)
Ultimate LLCSan FranciscoUltimate.ai is seeking a Staff Software Engineer for its Data and Analytics team, focusing on developing analytics capabilities utilizing Java and TypeScript. This hybrid role requires collaboration a
Backend Platform Engineer - Scale, Security & Reliability
United States Digital Space LLCSan FranciscoUnited States Digital Space LLC is looking for a Backend Platform engineer in San Francisco, California. This role involves leading the design and implementation of a sandboxing platform, ensuring rob
Product Engineer - Training Platform
- San Francisco, California, United States
- San Francisco, California, United States
Über
Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.
THE ROLE
We’re looking for a customer-obsessed software engineer to come ship with us. You’ll own features like multi-node training and products like serverless reinforcement learning (RL) from conception to MVP (and from MVP to GA!). You’ll work through the stack, architecting solutions from API and UI down to our infrastructure layer. You’ll fine tune models yourself to develop an understanding of user workflows. You’ll work closely with research engineers leveraging state-of-the-art training techniques to build experiences that accelerate model development and solve for real pain points. If you’re excited to dive deep into the training, let’s talk!
THE PRODUCT
Take a look at what we’ve built so far:
Overview of the product so far
Training docs overview
Story of the Training product
Research we've done
EXAMPLE INITIATIVES
Checkpointing Pipeline: Our checkpointing pipeline starts with automated checkpointing, a feature that ensures that versions of models created during training are automatically backed up to the cloud. Users are able to then deploy checkpoints seamlessly into inference servers, providing point-and-click integrations into inference frameworks like vLLM and Baseten’s Inference Stack. This enables customers to quickly evaluate the performance of their checkpoints with real traffic.
Multinode training: Multinode training enables customers to easily run training jobs across multiple compute nodes, enabling users to train large models like GLM 4.7 and DeepSeek. We’ve built deeply at the Kubernetes layer to ensure that scheduling, startup, inter-node communication, and shutdown happen seamlessly under the hood and as the user expects.
Training DX: Customers come to train on Baseten because it helps them get to value fast. To do this, we ensure that the features we ship aren’t just fast, but are easy to iterate with. We enhanced Baseten’s metrics from pod-level GPU summaries to per-GPU and per-Node. We’ve built a CLI experience that caters to terminal users, and UI experiences that enable user to seamlessly manage their training jobs.
Responsibilities
Iterate like crazy
Design ergonomic APIs and abstractions to model complex resources and lifecycles
Work throughout the stack (API layer, backend and database implementation, infra layer; frontend is a plus) to implement features.
Fine-tune and deploy models to develop intuition around training workflows.
Partner closely with model developers and world-class research engineers to understand the requirements and pain points of post-training workflows.
Drive long-term improvements to improve reliability of systems and velocity of development
Fix bugs & resolve customer issues with urgency
Requirements
5+ years experience building software applications
Deep knowledge of the web stack, databases, and distributed systems
Experience developing developer tooling or infrastructure products for external or internal users.
Good taste in product, particularly developer-oriented tools
Interest in ML/AI infrastructure and willingness to learn
Driven by high agency and ownership
Strong communication skills with the ability to bridge technical depth and business needs
NICE TO HAVE
Experience launching features and products through different release cycles (MVP, Beta, GA, etc.)
Experience with model development methods and paradigms, like Supervised Fine-Tuning, Reinforcement Learning, Synthetic Data Generation, LoRA, Full Finetunes, etc.
Familiarity or experience with the open source training stack and frameworks (NCCL, PyTorch, Megatron, NemoRL, VeRL, Axolotl, HF Trainer) and distributed training techniques (FSDP, DeepSpeed).
Experience developing AI products, tooling, or agents
Frontend fluency
Benefits
Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year\'s Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
Compensation Range: $200K - $275K
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.