This job offer is no longer available
Machine Learning Infrastructure Engineer - Model Inference
Abridge
- United States
About
As an ML Infrastructure Engineer, Model Inference at Abridge, you'll play a pivotal role in building and optimizing the core inference infrastructure that powers our machine learning models. Your work will be instrumental in enhancing the scalability, efficiency, and performance of our AI-driven solutions. You will work with our Infrastructure and Research teams to build, deploy, optimize, and orchestrate our AI models.
What You'll Do
- Design, deploy, and maintain scalable Kubernetes clusters for AI model inference and training.
- Develop, optimize, and maintain ML model serving infrastructure, ensuring high performance and low latency.
- Collaborate with ML and product teams to scale backend infrastructure for AI-driven products, focusing on model deployment, throughput optimization, and compute efficiency.
- Optimize compute-heavy workflows and enhance GPU utilization for ML workloads.
- Build a robust model API orchestration system.
- Collaborate with leadership to define and implement strategies for scaling infrastructure as the company grows, ensuring long-term efficiency and performance.
What You'll Bring
- Strong experience in building and deploying machine learning models in production environments.
- Deep understanding of container orchestration and distributed systems architecture.
- Expertise in Kubernetes administration, including custom resource definitions, operators, and cluster management.
- Experience developing APIs and managing distributed systems for both batch and real-time workloads.
- Excellent communication skills, with the ability to interface between research and product engineering.
Ideally, You Have
- Expertise with model serving frameworks such as NVIDIA Triton Inference Server, vLLM, and TensorRT-LLM.
- Expertise with ML toolchains such as PyTorch, TensorFlow, or distributed training and inference libraries.
- Familiarity with GPU cluster management and CUDA optimization.
- Knowledge of infrastructure as code (Terraform, Ansible) and GitOps practices.
- Experience with container registries, image optimization, and multi-stage builds for ML workloads.
- Experience orchestrating across ASR models or LLMs to build GenAI applications.
Why Work at Abridge?
At Abridge, we're transforming healthcare delivery experiences with generative AI, enabling clinicians and patients to connect in deeper, more meaningful ways. Our mission is clear: to power deeper understanding in healthcare. We're driving real, lasting change, with millions of medical conversations processed each month.
Joining Abridge means stepping into a fast-paced, high-growth startup where your contributions truly make a difference. Our culture requires extreme ownership: every employee has the ability to (and is expected to) make an impact on our customers and our business. Beyond individual impact, you will have the opportunity to work alongside a team of curious, high-achieving people in a supportive environment where success is shared, growth is constant, and feedback fuels progress.
At Abridge, it's not just what we do; it's how we do it. Every decision is rooted in empathy, always prioritizing the needs of clinicians and patients. We're committed to supporting your growth, both professionally and personally. Whether it's flexible work hours, an inclusive culture, or ongoing learning opportunities, we are here to help you thrive and do the best work of your life.
If you are ready to make a meaningful impact alongside passionate people who care deeply about what they do, Abridge is the place for you.
How We Take Care of Abridgers
- Generous Time Off: 14 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees.
- Comprehensive Health Plans: Medical, Dental, and Vision coverage for all full-time employees and their families.
- Generous HSA Contribution: If you choose a High Deductible Health Plan, Abridge makes monthly contributions to your HSA.
- Paid Parental Leave: Generous paid parental leave for all full-time employees.
- Family Forming Benefits: Resources and financial support to help you build your family.
- 401(k) Matching: Contribution matching to help invest in your future.
- Personal Device Allowance: Tax-free funds for personal device usage.
- Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits.
- Lifestyle Wallet: Monthly contributions for fitness, professional development, coworking, and more.
- Mental Health Support: Dedicated access to therapy and coaching to help you reach your goals.
- Sabbatical Leave: Paid Sabbatical Leave after 5 years of employment.
- Compensation and Equity: Competitive compensation and equity grants for full-time employees.
... and much more!
Equal Opportunity Employer
Abridge is an equal opportunity employer and considers all qualified applicants equally without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability. We provide reasonable accommodations throughout the interview process. If you need reasonable accommodation in applying, interviewing, completing any assessment, or otherwise participating in the employee selection process, please contact us at accommodations@abridge.com.
Staying Safe - Protect Yourself from Recruitment Fraud
We are aware of individuals and entities fraudulently representing themselves as Abridge recruiters and/or hiring managers. Abridge will never ask for financial information or payment, or for personal information such as a bank account number or social security number, during the job application or interview process. Any emails from the Abridge recruiting team will come from an @abridge.com email address. You can learn more about how to protect yourself from these types of fraud by referring to this article. Please exercise caution and cease communications if something feels suspicious about your interactions.
Languages
- English