This job offer is no longer available
About
Build and maintain evaluation infrastructure to measure whats working and catch regressions before customers do
Optimize LLM inference: latency, cost, model routing, and quality tradeoffs
Partner with product teams on model selection and performance benchmarking
Work closely with product engineers and PMs to translate customer quality problems into ML hypotheses and solutions
Own models end-to-end: from research and experimentation to production deployment
Requirements: 5+ years of experience in a dedicated ML Engineer role (not ML-adjacent software engineering)
Strong Python skills; experience with PyTorch, VLLM, DSPy or similar LLM optimization frameworks
Hands-on experience working with large language models in production: prompt engineering, fine-tuning, evaluation, inference optimization
Ability to move from ambiguous problem (this agent isnt performing well) to experimental design to shipped improvement
Comfort working with limited supervision in an early-stage product environment.
Benefits: Equity plan available for eligible roles
Annual bonus targets under the bonus plan for eligible roles
Languages
- English
Notice for Users
This job was posted by one of our partners. You can view the original job source here.