About
Goal: 99.99% uptime We serve custom inference stacks that have irregular GPU load. We're looking for people that have done genuinely amazing work in infrastructure who are interested in a challenge, working with both traditional infrastructure such as load balancers, NLB, etc., as well as very different infrastructure around inference engines and GPU loads. This is a role that will inherently require deep experience with inference engines. Contributions to vLLM, SGLang, trtllm, or inference frameworks a plus. Every role at Morph comes with unlimited tokens on claude code/codex
Languages
- English
Notice for Users
This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.