About
About The Company
Founded in 2004 in sunny San Diego, California, by visionary engineer Fred Luddy, ServiceNow has established itself as a global leader in cloud-based digital workflows. The company has grown to serve over 8,100 customers worldwide, including 85% of the Fortune 500. ServiceNow's intelligent platform connects people, systems, and processes, empowering organizations to operate more efficiently and innovatively. With a focus on advanced AI technologies, ServiceNow continues to push the boundaries of how work gets done, aiming to make the world work better for everyone. Our commitment to innovation, customer success, and inclusive growth makes us an exciting place to build a career in technology and platform engineering.
About The Role
We are seeking a highly skilled Staff Machine Learning Engineer to join our Platform Engineering and AI Technology Organization (PLATO) at ServiceNow. This role is pivotal in designing, developing, and deploying infrastructure and platform features that support AI workloads, including large language models (LLMs). The successful candidate will collaborate closely with research, AI engineering, and infrastructure teams to ensure GPU clusters are optimized for performance, scalability, and reliability. You will contribute to operational excellence by transforming operational use cases into software tooling, supporting deployment activities, and ensuring high-quality code practices. This position requires a strong background in AI/ML, distributed systems, and software engineering, with a focus on building scalable, reusable solutions that enhance our AI platform capabilities. The role is based in Santa Clara and requires you to be onsite two days per week. It offers an engaging environment in which to work at the forefront of AI technology and platform development.
Qualifications
- 4+ years of development experience with Python, Go, Java, or similar languages
- 4+ years of experience operating highly available distributed workloads on Kubernetes using a DevOps approach
- Proficient in prompt engineering and developing features based on large language models (LLMs)
- Experience with training and fine-tuning large language models, including distillation, supervised fine-tuning, and policy optimization
- Hands-on experience operating LLMs on NVIDIA GPUs
- Strong working knowledge of Linux and J2EE-based distributed systems
- Experience with DevOps tooling such as Helm, Ansible, Prometheus, GitLab CI, and Splunk
- Understanding of software-defined networking, infrastructure as code, and configuration management
- Experience building secure and compliant software solutions in regulated environments
- Ability to lead projects with significant technical risk and drive outcomes
- Preferred: 4+ years of experience in platform operations, SRE, and infrastructure deployment
Responsibilities
- Design, develop, and implement infrastructure, platform, deployment, and observability features to support AI workloads
- Collaborate with research and infrastructure teams to optimize GPU cluster performance, scalability, and reliability
- Enhance operational practices by translating use cases into software tooling to improve system efficiency and stability
- Support deployment activities and provide ongoing support for AI/ML developers
- Write high-quality, scalable, and reusable code following best practices, including code reviews and unit testing
- Partner with product owners to understand detailed requirements and oversee the entire software development lifecycle from design to delivery
- Operate and manage large language models on NVIDIA GPUs, ensuring optimal performance
- Mentor colleagues, promote knowledge sharing, and foster a collaborative team environment
- Continuously explore and experiment with new AI technologies to unlock innovative work experiences
Benefits
- Competitive base salary ranging from $173,100 to $303,000, commensurate with experience and location
- Equity options and incentive compensation programs
- Comprehensive health plans, including flexible spending accounts
- 401(k) retirement plan with company match
- Employee Stock Purchase Program (ESPP) and matching donations
- Flexible time off and family leave programs
- Access to professional development and growth opportunities
- Inclusive and accessible work environment supporting diverse talent
Equal Opportunity
ServiceNow is an equal opportunity employer. We are committed to fostering an inclusive workplace where all qualified applicants receive equal consideration for employment regardless of race, color, creed, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other category protected by law.
Language Skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their website.