Machine Learning Engineer

ServiceNow

United States

United States

Jetzt Bewerben

Über

About The Company

Founded in 2004 in sunny San Diego, California, ServiceNow has established itself as a global leader in cloud-based digital workflows. Inspired by visionary engineer Fred Luddy, the company has grown to serve over 8,100 customers worldwide, including 85% of the Fortune 500. ServiceNow's intelligent platform seamlessly connects people, systems, and processes, empowering organizations to operate more efficiently and innovatively. With a focus on leveraging advanced AI technologies, ServiceNow continues to push the boundaries of how work gets done, aiming to make the world work better for everyone. Our commitment to innovation, customer success, and inclusive growth makes us an exciting place to build a career in technology and platform engineering.

About The Role

We are seeking a highly skilled Staff Machine Learning Engineer to join our Platform Engineering and AI Technology Organization (PLATO) at ServiceNow. This role is pivotal in designing, developing, and deploying infrastructure and platform features that support AI workloads, including large language models (LLMs). The successful candidate will collaborate closely with research, AI engineering, and infrastructure teams to ensure GPU clusters are optimized for performance, scalability, and reliability. You will contribute to operational excellence by transforming operational use cases into software tooling, supporting deployment activities, and ensuring high-quality code practices. This position requires a strong background in AI/ML, distributed systems, and software engineering, with a focus on building scalable, reusable solutions that enhance our AI platform capabilities. The role is based in Santa Clara, with a requirement to be onsite two days per week, offering an engaging environment to work at the forefront of AI technology and platform development.

Qualifications

4+ years of development experience with Python, GoLang, Java, or similar languages
4+ years of experience operating highly available distributed workloads on Kubernetes using a DevOps approach
Proficient in prompt engineering and developing features based on large language models (LLMs)
Experience with training and fine-tuning large language models, including distillation, supervised fine-tuning, and policy optimization
Hands-on experience operating LLMs on NVIDIA GPUs
Strong working knowledge of Linux and J2EE-based distributed systems
Experience with DevOps tooling such as Helm, Ansible, Prometheus, GitLab CI, and Splunk
Understanding of software-defined networking, infrastructure as code, and configuration management
Experience building secure and compliant software solutions in regulated environments
Ability to lead projects with significant technical risk and drive outcomes
Asset: 4+ years of experience in platform operations, SRE, and infrastructure deployment

Responsibilities

Design, develop, and implement infrastructure, platform, deployment, and observability features to support AI workloads
Collaborate with research and infrastructure teams to optimize GPU cluster performance, scalability, and reliability
Enhance operational practices by translating use cases into software tooling to improve system efficiency and stability
Support deployment activities and provide ongoing support for AI/ML developers
Write high-quality, scalable, and reusable code following best practices, including code reviews and unit testing
Partner with product owners to understand detailed requirements and oversee the entire software development lifecycle from design to delivery
Operate and manage large language models on NVIDIA GPUs, ensuring optimal performance
Mentor colleagues, promote knowledge sharing, and foster a collaborative team environment
Continuously explore and experiment with new AI technologies to unlock innovative work experiences

Benefits

Competitive base salary ranging from $173,100 to $303,000, commensurate with experience and location
Equity options and incentive compensation programs
Comprehensive health plans, including flexible spending accounts
401(k) retirement plan with company match
Employee Stock Purchase Program (ESPP) and matching donations
Flexible time off and family leave programs
Access to professional development and growth opportunities
Inclusive and accessible work environment supporting diverse talent

Equal Opportunity

ServiceNow is an equal opportunity employer. We are committed to fostering an inclusive workplace where all qualified applicants receive equal consideration for employment regardless of race, color, creed, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other protected category by law.

United States

Sprachkenntnisse

English

Hinweis für Nutzer

Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klicken Sie auf „Jetzt Bewerben“, um Ihre Bewerbung direkt auf deren Website einzureichen.

Jetzt Bewerben