Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Machine Learning Engineering Manager - Evaluations
Senior Machine Learning Engineer - Model Evaluations, Public Sector
Scale AIUnited StatesSenior Machine Learning Engineer - Model Evaluations, Public SectorThe Public Sector ML team at Scale deploys advanced AI systems-including LLMs, agentic models, and multimodal pipelines-into mission-
Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco
PlaudUnited StatesPlaud Inc.Plaud is building the world's most trusted AI work companion for professionals to elevate productivity and performance through note-taking solutions, loved by over 1,500,000 users worldwide
Manager of Machine Learning Engineering
Virtual Vocations IncUnited StatesLeading a team of Forward-Deployed Machine Learning Engineers, the remote Manager of Machine Learning Engineering will manage the delivery of customer-facing AI and ML solutions from prototyping to pr
Senior Machine Learning Engineering Manager
AtexoUnited StatesOverviewWorking at AtlassianAtlassians can choose where they work – whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, p
Tech Lead/Manager, Machine Learning Research Scientist- LLM Evals
Scale AIUnited StatesAs the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leading
Principal Machine Learning Engineer, Content Engineering
Paramount Global ServicesUnited StatesPrincipal Machine Learning Engineer, Content EngineeringTechnology New York Full-Time Fully Remote #WeAreParamount on a mission to unleash the power of content… you in? We've got the brands, we've got
AI & Machine Learning Engineering Consultant - Manager - Consulting - Location OPEN
Ernst & Young OmanUnited StatesLocation: Anywhere in Country At EY, we’re all in to shape your future with confidence. We’ll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you wan
Machine Learning Manager, Security & Research
Intuition MachinesUnited StatesIntuition Machines security products are used at scale by category leaders in every industry. You are probably familiar with our best-known product, the hCaptcha security suite.Our approach is simple:
Applied Machine Learning Engineer II - Advanced Engineering & Technology
Milwaukee ToolUnited StatesApplicants must be authorized to work in the U.S.; Sponsorship is not available for this position at this time. INNOVATE WITHOUT BOUNDARIES! At Milwaukee Tool we firmly believe that our People and our
AI & Machine Learning Engineering Consultant - Senior - Consulting - Location OPEN
Ernst & Young OmanUnited StatesLocation: Anywhere in Country At EY, were all in to shape your future with confidence. Well help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want
Sr. Director, Machine Learning Engineering (Remote-Eligible)
Capital OneUnited StatesSr. Director, Machine Learning Engineering (Remote-Eligible)OverviewAt Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an i
Manager, Machine Learning Research Scientist, GenAI
Scale AIUnited StatesScale AI accelerates the development of AI systems by providing the data, infrastructure, and tooling that power the most advanced models in the world. Our teams operate at the intersection of cutting
Senior Machine Learning Engineer, Content Engineering
Paramount Global ServicesUnited StatesSenior Machine Learning Engineer, Content EngineeringTechnology New York Full-Time Fully Remote #WeAreParamount on a mission to unleash the power of content… you in? We've got the brands, we've got th
On Call RN - $15,000 Sign-On Bonus or Student Loan Assistance!
MJHSNew YorkOur groundbreaking hospice and palliative care programs offer a significant difference when dealing with a life-limiting condition. We offer a broad range of services in the community or facility-base
On-Site Clinical Supervisor
Psycle On WellnessBaltimorePsycle On Wellness is seeking an On-Site Clinical Supervisor to lead a team of behavioral health professionals. The role is critical to ensuring high quality mental health services are provided to you
In Home Healthcare LVN:Full Time/Part Time Days
Aveanna HealthcareDilleyJoin a Company That Puts People First!Licensed Practical / Vocational Nurse – LPN/LVNOur local office is looking for a team of compassionate nurses to provide care for a very special client/patient. H
Home Health Registered Nurse RN Salaried Full Time
Aveanna HealthcareHardeevilleRegistered Nurse (Home Health)At Aveanna, we believe the best care happens at home—and that great outcomes start with supporting the nurses who deliver that care. When you join Aveanna’s Home Health t
Home Health Full Time Salaried RN 10K Sign On Bonus
Aveanna HealthcareVeronaRegistered Nurse (Home Health)At Aveanna, we believe the best care happens at home—and that great outcomes start with supporting the nurses who deliver that care. When you join Aveanna’s Home Health t
Home Health Licensed Practical Nurse LPN Full Time
Aveanna HealthcareWest Des MoinesMake a Real Difference—One Patient at a Time Now Hiring: Licensed Practical Nurse (LPN), Home Health Full-Time| Monday - Friday visits | Territory: Polk, Warren and DallasThe Licensed Practical Nurse
Home Health Registered Nurse RN Full Time
Aveanna HealthcareSaint PaulPosition Overview: The Registered Nurse – Home Health is responsible for providing and documenting skilled nursing care in accordance with the developed care plan and physicians’ orders for each indiv
Medical Science Liaison, Neuro-Oncology, Mid-Atlantic
Jazz PharmaceuticalsRichmondIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Medical Science Liaison, Neuro-Oncology, Mid-Atlantic
Jazz PharmaceuticalsDoverIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Medical Science Liaison, Neuro-Oncology - Central
Jazz PharmaceuticalsIndianapolisIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Medical Science Liaison, Neuro-Oncology - Central
Jazz PharmaceuticalsLansingIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Medical Science Liaison, Oncology (Northern CA, OR, WA, MT, ID)
Jazz PharmaceuticalsBoise CityIf you are a current Jazz employee please apply via the Internal Career site. Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and th
Senior Machine Learning Engineer - Model Evaluations, Public Sector
- United States
- United States
Über
The Public Sector ML team at Scale deploys advanced AI systems-including LLMs, agentic models, and multimodal pipelines-into mission-critical government environments. We build evaluation frameworks that ensure these models operate reliably, safely, and effectively under real-world constraints. As an ML Engineer, you will design, implement, and scale automated evaluation pipelines that help customers trust and operationalize advanced AI systems across defense, intelligence, and federal missions.
You will:
Develop and maintain automated evaluation pipelines for ML models across functional, performance, robustness, and safety metrics, including LLM-judge-based evaluations.
Design test datasets and benchmarks to measure generalization, bias, explainability, and failure modes.
Build evaluation frameworks for LLM agents, including infrastructure for scenario-based and environment-based testing.
Conduct comparative analyses of model architectures, training procedures, and evaluation outcomes.
Implement tools for continuous monitoring, regression testing, and quality assurance for ML systems.
Design and execute stress tests and red-teaming workflows to uncover vulnerabilities and edge cases.
Collaborate with operations teams and subject matter experts to produce high-quality evaluation datasets.
Comfortable with light travel (approximately 10%) for customer interaction and team needs.
This role will require an active security clearance or the ability to obtain a security clearance.
Ideally you'd have:
Experience in computer vision, deep learning, reinforcement learning, or NLP in production settings.
Strong programming skills in Python; experience with TensorFlow or PyTorch.
Background in algorithms, data structures, and object-oriented programming.
Experience with LLM pipelines, simulation environments, or automated evaluation systems.
Ability to convert research insights into measurable evaluation criteria.
Nice to haves:
Graduate degree in CS, ML, or AI.
Cloud experience (AWS, GCP) and model deployment experience.
Experience with LLM evaluation, CV robustness, or RL validation.
Knowledge of interpretability, adversarial robustness, or AI safety frameworks.
Familiarity with ML evaluation frameworks and agentic model design.
Experience in regulated, classified, or mission-critical ML domains.
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $240,450—$300,300 USD Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of Washington DC, Texas, Colorado, Hawaii is: $216,300—$269,850 USD
PLEASE NOTE:
Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us:
At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta,
Ernst
&
Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.
We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.
We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's
Know Your Rights poster
for additional information.
We comply with the United States Department of Labor's
Pay Transparency provision
.
PLEASE NOTE:
We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants' needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.