Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Machine Learning Engineer, ML/GenAI Evaluation
Machine Learning Engineer, ML/GenAI Evaluation
AppleUnited StatesMachine Learning Engineer, ML/GenAI EvaluationWork Locations (3) Submit Resume Would you like to contribute to Machine Learning and Generative AI technologies? Are you passionate about measuring what
Machine Learning Engineer: Evaluation
Bedrock Robotics IncSan FranciscoJoin the team bringing advanced autonomy to the built world At Bedrock, we’re moving AI out of the lab and into the real world. Our team is composed of industry veterans who helped launch Waymo, scale
Machine Learning Engineer - AI & ML Evaluation Frameworks
AppleCupertinoCupertino, California, United StatesHardwareThe Health Sensing Machine Learning Interpretability & Analytics (MLIA) team ensures clinical rigor and contextual trust are at the foundation of Apple’s he
Senior Machine Learning Engineer, Simulation Evaluation
WaymoUnited StatesSenior Machine Learning Engineer, Simulation EvaluationWaymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driv
AIML - Sr Machine Learning Engineer, Evaluation
AppleCupertinoAIML - Sr Machine Learning Engineer, Evaluation Cupertino, California, United States Machine Learning and AIWe are seeking a highly skilled and experienced machine learning engineer to join AIML Evalu
Staff Machine Learning Platform Engineer, AI Evaluation
AppleUnited StatesStaff Machine Learning Platform Engineer, AI EvaluationJoin Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking a staff machine learning platform engineer
Senior Machine Learning Engineer - VLM/LLM Evaluation
Neura MarketMountain ViewWaymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building
Machine Learning Engineer- GenAI
AppleUnited StatesRole Number:200648705-3543SummaryImagine what you could do here. At Apple, we believe new insights have a way of becoming excellent products, services, and customer experiences very quickly. Bring pas
Remote | Machine Learning Systems Evaluation Engineer - Up to $90/hour
24-MAG LLCUnited StatesAbout the job Remote | Machine Learning Systems Evaluation Engineer - Up to $90/hourWe are sharing a specialised remote consulting opportunity for experienced machine learning engineers with strong co
Sr Machine Learning Engineer, Tech Lead Autograder Systems, Evaluation
AppleCupertinoSr Machine Learning Engineer, Tech Lead — Autograder Systems, Evaluation Cupertino, California, United States Machine Learning and AI We are looking for a Senior MLE Tech Lead to join a centralized ev
Machine Learning Engineer (GenAI)
Spector.aiMountain ViewRole Description We are seeking an experienced GenAI engineer to join our seasoned founding team to drive the development and innovation of our GenAI platform. Ideal candidates bring experience in bui
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale AISan FranciscoAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Senior Machine Learning Engineer - GenAI Platform
CacheflowSan FranciscoP-984Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine‑tune, train and deploy custom AI models on their own data, for ma
Remote Senior Machine Learning Engineer, GenAI Security
grabjobsUnited StatesReddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale AINew YorkAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Senior Staff Machine Learning Engineer, GenAI Platform
TensecSpringfieldWho We Are The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure that powers recommendations, content discovery, user and content quantification, while direct
Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI
AI Chopping Block, Inc.New YorkRole Overview Scale’s rapidly growing Global Public Sector team is focused on using AI to address critical challenges facing the public sector around the world. Our core work consists of:Creating cust
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AINew YorkAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale AIUnited StatesStaff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAISan Francisco, CA; New York, NY AI is becoming vitally important in every function of our society. At Scale, our mission
Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI
Scale AISan FranciscoAI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, h
Sr Engineer, Machine Learning Engineering (ML Apps)
QualcommUnited StatesCompany: Qualcomm Technologies, Inc.Job Area: Engineering Group, Engineering Group > Machine Learning EngineeringGeneral Summary:Artificial Intelligence is changing the world for the benefit of human
Staff Machine Learning Engineer, ML Infrastructure
SimpliSafe Wireless Home SecurityUnited StatesAbout SimpliSafeWe're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here that car
Artificial Intelligence/Machine Learning (AI/ML) Engineer 2 TS/SCI w Poly
PeratonUnited StatesResponsibilities*]:pointer-events-auto R6Vx5W_threadScrollVars scroll-mb-[calc(var(-scroll-root-safe-area-inset-bottom,0px)+var(-thread-response-height))] scroll-mt-(-header-height)" dir="auto" data-t
Staff Machine Learning Engineer, ML Infrastructure
Unity TechnologiesUnited StatesSan Francisco, CA, USAStaff Machine Learning Engineer, ML InfrastructureLocationSan Francisco, CA, USADepartmentAI & Machine LearningRequisition IDJOBREQ-2615904Role descriptionThe opportunityUnity Ve
Staff Machine Learning Engineer - ML Training Infrastructure
General MotorsUnited StatesJob Description**The Role:**We are seeking an experienced, technically strong, impact-driven expert in ML Training Infrastructure with a demonstrated ability to lead through hands-on technical work. I
Über
Work Locations (3) Submit Resume Would you like to contribute to Machine Learning and Generative AI technologies? Are you passionate about measuring what matters and ensuring AI systems work reliably for everyone? Do you believe that rigorous evaluation — including holding models accountable to fairness standards — is what separates great ML from good ML? We truly believe it is! We are defining what exceptional looks like for machine learning across Wallet, Payments, and Commerce. As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation criteria, metrics frameworks, and quality standards that determine when models are ready to reach hundreds of millions of users. Your judgment shapes model quality and earns the confidence to ship. You'll work at the intersection of rigorous ML science and high-impact product decisions, collaborating closely with ML Engineering, Product, Privacy, and Legal teams. This unique opportunity puts you at the center of model quality — designing adversarial test strategies, surfacing failure modes before they reach users, and owning the sign-off process that ensures Apple's financial features meet the highest bar for accuracy, robustness, and reliability. Responsibilities
Define evaluation criteria and quality metrics for ML models powering Wallet features Design and maintain structured test sets covering the full diversity of real-world scenarios — varied document formats, distributions, languages, edge cases, and adversarial inputs. Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution generalization, temporal drift, and aggressor scenarios Own fairness evaluation end-to-end — define fairness metrics appropriate to each Wallet feature, build bias test suites across protected attributes and user populations, measure disparate performance across subgroups, and gate model launches on fairness criteria with the same rigor as other conventional metrics. Build user persona–stratified benchmarks that reflect the breadth of Wallet's global user population across spending patterns, locales, and document types Evaluate generative and agentic model outputs — assessing hallucination rates, faithfulness, and groundedness using LLM-as-a-judge frameworks, human evaluation protocols, and prompt regression testing Own model quality sign-off — establish the launch criteria, run final evaluations, and make the call on model readiness before any feature ships Synthesize evaluation results into clear, actionable insights that guide model development priorities and product decisions Partner with ML engineers and Quality engineers to identify failure modes early in the development cycle and close the loop between evaluation findings and model improvements Establish and evangelize evaluation best practices across the Wallet ML team, raising the quality bar for how models are tested, monitored, and maintained post-launch Minimum Qualifications
M.S. in Machine Learning, Computer Science, Statistics, Applied Mathematics, or a related technical field strongly preferred. Bachelor's degree with 7+ years hands-on experience in ML evaluation, model quality, or applied research will be considered 5+ years of hands-on ML experience, with deep expertise in model evaluation, offline metrics design, and behavioral testing Strong track record designing evaluation frameworks for production ML systems — not just accuracy/F1, but precision-recall tradeoffs, calibration, fairness, and task-specific quality dimensions Creative mindset with the ability to translate standard ML evaluation metrics (F1, AUC, etc.) into utility and user trust measures Experience testing for distribution shift, out-of-distribution generalization, and temporal drift in real-world deployed models Proven ability to construct adversarial test suites, aggressor scenarios, and edge-case corpora that surface model failure modes before they reach users Experience with structured and semi-structured document understanding, OCR pipelines, or financial data extraction is a strong plus Strong programming skills in Python; fluency with evaluation tooling, data pipelines, and experiment tracking (e.g., MLflow, W&B, or equivalent) Excellent communication skills — ability to translate metric results into product-quality narratives for engineering and executive audiences Experience owning model quality sign-off in a cross-functional launch process Preferred Qualifications
PhD in Computer Science, Data Science, Statistics, AI/ML, or a related field. Experience with Bayesian or causal graph-based approaches to data generation. Experience with causal approaches to fairness evaluation — counterfactual fairness, causal Shapley values, or structural causal model–based bias auditing. Experience evaluating models under privacy constraints or on-device inference settings is a plus. Familiarity with confidence calibration techniques and uncertainty quantification a plus Background in financial services, fintech, or consumer payment products Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant At Apple, we believe accessibility is a fundamental human right. You'll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong. Learn about accessibility in Apple's workplace Learn about reasonable accommodations for job applicants Apple accepts applications to this posting on an ongoing basis. Submit Resume Back to search results See all roles in Austin
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.