Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Machine Learning Engineer, ML/GenAI Evaluation
Machine Learning Engineer, ML/GenAI Evaluation
AppleSan DiegoMachine Learning Engineer, ML/GenAI Evaluation San Diego, California, United States Software and ServicesWould you like to contribute to Machine Learning and Generative AI technologies? Are you passio
Remote Lead Financial Analyst - AI Trainer ($50-$60 per hour)
Data AnnotationSan DiegoDataAnnotation is committed to creating high-quality AI. Join our team to help train the next generation of AI while enjoying the flexibility of remote work and the freedom to set your own schedule. T
Remote Financial Analyst - AI Trainer ($50-$60 per hour)
Data AnnotationSan DiegoDataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule. This is an opportunity to work with us as an independent contrac
Broadcast Sales Coordinator: Campaign & Partnerships Support
Pomona CollegeSan DiegoPomona College in San Diego is looking for a full-time Sales Coordinator to support Account Executives with proposal development, campaign tracking, and reporting. Candidates should have at least 2 ye
Sales Coordinator
Pomona CollegeSan Diegoand the job listing Expires on June 29, 2026Local Media San Diego, LLC is seeking aSales Coordinatorwith a minimum of 2 years experience, preferably in radio, television, or broadcasting industry. We
Senior Interventional Cardiology Sales Executive
TeleflexSan DiegoTeleflex in San Diego is looking for a Senior Sales Representative to drive sales growth and expand clinical adoption of interventional cardiology products. The candidate will leverage strong relation
Neuroscience Regional Sales Manager - South Pacific Region
Teva PharmaceuticalsSan DiegoOur Team, Your Impact Teva is searching for a Regional Sales Manager (RSM) to join our Neuroscience sales team in the South Pacific Region. The RSM will be responsible for achieving and delivering res
Lead PCB Design Engineer - HDI, RF & Flex
Harris Geospatial SolutionsSan DiegoHarris Geospatial Solutions in San Diego is seeking a Lead Electrical Engineer specialized in Printed Circuit Board Design. The ideal candidate will utilize advanced design software to create and vali
Sales Manager (Part Time)
Carter's Retail Inc.San DiegoIf you are a CURRENT Carter’s employee, do not apply via this external application. Search "Browse Jobs" in Workday to apply internally. As a Part Time Sales Manager, you will be the first face of the
B2B / Event Sales/Brand Ambassador roles
TOMASan DiegoCompany Description TOMA, The Offline Marketing Agency, specializes in reaching audiences where digital channels cannot, creating meaningful in‑person connections. What started as a small, energetic t
Outside Sales Representative Sales - Outside Sales
Builders FirstSource, Inc.San DiegoPurpose Demonstrates in-depth knowledge experience and skills to effectively represent the company with the largest or most complex customers. Understand customer’s needs and identifies products and s
Principal Mechanical Design Engineer
USA-Medtronic MiniMed, Inc 1017San DiegoAbout the RoleMiniMed is looking for a Principal Mechanical Design Engineer for our San Diego office. The Principal Mechanical Design Engineer leads the development of new electromechanical medical de
Regional Clinical Sales Specialist (San Diego, CA) - Johnson & Johnson MedTech - Orthopaedics
Johnson & Johnson MedTechSan DiegoJob Overview Regional Clinical Sales Specialist – MedTech Sales, Inside Sales (No Commission) – San Diego, California. The role is responsible for advancing the company’s sales of orthopedic surgical
Data Engineer: Build Enterprise Data Pipelines on Fabric
Bumble Bee FoodsSan DiegoBumble Bee Foods in San Diego is seeking a Data Engineer responsible for designing and building an enterprise data platform powering analytics and reporting. This role requires strong expertise in dat
Vice President, Software Engineering
MedImpact Healthcare Systems, Inc.San Diego**Summary**The Vice President, Software Engineering is responsible for all company technology and technological resources, including the buildout of an engineering team. The VP of Software Engineering
Southern CA Regional Director of Sales & Marketing
Oakmont Management GroupSan DiegoOakmont Management Group seeks a Regional Director of Sales and Marketing for Southern California to lead sales efforts across their senior living communities. This dynamic role involves overseeing hi
Website Content & Analytics Sr Manager
FreemanSan DiegoSummary The Senior Manager, Website Content & Analytics is a strategic and hands‑on digital operations leader responsible for the performance, health, and visibility of The Freeman Company's brand web
Senior Mechanical Design Lead — Autonomous UAVs (Equity)
jobr.proSan DiegoShield AI is hiring an Engineering Manager of Mechanical Design in San Diego, California. This role involves leading a team of engineers in developing mechanical systems for cutting-edge autonomous UA
Real-Time Embedded Software Engineer (Link 16) - Aviation & Defense
PDS Tech CommercialSan DiegoPDS Tech Commercial is seeking an Embedded Software Engineer in San Diego, CA to design and develop high-performance software systems for aviation and defense. This role involves working on tactical d
Bilingual Mandarin Sales Coordinator and IVF Coordinator
Reproductive Sciences Medical CenterSan DiegoReproductive Sciences Medical Center (RSMC) is a leading fertility organization offering comprehensive family-building services under one roof. Our mission is to provide an exceptional experience for
Senior Web Content & Analytics Lead
FREEMANSan DiegoThe Freeman Company is seeking a Senior Manager for Website Content & Analytics to oversee and optimize digital performance across several brand web properties. The role requires strong SEO knowledge
Business Analyst II
ICW GroupSan DiegoAt ICW Group, we are hiring team members ready to use their skills and curiosity to help transform the insurance carrier space.Purpose of the Job The purpose of this job is to facilitate business and
Inside Sales Representative
Sedona Staffing San DiegoSan DiegoInside Sales Representative We're seeking a driven, enthusiastic Inside Sales professional to join a fast-paced, growth-focused team. Here, you'll be the front line of engagement - connecting with pro
Nurse / LVN/LPN Job in San Diego, California / Government
CaliforniaSan DiegoI believe that better care begins at home. Compassionate care, uncompromising service and clinical excellence thats what our patients have come to expect from our clinicians. Kindred at Home, a divisi
Licensed Clinical Psychologist
Rula HealthSan DiegoNone
Machine Learning Engineer, ML/GenAI Evaluation
- San Diego, California, United States
- San Diego, California, United States
Über
Would you like to contribute to Machine Learning and Generative AI technologies? Are you passionate about measuring what matters and ensuring AI systems work reliably for everyone? Do you believe that rigorous evaluation — including holding models accountable to fairness standards — is what separates great ML from good ML? We truly believe it is! We are defining what exceptional looks like for machine learning across Wallet, Payments, and Commerce. As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation criteria, metrics frameworks, and quality standards that determine when models are ready to reach hundreds of millions of users. Your judgment shapes model quality and earns the confidence to ship. You'll work at the intersection of rigorous ML science and high-impact product decisions, collaborating closely with ML Engineering, Product, Privacy, and Legal teams. This unique opportunity puts you at the center of model quality — designing adversarial test strategies, surfacing failure modes before they reach users, and owning the sign-off process that ensures Apple's financial features meet the highest bar for accuracy, robustness, and reliability.
Description The ideal candidate is a rigorous, curious ML practitioner who believes that how you measure a model is just as important as how you train it. You think critically about what metrics actually capture, know how models break in the real world, and hold quality standards others find uncomfortably high — including on dimensions like fairness. You will own the full evaluation lifecycle for ML models across Wallet features — designing test frameworks, adversarial corpora, and benchmarks that reflect the diversity of Apple's global user base, then making the final quality call before any model ships. Your findings directly shape model development priorities and product decisions at scale.
Responsibilities
Define evaluation criteria and quality metrics for ML models powering Wallet features
Design and maintain structured test sets covering the full diversity of real-world scenarios — varied document formats, distributions, languages, edge cases, and adversarial inputs.
Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution generalization, temporal drift, and aggressor scenarios
Own fairness evaluation end-to-end — define fairness metrics appropriate to each Wallet feature, build bias test suites across protected attributes and user populations, measure disparate performance across subgroups, and gate model launches on fairness criteria with the same rigor as other conventional metrics.
Build user persona–stratified benchmarks that reflect the breadth of Wallet's global user population across spending patterns, locales, and document types
Evaluate generative and agentic model outputs — assessing hallucination rates, faithfulness, and groundedness using LLM-as-a-judge frameworks, human evaluation protocols, and prompt regression testing
Own model quality sign-off — establish the launch criteria, run final evaluations, and make the call on model readiness before any feature ships
Synthesize evaluation results into clear, actionable insights that guide model development priorities and product decisions
Partner with ML engineers and Quality engineers to identify failure modes early in the development cycle and close the loop between evaluation findings and model improvements
Establish and evangelize evaluation best practices across the Wallet ML team, raising the quality bar for how models are tested, monitored, and maintained post-launch
Minimum Qualifications
M.S. in Machine Learning, Computer Science, Statistics, Applied Mathematics, or a related technical field strongly preferred.
Bachelor's degree with 7+ years hands‑on experience in ML evaluation, model quality, or applied research will be considered
5+ years of hands‑on ML experience, with deep expertise in model evaluation, offline metrics design, and behavioral testing
Strong track record designing evaluation frameworks for production ML systems — not just accuracy/F1, but precision‑recall tradeoffs, calibration, fairness, and task‑specific quality dimensions
Creative mindset with the ability to translate standard ML evaluation metrics (F1, AUC, etc.) into utility and user trust measures
Experience testing for distribution shift, out‑of‑distribution generalization, and temporal drift in real‑world deployed models
Proven ability to construct adversarial test suites, aggressor scenarios, and edge‑case corpora that surface model failure modes before they reach users
Experience with structured and semi‑structured document understanding, OCR pipelines, or financial data extraction is a strong plus
Strong programming skills in Python; fluency with evaluation tooling, data pipelines, and experiment tracking (e.g., MLflow, W&B, or equivalent)
Excellent communication skills — ability to translate metric results into product‑quality narratives for engineering and executive audiences
Experience owning model quality sign‑off in a cross‑functional launch process
Preferred Qualifications
PhD in Computer Science, Data Science, Statistics, AI/ML, or a related field.
Experience with Bayesian or causal graph‑based approaches to data generation.
Experience with causal approaches to fairness evaluation — counterfactual fairness, causal Shapley values, or structural causal model‑based bias auditing.
Experience evaluating models under privacy constraints or on‑device inference settings is a plus.
Familiarity with confidence calibration techniques and uncertainty quantification a plus
Background in financial services, fintech, or consumer payment products
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $171,600 and $302,200, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Learn about reasonable accommodations for job applicants
Apple accepts applications to this posting on an ongoing basis.
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.