Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Machine Learning Engineer, ML/GenAI Evaluation
Machine Learning Engineer, ML/GenAI Evaluation
AppleSan DiegoMachine Learning Engineer, ML/GenAI Evaluation San Diego, California, United States Software and ServicesWould you like to contribute to Machine Learning and Generative AI technologies? Are you passio
Fluids GSE Design Engineer III
Firefly Aerospace Inc.AustinAt Firefly, we’re focused on making space attainable for everyone – including you! Each team member’s passion, dedication, and innovative ideas have a direct impact on fueling our successful trajector
Retail Sales Associate - Customer Experience & Merchandising
Rack Room ShoesAustinJob Description SummaryAdhere to and practice the company’s service standards with each customer. Meet both sales and work goals as directed by members of store management, while meeting the Policies
Senior Manager - Inside Sales (Electrical Construction)
WESCOAustinThe Senior Inside Sales Manager is responsible for leading Inside Sales Managers (ISMs) and/or partnering with Customer Service Managers (CSMs) to provide strategic direction and drive accountability
Optician | Frame Eyewear Stylist & Sales Associate
The Lean Way Consulting LLCAustinCompany DescriptionOptician, Sales Associate, & Frame StylistAre you an intelligent , detail-oriented , and fashion-savvy Optician with a passion for delivering truly exceptional customer experiences
RETAIL - WINE SALES - FT (50695)
Spec's Wines Spirits and Finer FoodsAustinSummary The individuals who perform this job provide extraordinary shopping experiences to our guests and stock merchandise. Essential Duties and Responsibilities Including, but not limited to, the fo
SAFe Release Train Lead & DevOps Architect (Remote)
AntlerAustinAntler is seeking a Release Train Engineer / DevOps Systems Lead to join a 100% remote team supporting a federal government project. The ideal candidate will lead the Agile Release Train and oversee a
Regional Sales Director: New Business Growth
Compass Group, North AmericaAustinCanteen, a division of Compass Group, North America, is seeking a Regional Sales Director in the Austin, TX market. This role emphasizes new business development with great earning potential exceeding
Senior Frontend Engineer (React) Austin, TX, USA · remote · full-time · senior $135k – $180k /[...]
CEDX SystemsAustinAt CEDX Systems we build AI workflow automation for the world's most demanding professional services teams. As a senior frontend engineer (react) on our engineering team, you'll own meaningful outcome
Strategic OEM Sales Lead, Life Sciences & Diagnostics
ThermoFisher ScientificAustinThermoFisher Scientific is seeking an experienced sales professional to join their OEM Sales team in Austin, Texas. In this role, you’ll leverage a comprehensive portfolio to drive strategic growth in
Salesforce Developer
Commerce.com US, Inc.AustinOverviewCommerce is looking for a Salesforce Developer to join our Go To Market (GTM) Business Applications team. You will help build, maintain, and improve the systems that power our sales, marketing
PMTS Test Engineer - Hardware & Software Validation
Advanced Micro DevicesAustinAdvanced Micro Devices is hiring a PMTS Test Engineer in Austin, Texas. The role involves researching, designing, and testing electronic components for semiconductor manufacturing, making use of vario
Strategic Enterprise Tech Sales Executive
UtilicomAustinUtilicom is looking for an experienced Enterprise Account Executive in Austin, Texas, to handle outside sales to enterprise-level customers for internet and telephone services. Responsibilities includ
Enterprise Solutions Engineer: Microsoft Stack & AI-Driven Demos
Togetherwork Holdings, LLCAustinTogetherwork Holdings, LLC is seeking a technical resource for its Enterprise Sales team. You will partner with sales executives to deliver product demonstrations and respond to customer needs in the
Counter Sales - Austin, TX (8103)
Arnold Oil CompanyAustinPosition Overview This position reports directly to the store manager. Counter I is required to perform any or all duties of a General Store Staff as assigned by management (using a company provided v
Remote Part-Time Salesforce Developer for Nonprofits
News Revenue HubAustinThe News Revenue Hub is seeking a Salesforce Developer for a part-time remote contract. This role will involve customizing Salesforce and providing support for various internal needs while working clo
DIRECTOR OF REGIONAL SALES - AUSTIN, TX - REMOTE
Compass Group, North AmericaAustinCanteenAbout Canteen: Canteen brings break time to everyone. We combine food, service, and experience backed by industry-leading technology to help companies create a better workplace and connect thei
Junior Data Scientist
Cushman & WakefieldAustinJob TitleJunior Data Scientist Job Description SummaryThis role sits at the intersection of real estate economics, urban analysis, and data science. The Junior Data Scientist will support the developm
Architect: Design Lead for Collaborative Projects
TradeJobsWorkforceAustinTradeJobsWorkforce is seeking an Architect in Austin, Texas, responsible for researching, designing, and administering building projects for clients. This role involves producing conceptual plans and
SDE II: AI-Driven Web & Mobile Services
Expedia GroupAustinExpedia Group is looking for a Software Development Engineer II located in Austin, Texas. This role involves designing and developing servicing experiences across web and native applications, with a f
Hardware Engineer
Cisco Systems, Inc.AustinThe application window is expected to close on: 07/23/2026Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received.This role requires the emp
Senior Electrical Engineer: Design, Lead Projects & Global Travel
Applied Materials, Inc.AustinApplied Materials, Inc. is seeking an experienced Electrical Engineer in Austin, Texas. You will be responsible for designing and troubleshooting electrical engineering assemblies while performing ana
Senior DevOps Engineer: Cloud Automation & Scale
SquareDomainAustinSquareDomain is seeking a skilled Dev Ops Engineer in Austin, TX, who will troubleshoot and resolve technical issues for cloud services. This role is crucial in enhancing both newly developed and oper
Design Engineer - Renewable Energy
Flintco CareerAustinGreat work starts with great people. At Flintco, you’ll find respect, stability, and opportunity to grow your career. Established in 1908, Flintco maintains offices in 8 major cities and employs more
Logistics Security Analyst
Pinkerton Consulting & Investigations, Inc.AustinOverview170+ Years Strong. Industry Leader. Global Impact. At Pinkerton, the mission is to protect our clients. To do this, we provide enterprise risk management services and programs specifically des
Machine Learning Engineer, ML/GenAI Evaluation
- San Diego, California, United States
- San Diego, California, United States
Über
Would you like to contribute to Machine Learning and Generative AI technologies? Are you passionate about measuring what matters and ensuring AI systems work reliably for everyone? Do you believe that rigorous evaluation — including holding models accountable to fairness standards — is what separates great ML from good ML? We truly believe it is! We are defining what exceptional looks like for machine learning across Wallet, Payments, and Commerce. As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation criteria, metrics frameworks, and quality standards that determine when models are ready to reach hundreds of millions of users. Your judgment shapes model quality and earns the confidence to ship. You'll work at the intersection of rigorous ML science and high-impact product decisions, collaborating closely with ML Engineering, Product, Privacy, and Legal teams. This unique opportunity puts you at the center of model quality — designing adversarial test strategies, surfacing failure modes before they reach users, and owning the sign-off process that ensures Apple's financial features meet the highest bar for accuracy, robustness, and reliability.
Description The ideal candidate is a rigorous, curious ML practitioner who believes that how you measure a model is just as important as how you train it. You think critically about what metrics actually capture, know how models break in the real world, and hold quality standards others find uncomfortably high — including on dimensions like fairness. You will own the full evaluation lifecycle for ML models across Wallet features — designing test frameworks, adversarial corpora, and benchmarks that reflect the diversity of Apple's global user base, then making the final quality call before any model ships. Your findings directly shape model development priorities and product decisions at scale.
Responsibilities
Define evaluation criteria and quality metrics for ML models powering Wallet features
Design and maintain structured test sets covering the full diversity of real-world scenarios — varied document formats, distributions, languages, edge cases, and adversarial inputs.
Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution generalization, temporal drift, and aggressor scenarios
Own fairness evaluation end-to-end — define fairness metrics appropriate to each Wallet feature, build bias test suites across protected attributes and user populations, measure disparate performance across subgroups, and gate model launches on fairness criteria with the same rigor as other conventional metrics.
Build user persona–stratified benchmarks that reflect the breadth of Wallet's global user population across spending patterns, locales, and document types
Evaluate generative and agentic model outputs — assessing hallucination rates, faithfulness, and groundedness using LLM-as-a-judge frameworks, human evaluation protocols, and prompt regression testing
Own model quality sign-off — establish the launch criteria, run final evaluations, and make the call on model readiness before any feature ships
Synthesize evaluation results into clear, actionable insights that guide model development priorities and product decisions
Partner with ML engineers and Quality engineers to identify failure modes early in the development cycle and close the loop between evaluation findings and model improvements
Establish and evangelize evaluation best practices across the Wallet ML team, raising the quality bar for how models are tested, monitored, and maintained post-launch
Minimum Qualifications
M.S. in Machine Learning, Computer Science, Statistics, Applied Mathematics, or a related technical field strongly preferred.
Bachelor's degree with 7+ years hands‑on experience in ML evaluation, model quality, or applied research will be considered
5+ years of hands‑on ML experience, with deep expertise in model evaluation, offline metrics design, and behavioral testing
Strong track record designing evaluation frameworks for production ML systems — not just accuracy/F1, but precision‑recall tradeoffs, calibration, fairness, and task‑specific quality dimensions
Creative mindset with the ability to translate standard ML evaluation metrics (F1, AUC, etc.) into utility and user trust measures
Experience testing for distribution shift, out‑of‑distribution generalization, and temporal drift in real‑world deployed models
Proven ability to construct adversarial test suites, aggressor scenarios, and edge‑case corpora that surface model failure modes before they reach users
Experience with structured and semi‑structured document understanding, OCR pipelines, or financial data extraction is a strong plus
Strong programming skills in Python; fluency with evaluation tooling, data pipelines, and experiment tracking (e.g., MLflow, W&B, or equivalent)
Excellent communication skills — ability to translate metric results into product‑quality narratives for engineering and executive audiences
Experience owning model quality sign‑off in a cross‑functional launch process
Preferred Qualifications
PhD in Computer Science, Data Science, Statistics, AI/ML, or a related field.
Experience with Bayesian or causal graph‑based approaches to data generation.
Experience with causal approaches to fairness evaluation — counterfactual fairness, causal Shapley values, or structural causal model‑based bias auditing.
Experience evaluating models under privacy constraints or on‑device inference settings is a plus.
Familiarity with confidence calibration techniques and uncertainty quantification a plus
Background in financial services, fintech, or consumer payment products
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $171,600 and $302,200, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Learn about reasonable accommodations for job applicants
Apple accepts applications to this posting on an ongoing basis.
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.