Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Cost Engineer/Data Scientist
Senior AI Data Engineer/ Data Scientist
BillenniumPolandBillennium is a global technology company with over 20 years of experience, committed to innovation and empowering businesses. As an employer, we offer a supportive, growth-focused environment where c
Data Scientist
CompunnelSunnyvaleWe are seeking a highly skilled Data Scientist to design and implement predictive models using high-dimensional, real-time datasets. The ideal candidate will apply advanced machine learning and data m
Data Scientist
XpheriumSan DiegoCompany Description Xpherium is an AI-powered business intelligence platform that enables companies to turn data into strategic insights. We specialize in advanced analytics, predictive forecasting, i
Data Scientist
Boeing Employees Credit UnionNew YorkIs it surprising to hear that a financial institution of 1.5 million members and over $30 billion in managed assets say that success comes from focusing on people, not profits? Our "people helping peo
Strategic AI Data Scientist & ML Engineer
PowerToFlyPhoenixPowerToFly is hiring a Data Scientist to work in Arizona. In this role, you will define data strategy and drive AI development, working closely with clients to deliver effective solutions. You are exp
Lead Data Scientist
LexisNexisRaleighThis job is with LexisNexis Legal & Professional®, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter di
Data Engineer/Scientist - AI, ML & Submarine Systems
United Cerebral Palsy of GeorgiaBellinghamSerco Inc. is seeking a Data Engineer/Scientist to support U.S. Navy's Team Submarine Program Offices at the Washington Navy Yard. This role involves participation in technical project management and
Machine Learning - Data Scientist
AppleSunnyvaleDo you have a passion for computer vision and solving deep learning problems? The Video Engineering Data Analytics and Quality group is seeking an expert in evaluating machine learning and deep learni
Remote Data Scientist
Micro1CasselberryData Scientist Job Type: ContractorLocation: RemoteJob Summary Join our client’s team as a Data Scientist and play a pivotal role in transforming data into actionable insights that drive business grow
Data Engineer/Scientist for Navy Submarine Program
SercoHighland BeachPosition Description & Qualifications If you love high profile and challenging projects supporting the US Navy- Serco has a great opportunity for you!Serco has an exciting opportunity for a Data Engin
Lead Data Scientist
Altak Group Inc.AnnapolisJob Title: Lead Data Scientist – Healthcare (AI Architecture Focus) Location: EAST coast (REMOTE)Required Qualifications • 7+ years of experience in data science, with recent experience in a Lead Data
Sr Data Scientist
PayPalScottsdaleJob Title This job will lead the development and implementation of advanced data science models and algorithms. You will work with stakeholders to understand requirements and deliver solutions. Your r
Lead Data Scientist
Altak Group Inc.GermantownJob Title: Lead Data Scientist – Healthcare (AI Architecture Focus) Location: EAST coast (REMOTE)Required Qualifications • 7+ years of experience in data science, with recent experience in a Lead Data
Principal Data Scientist
Citizens BankBostonSenior Data Scientist Citizens Financial Group, Inc. (CFG) seeks a Senior Data Scientist for its Boston, Massachusetts location.Duties Analyze credit risk valuation models, correlations, concentration
Senior Data Scientist/ML Engineer - TS/SCI Poly
CathexisfederalFalls ChurchTeam CATHEXIS elevates the government contracting experience through rapid response, deep skill, and thoughtful problem‑solving and communication. Our core capabilities are our top-tier program and pr
Clinical Data Scientist/ Methodologist
SanofiBridgewaterJob title: Clinical Real World Data Scientist/ MethodologistLocation: US Bridgewater / MorristownAbout the Job Join the team protecting half a billion lives every year with next-gen science, mRNA inno
Data Scientist / Project Manager
PeratonNew YorkResponsibilities Peraton is seeking an experienced Data Scientist / Project Manager to lead a high-performing team of engineers, data scientists, and analysts in direct support of U.S. Army Cyber Comm
Senior Data Scientist II
LexisNexis(LNLP)RaleighAbout the Team:LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of RELX (www.relx.com), a global provider of information-base
Lead Insurance Data Scientist
Independence American Insurance CompanyNew YorkEstablished in 2021, Independence Pet Holdings is a corporate holding company that manages a diverse and broad portfolio of modern pet health brands and services, including insurance, pet education, l
Senior Data Scientist II
LexisNexis(LNLP)RaleighAbout our Team:LexisNexis Legal & Professional, serving customers in over 150 countries with 11,800 employees worldwide, is part of RELX, a global provider of information-based analytics and decision
Remote Data Scientist / ML Engineer - Impactful, Fast-Paced
Common RoomNew YorkCommon Room is looking for a Data Scientist/Engineer to join their team remotely. The role focuses on working at the intersection of data science and software while helping customers derive insights f
Advisor, Data Scientist - CMC Data Products
Dormont Manufacturing CompanyIndianapolisAt Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work
Manager, Data Scientist - Partnerships Acquisitions
Capital OneNew YorkManager, Data Scientist - Partnerships Acquisitions Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer
Manager, Data Scientist - Partnerships Acquisitions
Capital OneChicagoManager, Data Scientist - Partnerships Acquisitions Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer
Instrument Design Engineer / Scientist
The American Physical SocietySunnyvaleThe Company You probably know Stanford Research Systems (SRS) from our 40+ year history designing and manufacturing test equipment for research. Products include lock-in amplifiers, atomic clocks, mas
Senior AI Data Engineer/ Data Scientist
- Poland, Ohio, United States
- Poland, Ohio, United States
Über
About the Role: We are looking for a Senior AI Data Engineer / Data Scientist who can turn messy enterprise data into AI-ready, high-quality knowledge assets.
You will lead the cleanup, preparation, and enrichment of unstructured content (SharePoint/document repositories) and structured/semi-structured data (data lakes, databases) so our agents, copilots, and RAG systems are accurate, trustworthy, and scalable.
This is a senior, hands-on role. You will own data quality outcomes end-to-end: discovery - cleanup - enrichment - ingestion - refresh cycles - governance. We value AI-native generalists who can remove bottlenecks by working directly with AI Engineers, Architects, and business stakeholders to decide what data is worth using and how to structure it for retrieval and reasoning.
Our standardized stack includes (and this role actively uses it): ingestion/ETL foundations, Postgres + pgvector as default RAG store, Redis caching, LLM gateway patterns, Langfuse observability, DeepEval/RAGAS evaluation, and Presidio for PII detection/masking when required.
Must-have requirements:
5+ years in data engineering / applied data science / analytics engineering with ownership of production pipelines.
Proven experience working with unstructured enterprise data (documents, PDFs, Office files, wikis, knowledge bases).
Solid understanding of data quality engineering: validation, monitoring, lineage, refresh cycles.
Strong stakeholder skill : can work with business to define what data matters and what “good” looks like.
Nice to have:
Experience with Postgres + pgvector (or similar vector stores), retrieval optimization, and hybrid search concepts.
Familiarity with observability practices for AI pipelines and the use of RAG evaluation metrics (RAGAS-style).
Experience with governance tooling and privacy controls for enterprise AI (e.g., PII workflows).
What you will do:
Lead “data triage” for AI use cases: identify authoritative sources, duplicates, outdated content, and low-quality documents.
Clean, normalize, deduplicate, and standardize enterprise content at scale (documents, PDFs, Word/Excel, wiki pages, etc.).
Define what data should be excluded from AI systems (stale, contradictory, low-trust, or sensitive content).
Unstructured ingestion (SharePoint + document repositories)
Build robust ingestion pipelines for SharePoint and file repositories: parsing, text extraction, structure recovery, and metadata capture.
Implement document normalization strategies (naming, taxonomy, metadata standards, canonical IDs).
Design chunking strategies, metadata enrichment, and document structuring optimized for retrieval performance and cost.
Improve retrieval quality through practical techniques such as filtered retrieval and post-retrieval optimization where appropriate (e.g., reranking), collaborating with AI Engineers on the retrieval interface.
Prepare and maintain “AI-ready knowledge sets” that can be embedded and served via Postgres + pgvector (default).
Data quality, evaluation, and feedback loops (non-negotiable)
Define and implement data quality gates (freshness, completeness, relevance, dedupe rate, metadata coverage).
Partner with AI Engineers to evaluate retrieval and RAG performance using frameworks like RAGAS (answer correctness, context recall/precision) and to monitor trust metrics over time.
Establish human feedback loops where needed (review queues, sampling, targeted audits) to continuously improve data usefulness and user trust. Governance, privacy, and auditability
Apply privacy and enterprise constraints; where required, implement PII detection/masking using Presidio patterns.
Reuse Package reusable “data cleanup + RAG readiness” recipes: ingestion templates, metadata schemas, chunking playbooks, dedupe strategies.
Build a repeatable data foundation that accelerates future use cases (not a one-off cleanup project).
Our offer:
Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers.
Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location.
Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more.
Global collaboration - work with a diverse, international team.
Innovative environment part of a forward-thinking and growth-oriented workplace.
Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work.
Team-building events including our company tradition (annual company event in Mazury).
A pleasant surprise to start your journey with us in the form of a welcome pack.
Recruitment process:
HR call
Technical Interview
Final Interview
Decision/ Feedback
Sounds interesting? Click "Apply" and have a chance to hear more!
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.