Senior AI Data Engineer / Data Scientist
- Poland, Ohio, United States
About
About the Role: We are looking for a Senior AI Data Engineer / Data Scientist who can turn messy enterprise data into AI-ready, high-quality knowledge assets.
You will lead the cleanup, preparation, and enrichment of unstructured content (SharePoint/document repositories) and structured/semi-structured data (data lakes, databases) so our agents, copilots, and RAG systems are accurate, trustworthy, and scalable.
This is a senior, hands-on role. You will own data quality outcomes end-to-end, from discovery through cleanup, enrichment, ingestion, and refresh cycles to governance. We value AI-native generalists who can remove bottlenecks by working directly with AI Engineers, Architects, and business stakeholders to decide what data is worth using and how to structure it for retrieval and reasoning.
Our standardized stack includes (and this role actively uses it): ingestion/ETL foundations, Postgres + pgvector as default RAG store, Redis caching, LLM gateway patterns, Langfuse observability, DeepEval/RAGAS evaluation, and Presidio for PII detection/masking when required.
Must-have requirements:
5+ years in data engineering / applied data science / analytics engineering with ownership of production pipelines.
Proven experience working with unstructured enterprise data (documents, PDFs, Office files, wikis, knowledge bases).
Solid understanding of data quality engineering: validation, monitoring, lineage, refresh cycles.
Strong stakeholder skills: can work with the business to define what data matters and what “good” looks like.
Nice to have:
Experience with Postgres + pgvector (or similar vector stores), retrieval optimization, and hybrid search concepts.
Familiarity with observability practices for AI pipelines and the use of RAG evaluation metrics (RAGAS-style).
Experience with governance tooling and privacy controls for enterprise AI (e.g., PII workflows).
What you will do:
Lead “data triage” for AI use cases: identify authoritative sources, duplicates, outdated content, and low-quality documents.
Clean, normalize, deduplicate, and standardize enterprise content at scale (documents, PDFs, Word/Excel, wiki pages, etc.).
Define what data should be excluded from AI systems (stale, contradictory, low-trust, or sensitive content).
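For illustration only, the duplicate-detection part of the triage described above could be sketched as follows. This is a minimal sketch, not the team's actual pipeline: the function names and the document-ID-to-text mapping are assumptions made for the example, and it only finds exact duplicates after whitespace and case normalization.

```python
import hashlib
import re


def normalize(text: str) -> str:
    """Collapse whitespace and lowercase so trivial formatting differences do not matter."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_duplicates(docs: dict[str, str]) -> dict[str, list[str]]:
    """Group document IDs whose normalized text is identical.

    `docs` maps a hypothetical document ID to its extracted text.
    Only groups with more than one member are returned, keyed by content hash.
    """
    groups: dict[str, list[str]] = {}
    for doc_id, text in docs.items():
        digest = hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()
        groups.setdefault(digest, []).append(doc_id)
    return {digest: ids for digest, ids in groups.items() if len(ids) > 1}


if __name__ == "__main__":
    sample = {
        "a": "Travel  Policy\n2024",
        "b": "travel policy 2024",
        "c": "Expense policy",
    }
    print(list(find_duplicates(sample).values()))
```

Exact hashing only catches copies that differ in case or whitespace. Near-duplicates, such as the same policy with one changed date, would need a similarity technique such as MinHash or embedding comparison.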
Unstructured ingestion (SharePoint + document repositories)
Build robust ingestion pipelines for SharePoint and file repositories: parsing, text extraction, structure recovery, and metadata capture.
Implement document normalization strategies (naming, taxonomy, metadata standards, canonical IDs).
Design chunking strategies, metadata enrichment, and document structuring optimized for retrieval performance and cost.
Improve retrieval quality through practical techniques such as filtered retrieval and post-retrieval optimization where appropriate (e.g., reranking), collaborating with AI Engineers on the retrieval interface.
Prepare and maintain “AI-ready knowledge sets” that can be embedded and served via Postgres + pgvector (default).
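As a rough illustration of the chunking and metadata enrichment mentioned above, the sketch below splits a document into overlapping word windows and attaches the document's metadata to every chunk so it can later be used for filtered retrieval. The `Chunk` class, the `chunk_words` function, and the default sizes are assumptions for this example, not a prescribed strategy. Real pipelines often chunk by tokens or document structure instead of words.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    doc_id: str     # canonical ID of the source document
    position: int   # index of the chunk within the document
    text: str
    metadata: dict  # copied from the document for filtered retrieval


def chunk_words(doc_id: str, text: str, metadata: dict,
                size: int = 200, overlap: int = 40) -> list[Chunk]:
    """Split text into windows of `size` words that share `overlap` words with the previous window."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    words = text.split()
    step = size - overlap
    chunks: list[Chunk] = []
    for position, start in enumerate(range(0, len(words), step)):
        window = words[start:start + size]
        chunks.append(Chunk(doc_id, position, " ".join(window), dict(metadata)))
        if start + size >= len(words):
            break  # this window already reaches the end of the document
    return chunks


if __name__ == "__main__":
    text = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_words("doc-1", text, {"source": "sharepoint"}, size=4, overlap=1):
        print(chunk.position, chunk.text)
```

Larger chunks with more overlap tend to preserve context at the price of more stored embeddings, which is the retrieval performance versus cost trade-off the role is expected to balance.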
Data quality, evaluation, and feedback loops (non-negotiable)
Define and implement data quality gates (freshness, completeness, relevance, dedupe rate, metadata coverage).
Partner with AI Engineers to evaluate retrieval and RAG performance using frameworks like RAGAS (answer correctness, context recall/precision) and to monitor trust metrics over time.
Establish human feedback loops where needed (review queues, sampling, targeted audits) to continuously improve data usefulness and user trust.
Governance, privacy, and auditability
Apply privacy and enterprise constraints; where required, implement PII detection/masking using Presidio patterns.
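To make the idea of PII masking concrete, here is a deliberately simplified stand-in. It does not use Presidio and does not reproduce its API. It only masks e-mail addresses with a regular expression, and the function name and placeholder are assumptions for the example. Production masking would rely on a dedicated tool such as Presidio, which covers many more entity types.

```python
import re

# Simplified e-mail pattern for illustration; it is not a full address validator.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def mask_emails(text: str, placeholder: str = "<EMAIL>") -> tuple[str, int]:
    """Replace e-mail addresses with a placeholder and report how many were masked."""
    return EMAIL.subn(placeholder, text)


if __name__ == "__main__":
    masked, count = mask_emails("Contact jan.kowalski@example.com or hr@example.org.")
    print(masked, count)
```

Returning the number of replacements alongside the masked text makes it easy to log how much sensitive content each document contained, which supports auditability.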
Reuse
Package reusable “data cleanup + RAG readiness” recipes: ingestion templates, metadata schemas, chunking playbooks, dedupe strategies.
Build a repeatable data foundation that accelerates future use cases (not a one-off cleanup project).
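The data quality gates listed above (freshness, dedupe rate, metadata coverage) could be expressed as simple ratios checked against thresholds. The sketch below is one possible shape, assuming each document is a dict with a `modified` date, a `content_hash`, and a few metadata fields. The field names, the required-field list, and the threshold values are all hypothetical.

```python
from datetime import date

REQUIRED_FIELDS = ("title", "owner", "source")  # hypothetical metadata schema


def quality_report(docs: list[dict], today: date, max_age_days: int = 365) -> dict[str, float]:
    """Compute freshness, metadata coverage, and duplicate share for a document set."""
    total = len(docs)
    if total == 0:
        return {"freshness": 0.0, "metadata_coverage": 0.0, "dedupe_rate": 0.0}
    fresh = sum((today - d["modified"]).days <= max_age_days for d in docs)
    covered = sum(all(d.get(field) for field in REQUIRED_FIELDS) for d in docs)
    unique = len({d["content_hash"] for d in docs})
    return {
        "freshness": fresh / total,
        "metadata_coverage": covered / total,
        "dedupe_rate": (total - unique) / total,  # share of docs that repeat another doc
    }


def failed_gates(report: dict[str, float], min_freshness: float = 0.8,
                 min_coverage: float = 0.9, max_dupes: float = 0.05) -> list[str]:
    """Return the names of the gates that the report does not pass."""
    failures = []
    if report["freshness"] < min_freshness:
        failures.append("freshness")
    if report["metadata_coverage"] < min_coverage:
        failures.append("metadata_coverage")
    if report["dedupe_rate"] > max_dupes:
        failures.append("dedupe_rate")
    return failures
```

A gate like this can run before every ingestion or refresh cycle, so that a knowledge set failing its thresholds is flagged for review instead of being embedded.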
Our offer:
Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers.
Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location.
Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more.
Global collaboration - work with a diverse, international team.
Innovative environment - be part of a forward-thinking and growth-oriented workplace.
Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work.
Team-building events including our company tradition (annual company event in Mazury).
A pleasant surprise to start your journey with us in the form of a welcome pack.
Recruitment process:
HR call
Technical Interview
Final Interview
Decision / Feedback
Sounds interesting? Click "Apply" and have a chance to hear more!
Language skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their site.