Senior AI Data Engineer/ Data Scientist
Billennium, Poland
Billennium is a global technology company with over 20 years of experience, committed to innovation and empowering businesses. As an employer, we offer a supportive, growth-focused environment where c
Senior AI Data Engineer/ Data Scientist
- Poland, Ohio, United States
About
About the Role: We are looking for a Senior AI Data Engineer / Data Scientist who can turn messy enterprise data into AI-ready, high-quality knowledge assets.
You will lead the cleanup, preparation, and enrichment of unstructured content (SharePoint/document repositories) and structured/semi-structured data (data lakes, databases) so our agents, copilots, and RAG systems are accurate, trustworthy, and scalable.
This is a senior, hands-on role. You will own data quality outcomes end-to-end: discovery - cleanup - enrichment - ingestion - refresh cycles - governance. We value AI-native generalists who can remove bottlenecks by working directly with AI Engineers, Architects, and business stakeholders to decide what data is worth using and how to structure it for retrieval and reasoning.
Our standardized stack includes (and this role actively uses it): ingestion/ETL foundations, Postgres + pgvector as default RAG store, Redis caching, LLM gateway patterns, Langfuse observability, DeepEval/RAGAS evaluation, and Presidio for PII detection/masking when required.
Must-have requirements:
5+ years in data engineering / applied data science / analytics engineering with ownership of production pipelines.
Proven experience working with unstructured enterprise data (documents, PDFs, Office files, wikis, knowledge bases).
Solid understanding of data quality engineering: validation, monitoring, lineage, refresh cycles.
Strong stakeholder skills: able to work with the business to define what data matters and what “good” looks like.
Nice to have:
Experience with Postgres + pgvector (or similar vector stores), retrieval optimization, and hybrid search concepts.
Familiarity with observability practices for AI pipelines and the use of RAG evaluation metrics (RAGAS-style).
Experience with governance tooling and privacy controls for enterprise AI (e.g., PII workflows).
What you will do:
Lead “data triage” for AI use cases: identify authoritative sources, duplicates, outdated content, and low-quality documents.
Clean, normalize, deduplicate, and standardize enterprise content at scale (documents, PDFs, Word/Excel, wiki pages, etc.).
Define what data should be excluded from AI systems (stale, contradictory, low-trust, or sensitive content).
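To make the triage step above concrete, here is a minimal sketch of how exact duplicates and outdated content might be flagged. It is an illustration only, not the team's actual pipeline: the document fields ("id", "text", "modified"), the function names, and the two-year staleness threshold are all assumptions.

```python
import hashlib
import re
from datetime import date


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return re.sub(r"\s+", " ", text).strip().lower()


def content_key(text: str) -> str:
    """Hash of the normalized text; identical keys mean identical content."""
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


def triage(docs, today, max_age_days=730):
    """Sort documents into keep / duplicate / stale buckets.

    docs: list of dicts with the hypothetical keys "id", "text", "modified".
    Documents are visited newest first, so the newest copy of a duplicate
    group is the one that gets evaluated; older copies are marked duplicate.
    """
    result = {"keep": [], "duplicate": [], "stale": []}
    seen = set()
    for doc in sorted(docs, key=lambda d: d["modified"], reverse=True):
        key = content_key(doc["text"])
        if key in seen:
            result["duplicate"].append(doc["id"])
            continue
        seen.add(key)
        if (today - doc["modified"]).days > max_age_days:
            result["stale"].append(doc["id"])
        else:
            result["keep"].append(doc["id"])
    return result
```

Hashing only catches copies that are identical after normalization. Near-duplicates, contradictory content, and low-trust sources would need additional techniques and human review.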
Unstructured ingestion (SharePoint + document repositories)
Build robust ingestion pipelines for SharePoint and file repositories: parsing, text extraction, structure recovery, and metadata capture.
Implement document normalization strategies (naming, taxonomy, metadata standards, canonical IDs).
Design chunking strategies, metadata enrichment, and document structuring optimized for retrieval performance and cost.
Improve retrieval quality through practical techniques such as filtered retrieval and post-retrieval optimization where appropriate (e.g., reranking), collaborating with AI Engineers on the retrieval interface.
Prepare and maintain “AI-ready knowledge sets” that can be embedded and served via Postgres + pgvector (default).
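As a rough illustration of the chunking and metadata enrichment described above, the sketch below greedily packs paragraphs into chunks and attaches document metadata to each one. The Chunk class, the chunk_document function, and the 500-character budget are assumptions made for this example; a real strategy would depend on the document types and the embedding model in use.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    doc_id: str
    index: int
    text: str
    metadata: dict = field(default_factory=dict)


def chunk_document(doc_id, text, metadata, max_chars=500):
    """Greedily pack paragraphs into chunks of roughly max_chars characters.

    Paragraphs are never split, so a single paragraph longer than
    max_chars ends up as its own oversized chunk.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if current and len(candidate) > max_chars:
            # Adding this paragraph would exceed the budget: close the chunk.
            chunks.append(current)
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    # Each chunk gets its own copy of the metadata so later edits stay local.
    return [Chunk(doc_id, i, c, dict(metadata)) for i, c in enumerate(chunks)]
```

Carrying metadata such as source, owner, or last-modified date on every chunk is what makes filtered retrieval possible later on.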
Data quality, evaluation, and feedback loops (non-negotiable)
Define and implement data quality gates (freshness, completeness, relevance, dedupe rate, metadata coverage).
Partner with AI Engineers to evaluate retrieval and RAG performance using frameworks like RAGAS (answer correctness, context recall/precision) and to monitor trust metrics over time.
Establish human feedback loops where needed (review queues, sampling, targeted audits) to continuously improve data usefulness and user trust.
Governance, privacy, and auditability
Apply privacy and enterprise constraints; where required, implement PII detection/masking using Presidio patterns.
Reuse
Package reusable “data cleanup + RAG readiness” recipes: ingestion templates, metadata schemas, chunking playbooks, dedupe strategies.
Build a repeatable data foundation that accelerates future use cases (not a one-off cleanup project).
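One example of a reusable recipe could be a small data quality gate covering some of the metrics named earlier (freshness, metadata coverage, dedupe rate). The sketch below is hypothetical: the record fields ("modified", "content_hash", "title", "owner", "source"), the function names, and the thresholds are assumptions, not values from the posting.

```python
from datetime import date


def quality_report(records, today,
                   required_fields=("title", "owner", "source"),
                   max_age_days=365):
    """Compute simple quality metrics over a non-empty list of records."""
    total = len(records)
    if total == 0:
        raise ValueError("quality_report needs at least one record")
    fresh = sum(1 for r in records
                if (today - r["modified"]).days <= max_age_days)
    covered = sum(1 for r in records
                  if all(r.get(f) for f in required_fields))
    unique = len({r["content_hash"] for r in records})
    return {
        "freshness": fresh / total,            # share modified recently
        "metadata_coverage": covered / total,  # share with all required fields
        "dedupe_rate": 1 - unique / total,     # share that are redundant copies
    }


def failed_gates(report, min_freshness=0.8,
                 min_metadata_coverage=0.9, max_dedupe_rate=0.1):
    """Return the names of the metrics that miss their threshold."""
    failed = []
    if report["freshness"] < min_freshness:
        failed.append("freshness")
    if report["metadata_coverage"] < min_metadata_coverage:
        failed.append("metadata_coverage")
    if report["dedupe_rate"] > max_dedupe_rate:
        failed.append("dedupe_rate")
    return failed
```

A gate like this could run before each ingestion or refresh cycle, blocking a knowledge set from being embedded until the failed metrics are addressed.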
Our offer:
Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers.
Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location.
Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more.
Global collaboration - work with a diverse, international team.
Innovative environment - be part of a forward-thinking and growth-oriented workplace.
Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work.
Team-building events including our company tradition (annual company event in Mazury).
A pleasant surprise to start your journey with us in the form of a welcome pack.
Recruitment process:
HR call
Technical Interview
Final Interview
Decision/ Feedback
Sounds interesting? Click "Apply" and have a chance to hear more!
Language skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their website.