Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Behavior Detection Officer (BDO) - No Experience Required
Senior AI Data Engineer/ Data Scientist
BillenniumPolandBillennium is a global technology company with over 20 years of experience, committed to innovation and empowering businesses. As an employer, we offer a supportive, growth-focused environment where c
Remote Backend Engineer (Node.js/TypeScript) for High-Load APIs
Action1 CorporationPolandAction1 Corporation, located in the Town of Poland, NY, is seeking a skilled Backend Developer to join their innovative development team. This fully remote position offers the flexibility to work from
Senior Data Engineer (Java, Databricks)
EPAM Systems IncPolandWe are looking for a Senior Data Engineer to join our agile team and help design, develop, and evolve the strategic integration backbone between Dealstores, Operations, and Regulatory systems within o
Senior Embedded Systems Developer with Zephyr RTOS
Hitachi Data SystemsPolandSenior Embedded Systems Developer (Zephyr RTOS) Global Logic is looking to add a Senior Embedded Systems Developer (Zephyr RTOS) to its team.In this role, you will be part of a distributed team which
Senior Pre-Sales Solutions Architect (Remote, German)
United States Digital Space LLCPolandUnited States Digital Space LLC is seeking a Pre-Sales Solution Architect based remotely to guide customers in designing scalable systems with our data platform. This role requires a strong technical
Perennial Systems- Machine Learning Engineer
NexthirePolandExperience: 3 to 4 yearsRole: Machine Learning EngineerRemote, PunePerennial Systems is looking for a skilled Machine Learning Engineer with hands‑on experience deploying models on Google Cloud Platfo
Embedded Software Engineer - Automotive RTOS
TalentwelovePolandTalentweloveis the fastest-growing HR startup in Romania, and the first fully digitalized Talent Acquisition Partner, which is also available at a global level. Our solutions cover end-to-end talent a
Lead QA Engineer: Drive Quality & Test Automation
Riskmethods Sp. Z o.o.PolandRiskmethods Sp. Z o.o. is seeking a QA Team Lead to guide and coach the QA & Test team. The successful candidate should have a minimum of 5 years of quality assurance experience and a strong grasp of
Cyber Security Risk Analyst
EuroclearPolandLocations Poland Belgium United Kingdom FranceJob Description Cyber Security AnalystEuroclear is a global critical financial infrastructure company. Security is at the core of the company’s services,
QA Automation Lead Strategy, Architecture & Mentorship
LM Wind Power / GEPolandLM Wind Power / GE is seeking a QA Automation Team Lead to provide technical leadership and drive quality across our automation strategy. This hybrid role involves reporting to our Kraków office 2-3 d
Lead QA Engineer (Python)
Riskmethods Sp. Z o.o.PolandSphera is a leading global provider of enterprise software and services that enables companies to manage and optimize their environmental, health, safety and sustainability. Our mission is to create a
Senior Business Analyst - Regulatory Data & Reporting
EPAM Systems IncPolandEPAM Systems is seeking an experienced Business Analyst to join a client project within a leading global investment bank. In this office-based role, you will focus on regulatory operations, analyzing
Warsaw B2B SaaS Sales Exec - VDR & M&A Focus
IDEALS IncPolandIdeals is seeking a founding Sales Executive in the Town of Poland, responsible for acquiring new customers and managing a sales pipeline. Ideal candidates need 1-2 years of B2B sales experience, nati
Senior Data Engineer (Data Science)
Luxoft PolandPolandPrivate Medical & Dental care & Life InsuranceInternal Mobility program - possibility of rotation between projects, locations, accountsProject Description Join the Data Engineering team to contribute
Frontend Engineer (React)
RE Partners ConsultingPolandWho We Are We are a fast growing business and technology consultant company co-founded in 2019. We offer a custom‑tailored, white‑glove engineering service fit for our clients, because a digital trans
Hybrid Cloud AI Security Architect (Warsaw)
NeontriPolandFirma Neontri w Nowym Jorku poszukuje Cloud Solution Architect do projektowania i rozwoju bezpiecznych rozwiązań AI, takich jak systemy architektury, mechanizmy Guardrails i cele AI. Wymagane są umiej
Fullstack Engineer: Data Pipelines & ETL, Remote
First Connect Insurance ServicesPolandFirst Connect Insurance in New York is looking for a skilled Engineer to build and evolve data processing systems that transform insurance data into valuable metrics. You'll design ETL pipelines and c
Senior SEO Content Strategist
ValtechPolandAt Valtech, you’ll find an environment designed for continuous learning, meaningful impact, and professional growth. Whether you're pioneering new digital solutions, challenging conventional thinking
Web Product Designer Trinetix Responds Quickly $
MADFISHPolandWe are looking for a Senior Web Product Designer to join our Design Team and contribute to the development of large-scale platforms used by global clients.This role is for a Senior Product Designer wh
Senior Data Engineer: ML-Driven Pipelines & OpenSearch
Luxoft PolandPolandLuxoft Poland is seeking a Data Engineer to join the Data Engineering team, focusing on the improvement of an internal assistant using hosted APIs. The role requires maintaining and enhancing content
Senior Backend/Full-Stack Engineer - Data & Reporting
SpotOnPolandSpotOn, located in the Town of Poland, New York, is seeking a Senior Software Engineer specializing in backend or full-stack development. You will design, build, and own the systems and experiences th
Graduate Data Scientist & Analyst Hybrid & High-Growth
RevolutPolandRevolut is seeking ambitious graduates for their Graduate Programme as Data Scientists and Analysts. This hybrid role involves working on real projects, enhancing data infrastructures, and collaborati
Senior Analytics Engineer (GCP) - Data Platform Innovator
XebiaPolandXebia is looking for an experienced Analytics Engineer / Data Engineer to lead the development and optimization of data models primarily in dbt/Dataform on Google BigQuery. The role will involve colla
T Hub - Senior Java Backend Developer
MagentateamPolandDesign, develop, and maintain scalable and high-performance backend applications with Kotlin (mainly), Java, Spring, Docker, Kubernetes or other relevant technologiesCollaborate with cross-functional
Unity Developer (Mobile Games)
VGW Malta LimitedPolandVGW is an interactive entertainment company, harnessing technology and creativity to deliver world-class, free-to-play online social games. We have an exciting opportunity to join our Growth Products
Senior AI Data Engineer/ Data Scientist
- Poland, Ohio, United States
- Poland, Ohio, United States
Über
About the Role: We are looking for a Senior AI Data Engineer / Data Scientist who can turn messy enterprise data into AI-ready, high-quality knowledge assets.
You will lead the cleanup, preparation, and enrichment of unstructured content (SharePoint/document repositories) and structured/semi-structured data (data lakes, databases) so our agents, copilots, and RAG systems are accurate, trustworthy, and scalable.
This is a senior, hands-on role. You will own data quality outcomes end-to-end: discovery - cleanup - enrichment - ingestion - refresh cycles - governance. We value AI-native generalists who can remove bottlenecks by working directly with AI Engineers, Architects, and business stakeholders to decide what data is worth using and how to structure it for retrieval and reasoning.
Our standardized stack includes (and this role actively uses it): ingestion/ETL foundations, Postgres + pgvector as default RAG store, Redis caching, LLM gateway patterns, Langfuse observability, DeepEval/RAGAS evaluation, and Presidio for PII detection/masking when required.
Must-have requirements:
5+ years in data engineering / applied data science / analytics engineering with ownership of production pipelines.
Proven experience working with unstructured enterprise data (documents, PDFs, Office files, wikis, knowledge bases).
Solid understanding of data quality engineering: validation, monitoring, lineage, refresh cycles.
Strong stakeholder skill : can work with business to define what data matters and what “good” looks like.
Nice to have:
Experience with Postgres + pgvector (or similar vector stores), retrieval optimization, and hybrid search concepts.
Familiarity with observability practices for AI pipelines and the use of RAG evaluation metrics (RAGAS-style).
Experience with governance tooling and privacy controls for enterprise AI (e.g., PII workflows).
What you will do:
Lead “data triage” for AI use cases: identify authoritative sources, duplicates, outdated content, and low-quality documents.
Clean, normalize, deduplicate, and standardize enterprise content at scale (documents, PDFs, Word/Excel, wiki pages, etc.).
Define what data should be excluded from AI systems (stale, contradictory, low-trust, or sensitive content).
Unstructured ingestion (SharePoint + document repositories)
Build robust ingestion pipelines for SharePoint and file repositories: parsing, text extraction, structure recovery, and metadata capture.
Implement document normalization strategies (naming, taxonomy, metadata standards, canonical IDs).
Design chunking strategies, metadata enrichment, and document structuring optimized for retrieval performance and cost.
Improve retrieval quality through practical techniques such as filtered retrieval and post-retrieval optimization where appropriate (e.g., reranking), collaborating with AI Engineers on the retrieval interface.
Prepare and maintain “AI-ready knowledge sets” that can be embedded and served via Postgres + pgvector (default).
Data quality, evaluation, and feedback loops (non-negotiable)
Define and implement data quality gates (freshness, completeness, relevance, dedupe rate, metadata coverage).
Partner with AI Engineers to evaluate retrieval and RAG performance using frameworks like RAGAS (answer correctness, context recall/precision) and to monitor trust metrics over time.
Establish human feedback loops where needed (review queues, sampling, targeted audits) to continuously improve data usefulness and user trust. Governance, privacy, and auditability
Apply privacy and enterprise constraints; where required, implement PII detection/masking using Presidio patterns.
Reuse Package reusable “data cleanup + RAG readiness” recipes: ingestion templates, metadata schemas, chunking playbooks, dedupe strategies.
Build a repeatable data foundation that accelerates future use cases (not a one-off cleanup project).
Our offer:
Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers.
Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location.
Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more.
Global collaboration - work with a diverse, international team.
Innovative environment part of a forward-thinking and growth-oriented workplace.
Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work.
Team-building events including our company tradition (annual company event in Mazury).
A pleasant surprise to start your journey with us in the form of a welcome pack.
Recruitment process:
HR call
Technical Interview
Final Interview
Decision/ Feedback
Sounds interesting? Click "Apply" and have a chance to hear more!
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.