This job offer is no longer available
GenAI Data Engineer
DCV Technologies
- London, England, United Kingdom
About
Your responsibilities:
* Design and maintain scalable data pipelines using PySpark, Python, and distributed computing frameworks to support high‑volume data processing.
* Architect and optimize AWS-based data and AI infrastructure, ensuring secure, performant, and cost‑efficient ingestion, transformation, and storage.
* Develop, fine-tune, benchmark, and evaluate GenAI/LLM models, including custom training and inference optimization.
* Implement and maintain RAG pipelines, vector databases, and document-processing workflows for enterprise GenAI applications.
* Build reusable frameworks for prompt management, evaluation, and GenAI operations.
* Collaborate with cross-functional teams to integrate GenAI capabilities into production systems and ensure high-quality data, governance, and operational reliability.
Your Profile
Essential skills/knowledge/experience:
* Strong experience with PySpark, distributed data processing, and large-scale ETL/ELT pipelines.
* Strong SQL expertise, including star/snowflake schema design, indexing strategies, writing optimized queries, and implementing CDC and SCD Type 1/2/3 patterns for reliable data warehousing.
* Advanced proficiency in Python for data engineering, automation, and ML/GenAI integration.
* Hands-on expertise with AWS...
Languages
- English