AI Sr. Data Engineer
Lenovo
- Edinburgh, Scotland, United Kingdom
About
- Data Creation & Annotation: Design, build, and implement processes for creating task‑specific training datasets. This may include data labeling, annotation, and data augmentation techniques.
- Data Pipeline Development: Leverage tools and technologies to accelerate dataset creation and improvement. This includes scripting, automation, and potentially working with data labeling platforms.
- Data Quality & Evaluation: Perform thorough data analysis to assess data quality, identify anomalies, and ensure data integrity. Utilize machine learning tools and techniques to evaluate dataset performance and identify areas for improvement.
- Big Data Technologies: Utilize database systems (SQL and NoSQL) and big data tools (e.g., Spark, Hadoop, cloud‑based data warehouses like Snowflake/Redshift/BigQuery) to process, transform, and store large datasets.
- Data Governance & Lineage: Implement and maintain data governance best practices, including data source tracking, data lineage documentation, and license management. Ensure compliance with data privacy regulations.
- Collaboration with Model Developers: Work closely with machine learning engineers and data scientists to understand their data requirements, provide clean and well‑documented datasets, and iterate on data solutions based on model performance feedback.
- Documentation: Create and maintain clear and concise documentation for data pipelines, data quality checks, and data governance procedures.
- Stay Current: Keep up‑to‑date with the latest advancements in data engineering, machine learning, and data governance.
Qualifications
- Education: Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, Statistics, Mathematics, or a related field.
- Experience: 15+ years of experience in a data engineering or data science role.
- Programming Skills: Mastery of Python and SQL. Experience with other languages (e.g., Java, Scala) is a plus.
- Database Skills: Strong experience with relational databases (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra).
- Big Data Tools: Experience with big data technologies such as Spark, Hadoop, or cloud‑based data warehousing solutions (Snowflake, Redshift, BigQuery).
- Data Manipulation: Proficiency in data manipulation and cleaning techniques using tools like Pandas, NumPy, and other data processing libraries.
- ML Fundamentals: Solid understanding of machine learning concepts and techniques, including data preprocessing, feature engineering, and model evaluation.
- Data Governance: Understanding of data governance principles and practices, including data lineage, data quality, and data security.
- Communication Skills: Excellent written and verbal communication skills, with the ability to explain complex technical concepts to both technical and non‑technical audiences.
- Problem Solving: Strong analytical and problem‑solving skills.
Bonus Points
- Experience with data labeling platforms (e.g., Labelbox, Scale AI, Amazon SageMaker Ground Truth).
- Experience with MLOps practices and tools (e.g., Kubeflow, MLflow).
- Experience with cloud platforms (e.g., AWS, Azure, GCP).
- Experience with data visualization tools (e.g., Tableau, Power BI).
- Experience with building and maintaining data pipelines using orchestration tools (e.g., Airflow, Prefect).
What we offer
- Opportunities for career advancement and personal development
- Access to a diverse range of training programs
- Performance‑based rewards that celebrate your achievements
- Flexibility with a hybrid work model (3:2) that blends home and office life
- Electric car salary sacrifice scheme
- Life insurance
This role is open for the Edinburgh, Scotland location only. Candidates must be based there, as the position requires working from the office at least three days per week (3:2 hybrid policy).
Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Information Technology and Research
Industries: IT Services and IT Consulting
Language skills
- English
Notice to users
This listing comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their site.