Data Engineer with PySpark & DPL

Purple Drive

United States

United States

Trouver des emplois similaires

À propos

Required Skills & Qualifications
Strong hands-on expertise with:
PySpark
(RDD, Data Frames, Spark SQL, performance tuning) DPL
(Data Pipeline Language / relevant tool-specific DPL)
Proficiency in
Python
for data engineering workflows. Experience with distributed computing and big data technologies (Spark, Hadoop, Delta Lake). Strong SQL skills and experience with relational and NoSQL databases. Experience building ETL/ELT pipelines on cloud platforms (AWS / Azure / GCP). Familiarity with CI/CD, Git, and containerization (Docker/Kubernetes) is a plus. Bachelor's or Master's in Computer Science, Engineering, or related field. Preferred Skills
Experience with orchestration tools (Airflow, ADF, Argo, Prefect). Knowledge of data warehousing concepts (Star schema, SCD, normalization). Experience with streaming platforms (Kafka, Kinesis, Spark Streaming). Exposure to data governance, security, and compliance frameworks. Experience working in Agile environments.

United States

Compétences linguistiques

English

Avis aux utilisateurs

Cette offre a été publiée par l’un de nos partenaires. Vous pouvez consulter l’offre originale ici.

Trouver des emplois similaires