À propos
Design and operate large-scale data pipelines supporting AI training and evaluation workflows Build ingestion systems for diverse data types, including text, images, audio, and video Implement data quality assurance processes at petabyte scale, including cleaning and deduplication
Required Qualifications
Bachelor's or Master's degree in Computer Science or a related field Six or more years of data engineering experience, particularly with ML or AI workloads Strong proficiency in Python and at least one JVM or systems language Deep experience with modern data processing frameworks such as Spark, Ray, or Beam Hands-on experience operating petabyte-scale storage and pipeline systems
Compétences linguistiques
- English
Avis aux utilisateurs
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.