À propos
Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale
Required Qualifications
Bachelor's or Master's degree in Computer Science or a related field Six or more years of data engineering experience, with significant work supporting ML or AI workloads Strong proficiency in Python and at least one JVM or systems language Deep experience with modern data processing frameworks such as Spark, Ray, or Beam Hands-on experience operating petabyte-scale storage and pipeline systems
Compétences linguistiques
- English
Avis aux utilisateurs
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.