About
- Design and operate large-scale data pipelines supporting AI training and evaluation workflows
- Build ingestion systems for diverse data types, including text, images, audio, and video
- Implement data quality assurance processes at petabyte scale, including cleaning and deduplication
Required Qualifications
- Bachelor's or Master's degree in Computer Science or a related field
- Six or more years of data engineering experience, particularly with ML or AI workloads
- Strong proficiency in Python and at least one JVM or systems language
- Deep experience with modern data processing frameworks such as Spark, Ray, or Beam
- Hands-on experience operating petabyte-scale storage and pipeline systems
Language Skills
- English
Note for Users
This job posting comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their website.