About
Design and operate large-scale data pipelines supporting AI training and evaluation workflows Build ingestion systems for diverse data types, including text, images, audio, and video Implement data quality assurance processes at petabyte scale, including cleaning and deduplication
Required Qualifications
Bachelor's or Master's degree in Computer Science or a related field Six or more years of data engineering experience, particularly with ML or AI workloads Strong proficiency in Python and at least one JVM or systems language Deep experience with modern data processing frameworks such as Spark, Ray, or Beam Hands-on experience operating petabyte-scale storage and pipeline systems
Languages
- English
Notice for Users
This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.