About
- Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows
- Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals
- Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale
Required Qualifications
- Bachelor's or Master's degree in Computer Science or a related field
- Six or more years of data engineering experience, with significant work supporting ML or AI workloads
- Strong proficiency in Python and at least one JVM or systems language
- Deep experience with modern data processing frameworks such as Spark, Ray, or Beam
- Hands-on experience operating petabyte-scale storage and pipeline systems
Languages
- English