This job offer is no longer available
About
Design and operate large-scale data pipelines supporting AI training and evaluation workflows Build ingestion systems for diverse data modalities and implement data cleaning and quality assurance at petabyte scale Collaborate with ML researchers to align data systems with model development needs and drive observability of data quality
Required Qualifications
Bachelor's or Master's degree in Computer Science or a related field Six or more years of data engineering experience with a focus on ML or AI workloads Strong proficiency in Python and at least one JVM or systems language Deep experience with modern data processing frameworks such as Spark, Ray, or Beam Hands-on experience operating petabyte-scale storage and pipeline systems
Languages
- English
Notice for Users
This job was posted by one of our partners. You can view the original job source here.