Senior Data Engineer, ML Platform, Reddit, United States

This job offer is no longer available


Senior Data Engineer, ML Platform

Reddit
  • US
    United States

About

Remote - United States

Who We Are:

The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure powering recommendations, content discovery, and user and content quantification, while directly impacting other teams such as Growth, Ads, Feeds, and Core Machine Learning.

What You'll Do:

As a Senior Data Engineer, you will lead development of data pipelines and workflows for large-scale ML models at Reddit.

Responsibilities:

  • Design and implement scalable and secure data processing pipelines and storage environments that prepare our source-of-truth datasets for our models.
  • Ensure data is cleansed, mapped, transformed, and otherwise optimized for storage and use according to business and technical requirements.
  • Build effective data pipelines and workflows to streamline data ingestion, processing, and distribution tasks.
  • Set up and operate data workflow management tools for SQL code versioning, dependency tracing, etc.
  • Load transformed data into storage and reporting structures in destinations including data warehouses, reporting systems, and analytics applications.
  • Monitor and troubleshoot issues with the data environment to maintain high availability and performance.
  • Support monitoring and observability across training datasets and model metrics, and implement diagnostic tools for metric movements.
  • Maintain effective documentation of data procedures, systems, and architectures to preserve clarity and enable easy collaboration.

Qualifications:

  • 5+ years of experience in Data Engineering or ML Infrastructure
  • Experience with large-scale data transforms to prepare graph data
  • Experience with Graph DB, Spark, and Kafka pipelines
  • Experience working with Airflow and MLflow
  • Experience with storage frameworks like BQ, Parquet, and Iceberg
  • Awareness of ML models and architectures is a huge plus
  • Strong focus on scalability, reliability, performance, and ease of use
  • You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle
  • Strong organizational & communication skills

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days Off
  • Generous Paid Parental Leave
  • Paid Volunteer Time Off

Pay Transparency:

This job posting may span more than one career level. In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit.

The base pay range for this position is: $190,800 - $267,100 USD

Languages

  • English
Notice for Users

This job was posted by one of our partners. You can view the original job source here.