About
GCP Data Engineer (MLOps)
Start Date: Targeting April 1
Location: Remote (USA). Preferred: Texas or New Jersey
About the role
We are seeking a GCP Data Engineer with strong MLOps experience to build, scale, and operationalize data and ML pipelines on Google Cloud. You will partner with Data Science, Product, and Platform teams to deliver reliable, production-grade workflows for batch and real-time machine learning, while driving model performance monitoring and operational excellence.
Key responsibilities
- Design, develop, and optimize scalable data pipelines and ML workflows on GCP with BigQuery and Spark
- Build robust ELT/ETL processes and data models supporting ML feature stores, training datasets, and production inference
- Orchestrate pipelines and jobs, enabling dependency management, retries, and observability (e.g., Airflow)
- Implement CI/CD and automation for data/ML pipelines, including packaging, versioning, and environment promotion
- Develop event-driven and micro-batch processes for real-time ML inference (e.g., via Cloud Functions) and low-latency data preparation
- Establish model performance monitoring, drift detection, data quality checks, and alerting dashboards
- Collaborate closely with Data Scientists to productionize models and establish reproducible training/inference workflows
- Enforce best practices for code quality, testing, documentation, and cost/performance optimization on GCP
- Troubleshoot production issues, drive root-cause analysis, and implement durable fixes and postmortems
Must-have qualifications
- Hands-on experience with Google Cloud (BigQuery) in production environments
- Strong Spark expertise (data processing, optimization, and job orchestration)
- Advanced proficiency in Python and SQL for data engineering and ML pipeline development
- Demonstrated experience building and supporting production-grade data/ML pipelines
Good-to-have (preferred) skills
- GCP services and tooling: Airflow, gcloud CLI, Cloud Functions
- Solid understanding of core ML concepts (training, evaluation, deployment patterns)
- ML model performance monitoring (data/feature drift, model decay, alerting, dashboards)
- Explainable AI (xAI) and LLM concepts (prompting, evaluation, guardrails)
- Real-time machine learning patterns (feature serving, low-latency inference, event-driven architectures)
- Experience with packaging, testing, and CI/CD for ML (artifact/version management, reproducibility)
Language skills
- English