Senior Machine Learning Operations and Data EngineerEmancro • Berkeley, California, United States
This job offer is no longer available
Senior Machine Learning Operations and Data Engineer
Emancro
- Berkeley, California, United States
- Berkeley, California, United States
About
Start date: As soon as possible, no later than June 1st 2024 Design, develop, and maintain scalable data pipelines and ETL processes to extract, transform, and load data at large scale (in the order of 100sTB) Setting up and maintaining cloud-database (e.g. DynamoDB, Postgres etc.) Manage containerized environments (e.g., Docker, Kubernetes) for running machine learning workloads. Setting up and Maintaining Cloud multi-GPU training infrastructure (GCP, AWS, Azure) with Pytorch and Jax, (both model and data parallelism) Setting up and Maintaining MLOps frameworks, e.g. ClearML, ZenML etc. Implement CI/CD pipelines and automation tools to streamline the model development and deployment process. Deploying ML Models on the cloud for low-latency production/serving Key Qualifications
Expert knowledge of using and configuring GCP (Vertex), AWS, Azure Python: 5+ years of experience Machine Learning libraries: Pytorch, Jax, model and data parallelism Development tools: Bash, Git Data Science frameworks: Databricks Data Logging: Weights and Biases Optional Qualifications
Experience training LLMs and VLMs Emancro is committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status.
#J-18808-Ljbffr
Languages
- English
Notice for Users
This job was posted by one of our partners. You can view the original job source here.