About
Biohub operates one of the largest AI compute clusters dedicated to biology, spanning three frontier research institutes with some of the worlds leading biologists. Were not a startup trying to find product‑market fit, and were not a pharma company optimizing a pipeline. Were building frontier AI for fundamental science, as open science, at a scale no one else is doing. This is a unique moment for scientific acceleration. The problems are among the hardest and most impactful problems you can choose to work on, and we move at a pace that meets this moment.
Frontier molecular modeling, from protein language models (e.g., ESM) to structure prediction (e.g., ESMFold) and beyond.
Scaled biological foundation models trained on some of the largest GPU clusters dedicated to science
Imaging foundation models trained across the worlds largest microscopy datasets
Reasoning and agentic systems that connect frontier LLMs with biological foundation models
Mechanistic interpretability of biological foundation models: extracting new biological knowledge directly from model weights
Scientific data at unprecedented scale: AI systems to collect, curate, and learn from some of the richest biological datasets ever assembled
Join our Team! As an ML Engineer, youll join some of the strongest infrastructure engineers in AI, building the systems that connect everything together. The infrastructure problems you solve directly determine what science becomes possible.
What Youll Do Build and maintain pre‑training infrastructure across thousands of GPUs.
Design and optimize GPU‑native data loading pipelines for scientific data workloads at petabyte scale.
Build I/O and pipeline systems for biological data unlike anything in standard AI: microscopy volumes, transcriptomics, spatial genomics.
Define the abstractions that researchers will build on for years.
Own the ML lifecycle: artifact tracking, fine‑tuning pipelines, monitoring, and production reliability.
Build the DevOps and tooling that make every engineer and researcher more productive.
Deploy Biohubs technology, powering the tools scientists use worldwide.
What Youll Bring Were looking for engineers whove built infrastructure for large‑scale ML systems and are energized by problems that dont have existing solutions yet.
Hands‑on Pytorch: custom training loops, distributed training, or low‑level performance work
GPU‑native data I/O and large‑scale tensor formats (Zarr, HDF5, TensorStore)
Distributed computing frameworks (Spark, Dask, Ray)
Docker and Kubernetes
A track record of building systems that other engineers and researchers depend on
Experience in building AI agent frameworks is a plus
Compensation The future anticipated Redwood City, CA, and New York City, NY base pay range for a role in this field is $150,000 to $350,000+ annually. Compensation ranges will vary based on job‑related skills, level of experience, and knowledge. Actual placement in range is based on job‑related skills and experience, as evaluated throughout the interview process.
Benefits for the Whole You We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.
Provides a generous employer match on employee 401(k) contributions to support planning for the future.
Paid time off to volunteer at an organization of your choice.
Relocation support for employees who need assistance moving
As set forth in the organization’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.
J-18808-Ljbffr
Languages
- English
Notice for Users
This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.