À propos
Founding Data Engineer, Clinical Trials and Oncology Data
Location:
Hybrid, San Francisco and/or Los Angeles
Experience:
3 to 7 years
Type:
Full-time
Stage:
Early, founding engineering hire
About Miraei
Miraei is building the deal engine for life sciences.
Business development in life sciences is still driven by fragmented data, manual research, and slow, relationship-heavy workflows. Miraei changes that by structuring and continuously tracking clinical trials and scientific data, then transforming it into actionable intelligence that powers how deals are identified, evaluated, and executed.
We start by helping vendors and diagnostics companies identify and engage the right biopharma partners around active and emerging clinical trials. Over time, Miraei becomes the platform where life sciences deals occur end to end, from vendors to biopharma, biopharma to biotechs, and cross-border partnerships such as biopharma seeking assets and collaborators internationally.
We are venture-backed and are generating revenue from enterprise customers.
The role
We are hiring a Founding Data Engineer to design and own the core data architecture, pipelines, and processes that powers Miraei. This role is responsible for building the canonical data models for clinical trial intelligence and ensuring our data pipelines are scalable and reliable as we ingest more sources, trials, and send out real-time updates.
This is a hands-on individual contributor role. You will write production code, make architectural decisions, and shape the long-term data foundation of the company.
What you will do
- Design and implement core data schemas for clinical trial data and data sources related to clinical assets, including
- Trials, arms, cohorts, endpoints, biomarkers, sponsors, and timelines
- Longitudinal versioning across abstracts, amendments, and readouts
- Press releases, news, and publications
- Build hierarchical taxonomies and ontologies for oncology and clinical research
- Indications, modalities, mechanisms of action, biomarkers, endpoints
- Architect and maintain data ingestion pipelines from
- Conference abstracts
- Clinical trial registries
- Publications and structured internal outputs
- Enable longitudinal tracking and alerting as trials evolve over time
- Partner closely with product and ML to ensure the data model supports downstream reasoning and user workflows
- Make pragmatic early-stage tradeoffs and evolve the system as the company scales
What we're looking for
- 3 to 7 years of experience as a data engineer or analytics engineer
- Prior experience working with clinical trial or life sciences data strongly preferred
- Pharma, biotech, diagnostics, CRO, real-world data, or clinical informatics
- Startup experience required
- You have built systems in ambiguous, fast-moving environments
- Strong fundamentals in:
- Database design (OLTP/OLAP), data modeling, metadata management, and schema design
- Skills in building reliable ETL/ELT pipelines, data integration, transformation, validation, and orchestration
- SQL/Python/Bash scripting
- Cloud-based data infrastructure (AWS/GCP)
- Experience with modern software development tools, such as version control (git), automations/CI/CD (GitHub actions, Jenkins, etc), Docker containerization, etc
- Comfortable owning systems end to end as a senior IC
- Clear communicator who can explain tradeoffs and push back when needed
- Must be authorized to work in the United States. Visa sponsorship is not available for this position.
Nice to have
- Oncology domain expertise or familiarity
- Experience with ontology, RAG/knowledge graph, vector databases or other information retrieval experience
- Exposure to ML feature pipelines, context engineering, prompt engineering, and other AI-adjacent systems
Compensation and benefits
- Base salary:
$150k to $180k, depending on experience - Equity:
0.75% to 1.5% fully diluted, 4-year vest with 1-year cliff - Benefits:
Full benefits included
Why this role matters
The data layer is the product. Decisions made here will define what Miraei can and cannot become. This is a foundational role with real ownership, autonomy, and long-term impact.
Compétences linguistiques
- English
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.