Offres d'emploi
Trouvez des postes près de chez vous, sur site, hybrides ou à distance.- Emplois similaires à : AI Data Engineer - Scientific Data Platforms (Remote)
AI Data Engineer - Scientific Data Platforms (Remote)
Astrix TechnologyUnited StatesAI Data Engineer - Scientific Data Platforms (Remote)Science & ResearchSouth San Francisco, CA, USAdded - 15/06/2026Pay Rate Low: 35 | Pay Rate High: 40Our client is a leading global biotechnology and
Scientific Lead - Scientific Data Engineer
BioSpace, Inc.San FranciscoAt Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work
Remote AI Scientific Reasoning Data Engineer
Codefeast EnterprisesNew YorkCodefeast Enterprises seeks a Scientific Reasoning & Discovery Engineer to design high-quality datasets enhancing scientific reasoning capabilities of LLMs. The role involves creating tasks that requi
Data Engineer (Data Strategy & Platforms)
Elios AIHoustonData Engineer (Data Strategy & Platforms) Location: Houston, Texas (hybrid) | Type: Full Time | Experience: 3 to 7+ yearsAbout the Role You will work with clients to design, build, and optimize modern
Data Engineer - Data Platforms-Azure
CovetitUnited StatesData EngineerA data engineer with expertise in Azure toolset advises on, develops, and maintains data engineering solutions on the Azure Cloud ecosystem. They design, build, and operate batch and real
US Data Engineer - Data Platforms
ArtechUnited StatesData EngineerArtech is currently seeking to add to the below position Location: Phoenix, AZ (Onsite) Duration: 12 Months (Long Term) Rate Range: $65/Hr Required skills: Senior Data Engineer – Spark/Py
Lead Scientific Data Engineer
Virtual Vocations IncUnited StatesProviding senior technical leadership, the full-time Lead Scientific Data Engineer will develop implementation roadmaps and architectures for core scientific data systems, while leading the design of
Senior Data Engineer, Healthcare Data Platforms
Hispanic Alliance for Career EnhancementSioux FallsCVS Health is seeking an experienced Data Engineer to join our team in South Dakota. This role will focus on building scalable data pipelines and engineering solutions to manage large data sets while
Staff Data Engineer - Remote, ELT & AI Platforms
B CapitalSanta BarbaraB Capital is looking for a Staff Data Engineer to provide technical leadership across Data Engineering in a healthcare data environment. You will ensure the delivery of scalable data solutions, integr
Senior Data Engineer — Identity Data Platforms
The Walt Disney CompanySanta MonicaThe Walt Disney Company is seeking a Sr Data Engineer to focus on building and maintaining data products that drive advertising performance across various platforms. The ideal candidate will have over
Senior Data Engineer: Scalable Pipelines & Data Platforms
IGH ICW GROUP HOLDINGS, INC.NorforkIGH ICW GROUP HOLDINGS, INC. seeks a Data Engineer III focused on designing and implementing data pipelines and integration solutions. You'll collaborate with multiple teams to solve significant data
Senior Data Engineer Scalable Data Platforms & Pipelines
MasterCardO'FallonMasterCard is looking for a Senior Data Engineer in O’Fallon, Missouri to build and optimize data platforms that enable advanced analytics and machine learning. You'll collaborate with engineering, pr
Senior Big Data Engineer - ETL & Data Platforms
NGDataNew YorkNGDATA is looking for a Big Data Engineer to oversee the complete lifecycle of customer engagements, including design, development, and support. The candidate will also collaborate with teams to desig
Strategist - Analytics & Data Platforms (Remote)
PretzelsHersheyPretzels, Inc is looking for a Strategist Reporting Capabilities professional in Hershey, PA or remote to lead analytics initiatives and deliver actionable business insights. You will partner with var
Senior Data Engineer: Cloud Data Platforms & AI
ValtechPolandValtech is seeking an experienced Senior Data Engineer to design and optimize cloud-based data platforms. You will build and manage data pipelines that support analytics and AI-driven applications, en
Azure Data Engineer - Lakehouse & AI Platforms
Navitus Health SolutionsMadisonNavitus Health Solutions is seeking an experienced Data Engineer to design and implement modern cloud-native data platforms. You will collaborate with various teams to develop scalable data solutions
Senior Data Engineer: Scalable Pipelines & Platforms
HNTBAtlantaHNTB Corporation is seeking a Data Engineer III to develop scalable data solutions and lead pipeline architecture. The ideal candidate should have experience with data lifecycle management and a stron
Senior Staff Data Engineer - Hybrid Cloud Data Platforms
The HartfordHartfordThe Hartford is seeking a Senior Staff Data Engineer to shape data ingestion and transformation for Sales and Underwriting. This role involves working with cloud-based data platforms and offers a hybr
Lead Data Engineer (Enterprise Platforms Technology)
Capital One National AssociationPlanoLead Data Engineer (Enterprise Platforms Technology) Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast‑paced, collaborative, inclusi
Lead Data Engineer - Cloud Data Platforms & Real-Time
LennarIrvingLennar USA is seeking a Lead Data Engineer to oversee the data engineering team and drive data initiatives. This role involves managing projects, ensuring scalability of data solutions, and collaborat
Senior Data Engineer: Create Next-Gen Data Platforms
UlineWaukeganUline is seeking a Senior Data Engineer to enhance data engineering and analytics. Located in Waukegan, Illinois, the role focuses on creating and maintaining innovative data systems and analytics pla
Senior Data Engineer: Build Scalable Data Platforms & Analytics
CompunnelDurhamCompunnel, Inc. is looking for a Senior Data Engineer to design and maintain scalable data solutions that support critical business operations. Ideal candidates will have over 7 years of experience in
Full Stack Application Engineer (Analytics & Data Platforms) - Forsyth - Remote
CarepathRxUnited StatesData-Driven Application DeveloperBring data to life through modern applications and intelligent experiences. In this role, you will help design and build scalable, user-friendly solutions that connect
Staff Data Engineer End-to-End Biotech Data Platforms
GRAIL, Inc.DurhamGRAIL is seeking a Staff Data Engineer to lead data systems design and development, contributing to cancer detection advancements. Based in Durham, NC, this hybrid role involves collaboration with cro
Data Engineer for AI/ML Data Platforms & LLM Pipelines
Nightwing Intelligence Solutions, LLCSterlingNightwing Intelligence Solutions, LLC is seeking a Data Engineer to lead data architecture and engineering delivery for AI/ML solutions, ensuring data is secure, observable, and scalable. The candidat
AI Data Engineer - Scientific Data Platforms (Remote)
- United States
- United States
À propos
Science & Research
South San Francisco, CA, US
Added - 15/06/2026
Pay Rate Low: 35 | Pay Rate High: 40
Our client is a leading global biotechnology and pharmaceutical organization driven by a mission to innovate, continuously advance science, and ensure everyone has access to the healthcare they need.
Title:
AI Data Engineer - Scientific Data Platforms
Location:
Remote, Must work PST
Pay rate:
$35-38/hr (Depends on experience level)
Schedule:
Full-time (40 hours/week)
Duration:
1-year contract, (Plus benefits)
Position Overview
This role addresses a critical need in scaling our AI models for drug discovery by building largely automated, scalable, agent-driven data ingestion and curation pipelines for genomics data. This includes metadata inference, constructing performant query architectures, and transforming high-dimensional datasets (e.g., single-cell omics, clinical trials) into AI-ready training formats.
Key Responsibilities
Build an agentic data ingestion pipeline and move beyond bespoke steps toward agents that teams can reliably use as a shared, deployed service.
Triage and prioritize incoming requests to ingest specific datasets. Clean and organize data, building the first-pass cleaning and organization steps into the agentic flow.
Validate cross-modal linkage. Add automated checks that catch when ingested data does not connect correctly and flag low-quality or mismatched records.
Version every dataset, retaining and making prior versions addressable. Preserve raw data and provenance, ensuring agent workflows log validation and transformation steps so lineage is fully traceable.
Partner with AI, software engineering, and computational biology groups to co-define data standards and conventions.
Qualifications & Requirements
Demonstrated experience building multi-agent workflows or LLM workflows using tools/frameworks such as LangGraph or LlamaIndex, including tool/function calling and asynchronous task execution.
Strong Python skills for data manipulation, working with APIs and databases, and handling heterogeneous data formats.
Familiarity with dataset versioning approaches (e.g., DVC, lakeFS, or equivalent).
Comfortable with or showing a strong willingness to learn common omics data formats like AnnData, H5AD, and TileDB.
No deep bioinformatics expertise required; just a basic conceptual understanding of different modalities (e.g., RNA-seq vs. scRNA-seq vs. WES; genomics vs. transcriptomics vs. proteomics vs. metabolomics).
Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible.
Degree in a technical field or equivalent practical experience.
Must be Authorized to work in the United States without Sponsorship.
Nice to Have
Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints).
Exposure to cloud platforms (AWS, GCP) and containerization (Docker).
Familiarity with scientific workflow managers such as Nextflow or Snakemake.
INDBH
#LI-MG1
We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.
Compétences linguistiques
- English
Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.