Data Engineer E3
About
The Team
Biohub is a 501(c)(3) biomedical research organization building the first large‑scale scientific initiative combining frontier AI with frontier biology to solve disease. We build the technology to help scientists around the world use AI‑powered biology to study how cells operate, organize, and work as part of systems to understand why disease happens and how to correct it. With our compute capacity, AI research and engineering, and state‑of‑the‑art technology for measuring, imaging, and programming biology, we are enabling scientists worldwide to use AI‑powered biology to advance our understanding of human health.
The Opportunity
The role is part of the Data Engineering team, which focuses on owning the strategy, sourcing and implementation for data supporting AI research and development. Our goal is to maximize the speed, agility, and capability of biological AI research by connecting public data resources and Biohub's experimental platforms to AI systems. The data that trains biological frontier models comes in dozens of modalities (sequences, images, spatial coordinates, time series, molecular structures, metadata, publication artifacts) each with its own noise characteristics, biases, and information content. The question of how to represent this data for learning is one of the most important open problems in biological AI.
As a data engineer at Biohub, you'll be designing systems that ingest data from public repositories, transform heterogeneous biological formats into AI‑ready datasets, combine that with proprietary datasets, and deliver training datasets to researchers pushing the boundaries of what's possible in biological AI. The infrastructure you build will directly shape what our models can learn.
We're a small team with significant resources and long time horizons. We use AI tools aggressively in our own work—Claude Code, agents for workflow automation, LLMs for metadata extraction. We care about code quality, operational reliability, and building systems that scale. And we care about the biology: we want engineers who can recognize when a pipeline output is technically correct but scientifically wrong.
If you want to work at the intersection of large‑scale infrastructure and frontier science, with real autonomy and the chance to build something genuinely new, we'd like to talk.
What You'll Do
- Design and build data pipelines that process genomic and imaging data at petabyte scale
- Solve performance and bandwidth challenges with creative engineering
- Build agent‑based systems for automated dataset curation, quality control, and workflow generation
- Create tooling for data cataloging and registration that makes datasets discoverable and accessible
- Collaborate with AI Research teams to translate model requirements into data specifications, and with our scientists to integrate public and internal data into large‑scale AI‑ready datasets
- Improve pipeline reliability and observability, working toward 99%+ success rates without manual intervention
What You’ll Bring
- 8+ years of experience building reliable, operable data systems at scale (hundreds of terabytes to petabytes)
- Strong software engineering fundamentals
- Experience deploying distributed computing frameworks like Databricks, Spark, or Ray for large‑scale data processing
- Experience with cloud infrastructure (AWS preferred) and HPC environments
- Comfort with ambiguity; ability to make progress when requirements are evolving
- Interest in AI‑native development practices and tooling
- Nice to have: Background in computational biology, bioinformatics, or life sciences, and experience with genomics datasets and formats (FASTQ, BAM, VCF) or imaging formats (OME‑Zarr, HDF5)
Compensation
The future anticipated Redwood City, CA, and New York City, NY base pay range for a role in this field is $241,000–$338,000 annually. Compensation ranges will vary based on job‑related skills, level of experience, and knowledge. Actual placement in range is based on job‑related skills and experience, as evaluated throughout the interview process.
Benefits for the Whole You
- Provides a generous employer match on employee 401(k) contributions to support planning for the future
- Paid time off to volunteer at an organization of your choice
- Funding for select family‑forming benefits
- Relocation support for employees who need assistance moving
If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.
Language skills
- English
This job posting comes from a TieTalent partner platform. Click "Apply now" to submit your application directly on their website.