Job Opportunities
Data Engineer
Biohub, New York
Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general‑purpose system to accelerate sc
About the Team
Biohub is a 501(c)(3) biomedical research organization building the first large‑scale scientific initiative combining frontier AI with frontier biology to solve disease. We build the technology to help scientists around the world use AI‑powered biology to study how cells operate, organize, and work as part of systems to understand why disease happens and how to correct it. With our compute capacity, AI research and engineering, and state‑of‑the‑art technology for measuring, imaging, and programming biology, we are enabling scientists worldwide to use AI‑powered biology to advance our understanding of human health.
The Opportunity
The role is part of the Data Engineering team, which owns the strategy, sourcing, and implementation for data supporting AI research and development. Our goal is to maximize the speed, agility, and capability of biological AI research by connecting public data resources and Biohub's experimental platforms to AI systems. The data that trains biological frontier models comes in dozens of modalities (sequences, images, spatial coordinates, time series, molecular structures, metadata, publication artifacts), each with its own noise characteristics, biases, and information content. The question of how to represent this data for learning is one of the most important open problems in biological AI.
As a data engineer at Biohub, you'll be designing systems that ingest data from public repositories, transform heterogeneous biological formats into AI‑ready datasets, combine that with proprietary datasets, and deliver training datasets to researchers pushing the boundaries of what's possible in biological AI. The infrastructure you build will directly shape what our models can learn.
We're a small team with significant resources and long time horizons. We use AI tools aggressively in our own work—Claude Code, agents for workflow automation, LLMs for metadata extraction. We care about code quality, operational reliability, and building systems that scale. And we care about the biology: we want engineers who can recognize when a pipeline output is technically correct but scientifically wrong.
If you want to work at the intersection of large‑scale infrastructure and frontier science, with real autonomy and the chance to build something genuinely new, we'd like to talk.
What You'll Do
- Design and build data pipelines that process genomic and imaging data at petabyte scale
- Solve performance and bandwidth challenges with creative engineering
- Build agent‑based systems for automated dataset curation, quality control, and workflow generation
- Create tooling for data cataloging and registration that makes datasets discoverable and accessible
- Collaborate with AI Research teams to translate model requirements into data specifications, and with our scientists to integrate public and internal data into large‑scale AI‑ready datasets
- Improve pipeline reliability and observability, working toward 99%+ success rates without manual intervention
What You’ll Bring
- 8+ years of experience building reliable, operable data systems at scale (hundreds of terabytes to petabytes)
- Strong software engineering fundamentals
- Experience deploying distributed computing frameworks like Databricks, Spark, or Ray for large‑scale data processing
- Experience with cloud infrastructure (AWS preferred) and HPC environments
- Comfort with ambiguity; ability to make progress when requirements are evolving
- Interest in AI‑native development practices and tooling
- Nice to have: background in computational biology, bioinformatics, or life sciences, and experience with genomics datasets and formats (FASTQ, BAM, VCF) or imaging formats (OME‑Zarr, HDF5)
Compensation
The anticipated base pay range for a role in this field in Redwood City, CA, and New York City, NY, is $241,000–$338,000 annually. Compensation ranges will vary based on job‑related skills, level of experience, and knowledge. Actual placement in range is based on job‑related skills and experience, as evaluated throughout the interview process.
Benefits for the Whole You
- Generous employer match on employee 401(k) contributions to support planning for the future
- Paid time off to volunteer at an organization of your choice
- Funding for select family‑forming benefits
- Relocation support for employees who need assistance moving
If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.
Languages
- English