Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Remote AI Data Engineer — Scalable Data Pipelines
Data Engineer Scalable Pipelines NYC
GuacNew YorkAt Guac, we're solving grocery food waste with AI. We forecast exactly how much of each product will sell, helping grocery retailers order and produce the perfect amount of inventory — and we're build
Data Engineer — Scalable Pipelines & Equity
Notion Labs, Inc.New YorkAbout The Role: As Notion continues to grow rapidly, we're seeking talented data engineers to join our team and help us build foundational datasets and pipelines, as well as the infrastructure that su
Data Engineer I: Scalable Pipelines & Data Security
Sentry InsuranceNew YorkData Engineers are responsible for designing, implementing, and maintaining data systems, data processes and databases. In this role, you’ll be responsible for implementing scalable data movement proc
Senior Data Engineer — Scalable Data Pipelines & Privacy
American Civil Liberties Union, Inc.New YorkThe American Civil Liberties Union, Inc. is seeking a Senior Data Engineer for a full-time hybrid position in New York, NY. The role involves designing and maintaining data pipelines that enhance fund
Senior Data Engineer — Scalable Music Data Pipelines
ChartmetricNew YorkChartmetric, a startup in New York, seeks a Senior Data Engineer to build and optimize ETL pipelines for over 10M artists, and maintain a complex multi-cloud data ecosystem. The ideal candidate will h
Data Engineer Build Scalable AI Data Pipelines
United States Digital Space LLCNew YorkSpecialist Solutions Architect Manager for Healthcare & Life Sciences (HLS) Mission: Reporting to the Tech GM of Field Engineering, the Specialist Solutions Architect Manager for Healthcare & Life Sci
Senior Data Engineer: Scalable Pipelines & Analytics
WellSky CorporationNew YorkWellSky Corporation is looking for a Staff Data Engineer to design, build, and maintain scalable data pipelines and infrastructure. The role will involve collaboration with data scientists, analysts,
Senior Data Engineer - Scalable Pipelines & Data Products
Take-TwoNew YorkTake-Two Interactive Software, Inc. is seeking a motivated Senior Data Engineer to join our Data Engineering team in New York City. The ideal candidate should have strong Python skills and experience
Senior Data Engineer: Architect Scalable Data Pipelines
RazorfishNew YorkRazorfish is seeking a Sr. Data Engineering Analyst in New York. This role involves optimizing data architecture and supporting data teams in delivering quality data pipelines. The successful candidat
Fabric Data Engineer — Build Scalable Data Pipelines
B2N Management Consulting KeralaNew YorkB2N Management Consulting Kerala is looking for a Software Developer to join their team in the United States. The successful candidate will perform data analysis, translate requirements into scalable
Data Engineer, Health Tech: Scalable Data Pipelines
Fuze Health Inc.New YorkFuze Health Inc. is looking for a Data Engineer to build and maintain data platforms for operational teams and decision-makers. This role involves deploying new data pipelines, designing observability
Senior Data Engineer: Build Scalable Data Pipelines & Analytics
StrykerNew YorkStryker Corporation is seeking a Sr. Data Engineer to design and implement data pipelines for analytical capabilities. This role requires proficiency in Python and SQL, and experience with cloud envir
Agentic AI Data Engineer: Scalable Enterprise Pipelines
KyndrylNew YorkKyndryl is seeking an Agentic AI Data Engineer to build and optimize data platforms for enterprise-scale AI. You will design and implement data pipelines, manage vector databases, and ensure data qual
Data Engineer - NYC Hybrid, Scalable Pipelines & Analytics
Fubo SportsbookNew YorkFubo Sportsbook is seeking a Software Engineer to join their Data Engineering team. This NYC-based hybrid role involves designing and building data ingestion pipelines, analytics systems, and microser
AI Data Engineer: NLP, LLMs & Scalable Pipelines
WiproNew YorkWipro is seeking AI/ML specialists to design and optimize models for data analysis and NLP applications. Ideal candidates should be IIT graduates with strong skills in Python, TensorFlow, and experien
Remote Data Engineer II: Scalable Spark/Python Pipelines
Magnite, Inc.New YorkMagnite, Inc. is seeking a Data Engineer II to build and optimize data pipelines utilizing Apache Spark and Python. This role emphasizes collaboration within teams to enhance audience data pipelines a
Senior Data Engineer - LATAM: Build Scalable Pipelines
Nimble Gravity, LLCNew YorkNimble Gravity, LLC is seeking a Data Engineer to develop and scale data solutions in the United States. The role involves building high-performance data pipelines and collaborating with business stak
Data Engineer: Build Scalable Pipelines & Cross‑Team Impact
Talentzo DelhiNew YorkTalentzo Delhi is looking for a data engineer to develop and enhance data engineering solutions. The role involves building reliable and scalable data pipelines and collaborating closely with cross‑fu
Data Engineer Associate: Build Scalable ETL Pipelines in NYC
AretoveNew YorkAretove Inc is seeking a Data Engineer Associate to join their Business Intelligence Practice in New York City. You will build and maintain data pipelines to deliver impactful insights for clients. Th
Remote Data Engineer: Pipelines & Insights
SwiftCruitNew YorkSwiftCruit is seeking a Data Engineer to manage and process data for analytical and operational needs. This remote role involves working with data pipelines and ensuring data quality. The ideal candid
Remote Salesforce Data Engineer for Scalable CRM Data
Downtown Boulder PartnershipNew YorkDowntown Boulder Partnership is seeking a Data Engineer to join our boutique web development agency. This role is essential in managing data within Salesforce, ensuring seamless integration and qualit
Senior Data Engineer: Scalable Data Platforms + Equity
SecurityScorecardNew YorkSecurityScorecard is looking for a Senior Data Engineer who will play a crucial role in building and maintaining scalable applications. You’ll own projects throughout the Software Development Life Cyc
Senior Azure Data Engineer Remote Data Pipelines
Value Innovation LabsNew YorkValue Innovation Labs is seeking a Senior Data Engineer to design and maintain scalable data pipelines primarily using Azure services. This role is critical for ensuring high-quality datasets are avai
Remote Senior Data Engineer — Fintech Data Pipelines
Branch Messenger Inc.New YorkBranch Messenger Inc. is seeking a Senior Data Engineer to join their team in the U.S. In this role, you'll own end-to-end data systems and pipelines that other teams rely on. The ideal candidate has
Senior Data Engineer Data Pipelines & Insights
Paramount PicturesNew YorkParamount Pictures in New York seeks a Senior Data Engineer to develop processes and systems for analyzing unstructured data. You will build data marts and own the lifecycle of data from ingestion to
Data Engineer Scalable Pipelines NYC
- New York, New York, United States
- New York, New York, United States
Über
The grocery industry is enormous (it accounts for 4% of GDP) — and grocery food waste is a huge cost to grocers' bottom lines, but also to our planet.
Today, we're working with major supermarket chains in the US and Canada, and we've scaled to 7‑figures in ARR. We're backed by leading investors including YCombinator, 1984 Ventures, Collaborative Fund, and angels from Open AI, Instacart, and Citadel Securities.
We've brought together an exceptional team from Palantir, BCG, Oxford, Cambridge, and MIT to solve intellectually challenging problems and tackle food insecurity and waste with technology.
We're looking for talented data engineers in NYC to join our mission.
About the Role As a Data Engineer at Guac, you'll own the data infrastructure that powers our forecasts — the pipelines that ingest billions of rows of transaction, inventory, and operational data from grocers across the continent, and the systems that turn that data into accurate predictions multiple times a day.
You'll shape how we model new customers' data, build pipelines that scale across chains with hundreds of stores, and work on our ML systems to make them faster and more accurate. You'll occasionally work directly with customers' technical teams to understand their data and business logic — but the bulk of your time is on engineering.
Your responsibilities will include:
Data & Pipelines
Design and build ETL pipelines that process billions of rows of data multiple times per day across customers, using Python, Dagster, and Pub/Sub
Model new customer datasets and own the data layer for new deployments — from raw integration to forecast‑ready
Optimize our ML pipelines for demand forecasting — making them faster, cheaper, and more accurate at scale
Partner with customers' technical teams to understand their data systems and business logic, and translate that into our pipelines
Backend
Contribute to backend services (Python/FastAPI) that power our ordering and production planning products
Build internal tools and APIs that expose forecasts and data to our application layer
Expose our data and systems to LLMs via MCP servers, tool‑use APIs, and similar protocols
About You
3+ years of relevant data engineering experience
Strong proficiency in Python (Pandas, etc.) and SQL
Proven experience designing and implementing ETL systems across large distributed datasets, using orchestration tools like Dagster or Airflow
Comfortable operating with ambiguity and minimal process — you thrive when given a problem and trusted to figure out the solution
AI‑native: you use Claude Code, Cursor, or similar AI coding tools daily and ship significantly faster because of it
(Bonus) Experience optimizing ML pipelines or working closely with ML/forecasting systems
(Bonus) Experience with distributed computing frameworks like PySpark or Dask
What We Offer
First‑hand experience building an early‑stage startup with real ownership
Compensation: $150k–$250k base + competitive equity
Fully employer‑paid healthcare (medical, dental, and vision)
Unlimited vacation days
Fully covered food expenses in the office (lunch/dinner)
Free Equinox membership
Our Tech Stack
Languages & Frameworks: Python, FastAPI, SQL
Data & Pipelines: Dagster, Pub/Sub, BigQuery, Postgres, Dask, Pandas
Cloud & Infrastructure: GCP, Terraform, Docker
AI: MCP servers, Anthropic/OpenAI APIs, agentic tooling
Note: this is a 5x day a week in person role in NYC
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.