Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Founding Data Engineer (Core Data Platform)
Data Engineer (Founding Team)
FabrionSan FranciscoData/ETL Engineer (Founding Team) Location: San Francisco Bay AreaType: Full-TimeCompensation: Competitive salary + early-stage equityBacked by 8VC, we're building a world-class team to tackle one of
Founding Backend Platform Engineer, AI-Driven IoT
The Electric PlantSan FranciscoThe Electric Plant is looking for a Founding Software Engineer in San Francisco to build the software platform integrating IoT hardware and AI. You'll influence architectural decisions and collaborate
Founding Data Analyst
Social LeverageSan FranciscoPosition Overview SyntheticFi, a YC-backed company, is growing at an extraordinary 20%+ MoM over the past year. We provide a product that our target customers (financial advisors) view as a true diffe
Founding Analytics Engineer Data Architect for Growth
Success Matcher RecruitmentSan FranciscoSuccess Matcher Recruitment is seeking a founding analytics engineer to build a robust data layer from scratch. This role offers total ownership over analytics data models and direct influence on comp
Founding Data Engineer: Build Scalable Pipelines (Hybrid SF)
MundiSan FranciscoProbably Genetic in San Francisco is looking for a Founding Data Engineer to develop data infrastructure that supports both internal insights and customer solutions. You will work collaboratively with
Founding Data Engineer ($145k-$215k + Equity) at Stealth YC company
Jack & Jill/External ATSSan FranciscoFounding Data Engineer Salary: $145,000 - $215,000 + EquityCompany Description: VC-backed healthtech AI startupJob Description: You will own the data foundation for an AI-first medical practice, build
Senior Data Engineer - Hybrid City Data Platform
City-and-County-of-SAN-FranciscSan FranciscoThe City and County of San Francisco is hiring a Senior Data Engineer for the DataSF team to ensure robust data infrastructure. This role includes managing Snowflake, developing data pipelines, and co
Senior Data Engineer: Own Real-Time Data Platform
TRIUMPHSan FranciscoTRIUMPH is seeking its first dedicated Data Engineer to own the full data stack at our rapidly growing gaming platform. You will be responsible for architecting and optimizing our data warehouse using
Platform Data Engineer II: BigQuery & Cloud Pipelines
Neon RedwoodSan FranciscoNeon Redwood, a data services consulting company in San Francisco, is seeking an experienced Data Engineer II to enhance their data infrastructure. The candidate should have at least 2 years of experi
Data Engineer, Knowledge Graphs Biotech AI Platform
MithrlSan FranciscoMithrl is seeking a Data Engineer, Knowledge Graphs to build the infrastructure for their biological knowledge layer. In this role, you will partner closely with data scientists to create scalable ETL
Staff Frontend Engineer, Client Data & Networking Platform
NerdleveltechSan FranciscoAirbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every co
Cloud-Native Data Platform Engineer for Scalable Analytics
Women In BioSan FranciscoWomen In Bio is seeking a Senior Data Platform Engineer in San Francisco, CA. This role involves implementing end-to-end data solutions and managing cloud infrastructure. Candidates should have a Bach
Staff Backend & Data Engineer AI-Driven Health Platform
Trial Library, Inc.San FranciscoTrial Library, Inc. is seeking a Staff Engineer based in San Francisco, CA to lead backend development and data platform enhancements. This role focuses on leveraging cloud-native architecture to impr
Data Platform Engineer Scale Real-Time Analytics & Infra
FairygodbossSan FranciscoDoorDash is seeking a Data Platform Engineer based in the Bay Area to lead the vision and strategy for a rapidly growing analytics framework. You will scale the platform for increasing data workloads
Senior Backend Engineer: AI-Driven Schema & Data Platform
Madrona Venture LabsSan FranciscoMadrona Venture Labs in San Francisco is seeking a talented engineer to design the core EAV data model and build a Git-style version-control engine. You'll work with AI-assisted tooling and ensure the
Nurse Practitioner or Physician Assistant (Castro) - Sign-On Bonus Available
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Primary Care Physician (Spear Street) - Sign-On Bonus Available
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Nurse Practitioner or Physician Assistant (Pacific Heights) - Sign-On Bonus Available
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Primary Care Physician (Transbay Center) - Sign-On Bonus Available
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Nurse Practitioner or Physician Assistant (Duboce Triangle) - Sign-On Bonus Available
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Expanded Care Family Nurse Practitioner or Physician Assistant (All Ages) (Sign-on Bonus Available)
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Nurse Practitioner or Physician Assistant (Transbay Center) - Sign-On Bonus Available
One MedicalSan FranciscoAbout Us One Medical is a primary care solution challenging the industry status quo by making quality care more affordable, accessible and enjoyable. But this isn’t your average doctor’s office. We
Data Scientist, Customer & Product Insights
NasdaqSan FranciscoNasdaq, Inc. is hiring a Data Science Analyst to transform customer and product data into actionable insights. You will design analytical frameworks, build AI models, and collaborate with teams to enh
Physician / Gynecology / Utah / Permanent / Physician - Obstetrics / Gynecology - Laborist in Utah J
UtahSan FranciscoAre you a Laborist physician searching for your next exciting locum tenens opportunity? This position with one of VISTA's healthcare partners in Utah might just be the opportunity for you!Opportunity
Strategic Chief of Staff to CRO - Biotech Reg & Ops
BridgeBio PharmaSan FranciscoBridgeBio Pharma in Washington DC seeks a Chief of Staff to the Chief Regulatory Officer. This dynamic leadership role centers on enabling strategic initiatives across Regulatory Affairs, Portfolio St
Data Engineer (Founding Team)
- San Francisco, California, United States
- San Francisco, California, United States
Über
Type: Full-Time
Compensation: Competitive salary + early-stage equity
Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.
About the Role We’re building a multi-tenant, AI-native platform where enterprise data becomes actionable through semantic enrichment, intelligent agents, and governed interoperability. At the heart of this architecture lies our Data Fabric — an intelligent, governed layer that turns fragmented and siloed data into a connected ontology ready for model training, vector search, and insight-to-action workflows.
We\u2019re looking for engineers who enjoy hard data problems at scale : messy unstructured data, schema drift, multi-source joins, security models, and AI-ready semantic enrichment. You’ll build the backend systems, data pipelines, connector frameworks, and graph-based knowledge models that fuel agentic applications.
If you\u2019ve worked on streaming unstructured pipelines, built connectors into ugly legacy systems, or mapped knowledge graphs that scale — this role will feel like home.
Responsibilities
Build highly reliable, scalable data ingestion and transformation pipelines across structured, semi-structured, and unstructured data sources
Develop and maintain a connector framework for ingesting from enterprise systems (ERPs, PLMs, CRMs, legacy data stores, email, Excel, docs, etc.)
Design and maintain the data fabric layer — including a knowledge graph (Neo4j or Puppygraph) enriched with ontologies, metadata, and relationships
Normalize and vectorize data for downstream AI/LLM workflows — enabling retrieval-augmented generation (RAG), summarization, and alerting
Create and manage data contracts, access layers, lineage, and governance mechanisms
Build and expose secure APIs for downstream services, agents, and users to query enriched semantic data
Collaborate with ML/LLM teams to feed high-quality enterprise data into model training and tuning pipelines
What We’re Looking For Core Experience:
5+ years building large-scale data infrastructure in production environments
Deep experience with ingestion frameworks (Kafka, Airbyte, Meltano, Fivetran) and data pipeline orchestration (Airflow, Dagster, Prefect)
Comfortable processing unstructured data formats: PDFs, Excel, emails, logs, CSVs, web APIs
Experience working with columnar stores, object storage, and lakehouse formats (Iceberg, Delta, Parquet)
Strong background in knowledge graphs or semantic modeling (e.g. Neo4j, RDF, Gremlin, Puppygraph)
Familiarity with GraphQL, RESTful APIs, and designing developer-friendly data access layers
Experience implementing data governance : RBAC, ABAC, data contracts, lineage, data quality checks
Mindset & Culture Fit:
You\u2019re a system thinker: you want to model the real world, not just process it
Comfortable navigating ambiguous data models and building from scratch
Passionate about enabling AI systems with real-world, messy enterprise data
Pragmatic about scalability, observability, and schema evolution
Value autonomy, high trust, and meaningful ownership over infrastructure
Bonus Skills Prior work with vector DBs (e.g. Weaviate, Qdrant, Pinecone) and embedding pipelines
Experience building or contributing to enterprise connector ecosystems
Knowledge of ontology versioning , graph diffing , or semantic schema alignment
Familiarity with data fabric patterns (e.g. Palantir Ontology, Linked Data, W3C standards)
Familiar with fine-tuning LLMs or enabling RAG pipelines using enterprise knowledge
Experience enforcing data access policy with tools like OPA , Keycloak , Snowflake row-level security
Why This Role Matters Agents are only as smart as the data they operate on. This role builds the foundation — the semantic, governed, connected substrate — that makes autonomous decision-making and agent action possible. From factory ERP records to geopolitical news alerts, the data fabric unifies it all.
If you\u2019re excited to tame complexity, unify chaos, and power intelligent systems with trusted data — we’d love to hear from you.
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.