Dieses Stellenangebot ist nicht mehr verfügbar
Senior Analytics Engineer
Carbon Arc
- New York, New York, United States
- New York, New York, United States
Über
We are a high-trust, high-impact team that takes end-to-end ownership of complex data problems, working across petabyte-scale datasets in a fast-paced, collaborative environment.
We’re looking for a Senior Analytics Engineer to own and advance critical data pipelines, build production-grade analytics infrastructure, and contribute to the ML and AI products that differentiate Carbon Arc. You will work across the full data stack — from raw extraction through certified delivery — designing scalable systems that ensure data quality, enrich our ontology, and power knowledge graph-driven applications. This role blends data engineering rigor with analytical depth and product sensibility.
What You'll Do
• Own one or more alternative datasets end-to-end through our data pipeline (extraction, cleaning, transformation, certification, and delivery), ensuring data quality and timeliness across petabyte-scale workloads.
• Build and maintain Entity Explorer Data (EXD) tables that serve as the foundation for client-facing APIs, dashboards, and analytical products, including demographic, financial, and behavioral metrics at varying entity and temporal granularities.
• Design and execute ontology mapping pipelines — resolving entities (brands, companies, products, locations) across disparate data sources using deterministic and probabilistic methods, and maintaining mapping infrastructure at scale.
• Develop and run automated EDA and confidence scoring pipelines to evaluate data quality, detect anomalies, and quantify panel representativeness across datasets and time periods.
• Contribute to the construction and maintenance of the Carbon Arc knowledge graphs.
• Build and extend internal tooling and infrastructure - including report templates, monitoring frameworks, bulk data generation, and Pydantic-based validation models - to improve team velocity and pipeline reliability.
• Collaborate with Engineering, Product, and Insights teams to translate business requirements into scalable data solutions, and participate in cross-functional initiatives such as data infrastructure migration, metadata management, and platform cost optimization.
What You'll Bring
• 4+ years of experience building and maintaining production data pipelines and analytics infrastructure at scale.
• BS or MS in Computer Science, Data Science, Statistics, Engineering, or a related quantitative discipline.
• Deep proficiency in Python and SQL, with hands-on experience writing PySpark and working with distributed query engines (Trino, Spark).
• Strong experience with columnar data formats and lakehouse architectures (Apache Iceberg, Parquet, S3-backed data lakes).
• Familiarity with data transformation frameworks such as DBT and pipeline orchestration tools (Airflow, or equivalent).
• Experience with entity resolution, ontology design, or semantic data modeling across structured and semi-structured datasets.
• Demonstrated ability to take ambiguous data problems from scoping through production deployment with minimal oversight.
• Clear and precise technical communication, with a track record of strong documentation and cross-functional collaboration.
Nice to Have
• Hands-on experience with graph databases (Neo4j) and knowledge graph construction, including ontology-driven node/relationship modeling.
• Familiarity with GenAI tooling and LLM-based applications, including RAG architectures, prompt engineering, and tool-use frameworks (e.g., MCP).
• Experience with StarRocks, ClickHouse, or other OLAP engines for high-performance analytical queries.
• Experience building internal developer tools, CLI applications, or data quality monitoring dashboards.
What We Offer
• Fully paid healthcare benefits (Medical, Dental, Vision)
• Remote work options
• Paid time off
• Generous parental and family leave policies
• Strong work culture focused on integrity, transparency and excellence.
Location Carbon Arc is headquartered in New York City and operates as a remote-first company. Team members are expected to travel for in-person onsite gatherings at least one week per quarter to collaborate, plan, and connect as a team.
Disclaimer Note to Recruiters and Placement Agencies:
Carbon Arc does not accept unsolicited resumes from recruiters or placement agencies. Any resume submitted without a prior written agreement will be deemed the property of Carbon Arc, and no fee will be paid in the event of a hire.
Equal Opportunity Employer
Carbon Arc is an Equal Opportunity Employer and is committed to fair and equitable hiring practices. All hiring decisions are based on strategic business needs, job requirements, and individual qualifications. All candidates are considered without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, pregnancy, veteran or military status, citizenship status, or any other characteristic protected by applicable federal, state, or local law.
Accommodations
Carbon Arc is committed to providing reasonable accommodations throughout the hiring process for qualified individuals with disabilities. If you require an accommodation to participate in the application or interview process, please contact us at careers@carbonarc.co.
Apply for this role Submit your application below and we'll be in touch.
First Name *
Last Name *
Email *
Phone Number *
Location *
Work Authorization *
Will you now or in the future require employment visa sponsorship? *
#J-18808-Ljbffr
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.