Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: Data Engineer
Data Engineer
Code4libOklahoma CityStart date: Available starting mid-May 2026 End date: 3 months after first date of work.HBS’s Baker Library is seeking a temporary Data Engineer to help launch a faculty citation data project aimed at
Data Engineer
Redhorse InternationalLondon*Data Engineer *Bringing to the UK unique graph data expertise, Redhorse International is preparing to set up a Data Practice, specialising in graph technologies. Focusing initially on a particular pi
Data Engineer
VORKISNew BraunfelsJob Summary We are an innovative company looking for a skilled Middle Data Engineer to join our dynamic team. Our collaborative culture fosters creativity and leverages the latest technologies. As a k
Data Engineer
CVS HealthWellesleyWe’re building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who c
Data Engineer
ZoetisParsippanyThe candidate will play a critical role in the development of an advanced‑analytics enabled enhancement of Zoetis' go‑to‑market sales strategy. The candidate will be responsible for shaping the core d
Data Engineer
Verdesian Life Sciences LLCCaryJob Category:Information Systems-TechnologyRequisition Number:DATAE001777Locations Showing 1 locationResponsibilitiesDesign, build, and maintain SQL-based data pipelines to support reliable data extra
Data Engineer
ROLEChicagoCareer Area: Technology, Digital and DataJob Description: Your Work Shapes the World at Caterpillar Inc.When you join Caterpillar, you'rejoining a global team who cares not just about the work we do –
Data Engineer
San Diego Community Power (CA)San DiegoAbout the Role The San Diego Community Power (SDCP) is seeking a seasoned Data Engineer to join our growing team of analytics experts who will be responsible for designing, maintaining, expanding, and
Data Engineer
NewGen Technologies (Maryland)HerndonThe Data Engineer will lead the data architecture and engineering delivery that enables AI/ML/GenAI solutions, ensuring data is trusted, secure, observable, and scalable from ingestion through consump
Data Engineer
Verdesian Life SciencesCaryJob Category: Information Systems-TechnologyRequisition Number: DATAE001777Locations Showing 1 locationResponsibilitiesDesign, build, and maintain SQL-based data pipelines to support reliable data ext
Data Engineer
General MotorsWarren## Data EngineerApplyremote type:Hybridlocations:Warren, Michigan, United States of America:Austin, Texas, United States of America:Mountain View, California, United States of Americatime type:Full ti
Data Engineer
ASB ResourcesCharlotteRole Responsibilities:Design, code, test, debug, and document technology solutions for complex projects and programs.Review and analyze large-scale technology solutions to align with tactical and stra
Data Engineer
Fervo EnergyHoustonFervo is building the most cost-effective, repeatable geothermal power plants in the world. Scaling that mission depends on a trustworthy, well-governed data foundation that turns raw sensor signals,
Data Engineer
K&L Gates LLPCharlotteJob Summary We are seeking an experienced Data Engineer with a strong focus on Microsoft Fabric to design, implement, and maintain enterprise data solutions. The ideal candidate will have a minimum of
Data Engineer
ChatGPT JobsLos AngelesOverview Data Engineer at Regard. Location: San Francisco, CA (Hybrid) or Remote (within NYC, LA, or SF metro areas).About The Role As a Data Engineer at Regard, you will build and maintain data pipel
Data Engineer
3 Oaks GamingPolandWe're looking for a Data Engineer to support the Commercial Department and analyze financial data from CRM , Back Office , and Jira . In this role, you'll be responsible for running monthly invoicing
Data Engineer
CreateFutureManchesterWorking at CreateFutureCreateFuture is an AI-native consulting partner where people do work that matters and are supported to do it well. We work alongside organisations such as PayPal, adidas, NatWes
Data Engineer
Absolute Business Solutions Corp.VirginiaCareer Opportunities with Absolute Business Solutions CorpA great place to work.Careers At Absolute Business Solutions CorpCurrent job opportunities are posted here as they become available.ABSC is se
Data Engineer
Unchain DataDenverAbout Kharon Kharon is a highly disruptive and incredibly innovative organization that navigates risk at the intersection of global security threats and international commerce. Operating at the nexus
Data Engineer
Inside Higher EdAlbanyThe Data Engineer is an integral part of the Information Technology Services department (ITS), reporting to the Director of Business Intelligence (BI). The role will be involved in efforts to ensure t
Data Engineer
ActivelyNew YorkAbout Actively AI Actively AI is defining a new category: Intelligence‑Led Revenue. Revenue organizations have always been bottlenecked on human capacity. Reps triage which accounts get attention. Con
Data Engineer
FCP EuroMilfordFCP Euro is seeking a highly skilled Data Engineer with hands‑on experience across the modern data stack to design, build, and maintain our data infrastructure. In this role, you will manage end‑to‑en
Data Engineer
Colgate-PalmolivePiscatawayRequisition ID 174077 -Posted - Information Technology - United States - New Jersey - Piscataway - Colgate-Palmolive - Travel - up to 10% of time - HybridNo Relocation Assistance OfferedJob Number#174
Data Engineer
OFG BancorpCharlotteOFG Bancorp is looking for an experienced Mid‑Level Data Engineer to design, build, and operate scalable data pipelines and data platforms supporting banking and financial services data domains, inclu
Data Engineer
NorthstratWausauNorthstratis seeking aDataEngineer to join the agile development team. The team builds andmaintains ETL pipelines that enable full spectrum data operation from ingest to query exceeding current and ex
Data Engineer
- Oklahoma City, Oklahoma, United States
- Oklahoma City, Oklahoma, United States
Über
HBS’s Baker Library is seeking a temporary Data Engineer to help launch a faculty citation data project aimed at better understanding how its collections support and influence scholarly research. This initiative involves identifying faculty publications, extracting their cited references, and analyzing the relationships within this data to generate meaningful insights into patterns of use and library collection impact. By analyzing citations, the project seeks to surface evidence of how Baker’s resources contribute to the research ecosystem at HBS.
Reporting to Baker Library’s User Needs and Assessment Librarian, this temporary Data Engineer role will focus on the final phase of the project, where a corpus of raw citation data has already been collected and aggregated from multiple sources. At this stage, the data requires careful cleaning, normalization, and transformation to ensure it is accurate, consistent, and suitable for analysis. The individual in this role will work with this messy dataset to standardize fields, resolve inconsistencies, and prepare the data for downstream analytical work. This phase is critical to ensuring the reliability and interpretability of the project’s findings and will directly shape the quality of insights generated about Baker’s impact.
This is a temporary, full-time, remote position. Employees in fully remote positions must work all scheduled hours in a Harvard registered state in compliance with the University’s Policy on Employment Outside of Massachusetts . Specific hours and work days will be determined by business needs and are subject to change with appropriate advanced notice.
Responsibilities
Clean and normalize raw citation data by resolving inconsistencies in author names, publication titles, journal names, and other variables
Co‑develop and apply standardized schemas for field names and data structures to ensure consistency across the dataset
Design and implement reproducible data cleaning workflows using scripts that can be reused
Co‑create or locate unique identifiers (e.g., for authors, works, journals) to enable accurate linking and deduplication across records
Perform record linkage and deduplication using techniques such as fuzzy matching and string comparison
Assess and improve data quality by identifying missing, inconsistent, or anomalous values and determining appropriate remediation strategies
Conduct exploratory analysis to evaluate the completeness and reliability of the dataset, including identifying patterns of data gaps
Collaborate with project stakeholders to align data cleaning decisions with project goals
Explore connection points for citation data with other HBS administrative datasets
Document data transformations, data dictionaries, and workflows to support transparency, reproducibility, and future project phases
Qualifications
Experience working with messy, real‑world datasets
Advanced proficiency in R (preferred), using libraries such as dplyr, tidyr, and tidyverse, or Python, using libraries such as pandas
Familiarity with regular expressions (regex), string comparison, and fuzzy matching
Proficient understanding of standardization principles and controlled vocabularies
Ability to balance precision and pragmatism when making decisions in the absence of perfect information
Comfort documenting processes and decisions for both technical and non‑technical audiences
Ability to work independently while also seeking input when project ambiguity or edge cases arise
Ability to envision how data cleaning and manipulation serve larger project goals
Basic understanding of academic publishing and citation formats
Proficiency in Microsoft Office tools (Outlook email, Teams sites, folder management, file retrieval)
#J-18808-Ljbffr
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.