XX
Data Engineer IIThorlabsUnited States
XX

Data Engineer II

Thorlabs
  • US
    United States
  • US
    United States

À propos

At Thorlabs, we design and manufacture components, instruments, and systems that transform the world by identifying, enabling, and accelerating key photonics (i.e., light-based) technologies. Backed by a dedicated workforce of more than 3,000 employees worldwide, Thorlabs contributes to cutting-edge research and real-world innovation. Whether you're early in your career or bringing years of experience, you’ll find opportunities to grow, take ownership, and make meaningful contributions from day one. We know every employee brings unique talents and perspectives that fuel our success, and we seek driven individuals who are excited to make an impact in a fun, fast-moving culture. The Data Engineer builds, maintains, and improves reliable data pipelines and models that power analytics, reporting, and data-driven decision-making at Thorlabs. This position develops ETL/ELT processes, validates and optimizes SQL and compute logic, documents lineage and definitions, and collaborates with Data Analysts, Data Stewards, and business partners to deliver trusted and performant datasets. The role requires foundational understanding of Python/PySpark and basic AI/ML enablement (e.g., simple notebooks, scikit-learn examples, Azure ML notebooks) to support data preparation and experimentation. Although the location of the position is in Newton, NJ, from time to time it may be required to undertake duties at other Thorlabs locations. Essential Job Functions include the following , but are not limited to: Develop and maintain ELT/ETL pipelines (Synapse Pipelines, Data Factory, SQL, notebooks) moving data from D365 into ADLS and Synapse; schedule, monitor, and troubleshoot jobs for reliability. Write efficient SQL (joins, views, CTEs, window functions) and basic PySpark/Python to transform and model data for analytics and certified datasets. Implement and monitor data quality checks; work with Data Stewards to triage issues and contribute to rootcause analysis and remediation. Apply foundational query and file-format optimizations (e.g., partitioning) under guidance; escalate platformlevel tuning or performance concerns to the Lead. Maintain metadata, lineage, and clear documentation for tables, transformations, business rules, and dataset logic. Use Git and Azure DevOps for version control, code reviews, task management, and release coordination. Support analysts and report authors during user acceptance testing; address defects and iterate improvements. Contribute to basic AI/ML enablement by preparing features, managing datasets, running training or inference notebooks, and assisting with experiment tracking under guidance.
In addition to the essential functions and duties listed above, all positions are also responsible for: Meeting company standards pertaining to quantity and quality of work performed on an ongoing basis, performing all work related tasks in a manner that is in compliance with all Company policies and procedures. Adhering to Company policies, procedures, and directives regarding standards of workplace behavior in completing job duties and assignments. The Company retains the right to change or assign other duties to this position. Physical Activities:
This is largely a sedentary role; however, some filing is required. This may include lifting files, opening cabinets, bending, or standing as necessar Requirements Experience: 5+ years of experience in data engineering, analytics engineering, or related internships/co-ops. Foundational SQL skills (joins, aggregations, CTEs; capable of reading and modifying queries). Exposure to Azure data services such as Synapse (serverless/dedicated), ADLS, and Data Factory/Synapse Pipelines. Basic experience with Python or PySpark for data preparation or orchestration. Familiarity with BI/reporting datasets (Power BI preferred) and data lineage concepts. • Experience using Git/Azure DevOps for work items, code reviews, and basic CI/CD processes. Strong written and verbal communication skills; organized, detail-oriented, and comfortable working with deadlines. Education: Bachelor’s degree in Computer Science, Information Systems, Data/Computer Engineering, Analytics, or a related field. Relevant Microsoft or Analytics/Engineering certifications are a plus. Specialized Knowledge and Skills: Understanding of ELT/ETL concepts, data modeling for analytics (star/snowflake), and performance fundamentals (partitioning, file formats). Proficiency in SQL with the ability to write maintainable transformation code and validate results. Basic Python or PySpark capabilities; comfortable working in notebooks for data prep and simple orchestration tasks. Familiarity with Azure Synapse, ADLS, and orchestration tools; understanding of Azure DevOps and Git workflows. Awareness of AI/ML concepts: ability to run or modify simple notebooks, understand basic model types (classification/regression), and support pipelines used for experimentation. Strong documentation habits and collaboration skills; proficiency in Microsoft Office (Outlook, Excel, PowerPoint, Word). Pay range for this position is $ 85,000-$106,000 depending on experience Thorlabs values its diverse environment and is proud to be an Equal Employment Opportunity/Affirmative Action Employer. All qualified individuals will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age or veteran status. Job descriptions are not intended as and do not create employment contracts. The organization maintains its status as an at-will employer. Employees can be terminated for any reason not prohibited by law. Benefits Thorlabs offers a complete benefits package that includes medical, dental and vision insurance, company paid life insurance, a generous PTO package, a 401(k) plan, and tuition reimbursement just to name a few..
  • United States

Compétences linguistiques

  • English
Avis aux utilisateurs

Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.