Jobbörse
Finde Jobs in deiner Nähe – ob vor Ort, hybrid oder remote.- Ähnliche Jobs zu: PySpark Data Engineer
PySpark Data Engineer
Diverse LynxUnited StatesData EngineerWe are seeking a highly skilled and motivated Data Engineer to play a pivotal role in designing, building, and optimizing our next-generation scalable data pipelines. This position requir
PySpark Data Engineer
Holistic Partners, IncIrvingVisa U.S. Citizens/Green Card/OPT-EAD ONLY due to legal or government contract requirementSummary As a Data Engineer, you will be responsible for designing, developing, and maintaining data solutions
Senior PySpark Data Engineer
COVETUSIrvingAs a Data Engineer, you will be responsible for designing, developing, and maintaining data solutions for data generation, collection, and processing in Big Data environment using predominantly PySpar
Sr. Pyspark Data Engineer
SARIAN CoUnited StatesRole: Sr. PySpark Data Engineer - FulltimeLocation: Irving, TXJob Description:We are seeking a skilledPySpark Data Engineerto join our team. The ideal candidate will have expertise in big data process
Senior PySpark Data Engineer
Tata Consultancy ServicesIrvingJob Description As a Data Engineer, you will be responsible for designing, developing, and maintaining data solutions for data generation, collection, and processing in a Big Data environment using pr
Hadoop & PySpark Data Engineer
InfosysCharlotteInfosys Limited is seeking talented individuals in Charlotte, NC to contribute to financial services transformations using advanced AI technologies. Responsibilities include documenting business requi
Senior PySpark & Big Data Engineer
VirtusaNew YorkVirtusa is seeking a Big Data professional in New York, New York, to design and deploy solutions using technologies like PySpark and Python. The ideal candidate will possess strong analytical and comm
PySpark Data Engineer – Spark, Hadoop & AI Analytics
InfosysIrvingInfosys is looking for a candidate to join their Data and Analytics (DNA) unit, transforming data into actionable insights. You will contribute to software solutions, facilitate discussions, and enhan
Senior PySpark Data Engineer - Cloud ETL & Pipelines
Holistic Partners, IncIrvingHolistic Partners, Inc in Irving, Texas, is seeking a Data Engineer to design, develop, and maintain data solutions in a Big Data environment. The role focuses on using PySpark to create data pipeline
Senior Big Data PySpark Consultant Design & Deliver
VirtusaNew YorkVirtusa is seeking a BigData PySpark Consultant based in New York, NY. In this role, you’ll be responsible for designing, building, and deploying solutions in Bigdata and PySpark. The ideal candidate
Digital - Senior Data Engineer / Data Engineer, Digital Technology
AritziaSeattleResponsibilities Design, develop, and optimize scalable data pipelines (batch and real‑time) that power unified customer data and insights Partner closely with analytics, data science, and business st
Data Engineer
Neurons Lab LTDNew YorkAbout the project (description, duration, stage) Join Neurons Lab as a Data Engineer on a new engagement with a regulated UK & Ireland credit and lending company . The client has lifted data from mult
Data Engineer
Duck River Electric Membership CorporationSumtervilleGeneral Purpose of Job Data Engineer role is responsible for design, build, test, deployment, and support of enterprise data pipelines (CI/CD), lakehouse tables, transformation frameworks, change data
Data Engineer
Tata Consultancy ServicesO'FallonProgramming: Java (Core), Python (for Airflow), Unix Shell Scripting.Big Data/Storage: Apache Spark, MinIO, AWS S3.Key ResponsibilitiesPipeline Orchestration: Design and develop complex, reusable DAGs
Data Engineer
Blue Cross Blue Shield companiesChicagoJob DescriptionDesign, build, and maintain reliable, high-performance data pipelines for large-scale structured and unstructured healthcare data.Use PySpark and modern cloud-based tools (Databricks, A
Data Engineer
Berkshire Hathaway GuardPlanoOverview Good Things Start Here.Good things are happening at Berkshire Hathaway GUARD Insurance Companies—an A+ (Superior) rated, nationwide Property & Casualty insurer backed by Berkshire Hathaway. W
Data Engineer
EquifaxUnited StatesEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to h
Data Engineer
Adelante HealthcareUnited States*We do not offer visa sponsorship and are unable to support work authorization or visa transfer of any kind. Applicants MUST be authorized to work in the United States without sponsorship now or in th
Data Engineer
Corning IncorporatedUnited StatesRequisition Number: 74967The company built on breakthroughs. Join us.Corning is one of the world's leading innovators in glass, ceramic, and materials science. From the depths of the ocean to the fart
Data Engineer
Berkshire Hathaway GUARD Insurance CompaniesUnited StatesOverviewGood Things Start Here.Good things are happening at Berkshire Hathaway GUARD Insurance Companies-an A+ (Superior) rated, nationwide Property & Casualty insurer backed by Berkshire Hathaway. Wi
Data Engineer
Steel DynamicsUnited StatesSubsidiarySteel DynamicsOverviewLocation:Position can be executed from any of the following Steel Dynamics offices. This position is an onsite position. Butler, IN Columbus, MS Terre Haute, IN Pittsbu
Data Engineer
ROI Healthcare Solutions, LLCUnited StatesAbout the Role We are a fast-paced, high-growth company with a strong data analytics foundation - and we're looking for a Data Engineer to own, grow, and elevate our capabilities. You'll take the lead
Data Engineer
Intellisoft TechnologiesUnited StatesSenior Data EngineerPosition: Senior Data Engineer Location: Remote Duration: 6+ Months Client: National Grid - Boston Visa Restrictions: None Sub Vending: No Pay Rate: $58/Hr on W2 without benefits B
Data Engineer
Oak St. HealthUnited StatesData EngineerWe're building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate col
Data Engineer
6AM CityColumbiaJob Description Integer Technologies LLC Columbia, South Carolina Metropolitan Area (On-site)What we do Integer Technologies is an applied research and product development company founded by scientist
Über
We are seeking a highly skilled and motivated Data Engineer to play a pivotal role in designing, building, and optimizing our next-generation scalable data pipelines. This position requires expertise in processing massive datasets using cutting-edge technologies like Apache Spark, PySpark, and Hive within a dynamic cloud environment. Your primary objective will be to ensure the utmost data reliability, speed, and efficiency, providing a robust foundation for downstream business intelligence and advanced analytics initiatives. Roles & Responsibilities:
Data Pipeline Development & Maintenance: Design, build, and maintain highly scalable and efficient ETL/ELT data pipelines utilizing PySpark and Spark SQL for complex data transformations. Cloud Data Infrastructure Management: Deploy, manage, and scale critical data infrastructure components on leading cloud platforms such as Amazon Web Services (AWS) (e.g., EMR, Glue), Microsoft Azure (e.g., Databricks, Synapse), or Google Cloud Platform (GCP). Data Warehousing & Storage Optimization: Strategically manage data layout, partitioning, and indexing within Apache Hive and various cloud data lake solutions to optimize performance and accessibility. Performance Tuning & Optimization: Proactively identify and resolve performance bottlenecks in Spark jobs, leveraging Spark UI for in-depth analysis, effectively managing data skewness, and optimizing memory utilization. Diverse Data Integration: Develop robust solutions for ingesting high-volume and diverse datasets from both structured relational databases and unstructured flat files into our data ecosystem. Automated Workflow Orchestration: Implement and manage automated data workflows using industry-standard scheduling tools like Apache Airflow or platform-native schedulers, ensuring timely and reliable data delivery. Strategic Collaboration: Partner closely with data scientists, business analysts, and cross-functional enterprise teams to translate complex business requirements into technically sound and efficient data solutions. Qualifications:
Big Data Frameworks Expertise: Demonstrated high proficiency in Apache Spark architecture, including a deep understanding of drivers, executors, and Directed Acyclic Graphs (DAGs). Advanced Programming: Exceptional coding skills in Python and extensive experience with the PySpark API for developing intricate data transformations and processing logic. Querying & Schema Management: Strong command of HiveQL and ANSI SQL, coupled with expertise in data partitioning techniques and effective schema definition. Optimized Storage Formats: In-depth understanding and practical experience with optimized big data storage file formats such as Parquet, ORC, and Avro. Cloud Ecosystem Development: Hands-on development experience utilizing cloud-native big data utilities (e.g., AWS EMR, Azure Databricks) with in major cloud platforms. Data Warehousing Fundamentals: Solid foundation in Dimensional Data Modeling, including Star and Snowflake schemas, and practical experience with Data Lakes concepts and implementation. Preferred Qualifications CI/CD & DevOps Automation: Experience with Continuous Integration/Continuous Deployment (CI/CD) practices and automation tools like Git, Jenkins, or Ansible. NoSQL Database Integration: Exposure to and experience with NoSQL databases such as HBase, Cassandra, or MongoDB. Professional Cloud Certifications: Relevant professional cloud certifications (e.g., AWS Certified Data Engineer, Microsoft Certified: Azure Data Engineer Associate) are highly valued Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.
Sprachkenntnisse
- English
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.