- +3
- +14
- Texas, United States
Über
Experience in automating ETL processes/pipelines and AWS data infrastructure using Python.
Proficient with AWS components like S3, Athena, EMR, Glue, Redshift, Kinesis, and SageMaker.
Expertise in SQL, Unix/Linux scripting, and developing/testing on Cloud and On-Prem ETL technologies (Ab Initio, AWS Glue, Informatica, Alteryx Experience in data migration from on-premise to cloud is a plus.
Extensive experience in DevOps/DataOps environments.
Familiarity with SageMaker, Machine Learning Studio, and H2O is an added advantage.
Well-versed in the strategies for Cloud/On-Prem ETL testing.
Skilled in interpreting and analyzing data from multiple source systems for data integration and reporting.
Knowledgeable in data modeling and data warehousing concepts with a focus on Cloud/On-Prem ETL environments.
Executes data analytics and data integration testing within time and budget constraints.
Nice to have:
Experience using Jenkins and GitLab.
Familiar with both Waterfall and Agile methodologies.
Experience in testing storage tools like S3 and HDFS.
Wünschenswerte Fähigkeiten
- Python
- AWS
- SQL
- Unix
- Alteryx
- DevOps
- H2O
- Data Integration
- Data Modeling
- Data Warehousing
- Data Analytics
- Jenkins
- Gitlab
- HDFS
Berufserfahrung
- Data Engineer
- Data Infrastructure
- DevOps
Sprachkenntnisse
- English