Senior Data Engineer / Data Curator
TSMC
- Phoenix, Arizona, United States
About
- Design and implement data pipelines for processing, cleaning, and curating large datasets used in model training and fine-tuning.
- Automate data cleaning processes (e.g., removing noise, duplicates, irrelevant content) and ensure datasets are appropriately labeled and structured.
- Collaborate with model teams to ensure data aligns with model requirements and performance goals.
- Assess and mitigate bias in datasets, ensuring that models are trained on diverse and representative data.
- Manage data storage and retrieval strategies, ensuring scalability and data consistency across different environments.
- Conduct regular audits to ensure data integrity, privacy, and security compliance.
Minimum Qualifications/Requirements
Education:
- Minimum degree required: Bachelor's degree in Computer Science, Data Science, or a related field.
Technical Skills:
- 5+ years of experience in data engineering, data wrangling, or data curation, particularly in machine learning or AI-driven environments.
- Strong proficiency in Python (Pandas, NumPy) and SQL for data manipulation and querying.
- Familiarity with cloud-based data storage (AWS S3, Google Cloud Storage, etc.) and distributed systems for managing large datasets.
- Experience with data annotation tools and platforms for manual or semi-automated labeling.
- Experience managing data pipelines with tools like Apache Kafka, Apache Airflow, or similar ETL tools.
- Strong knowledge of AI ethics, data privacy, and compliance standards (GDPR, CCPA, etc.).
- Bonus: Experience with vector databases and indexing for LLMs (e.g., FAISS, Pinecone).
Interpersonal Skills:
- Communication
- Computer proficiency
- Presentation skills
- Listening
- Teamwork
Candidates must be willing and able to work on-site at our Phoenix, Arizona facility.
TSMC is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other protected characteristic. We encourage all qualified individuals to apply, and we welcome applications from individuals with diverse backgrounds and experiences. Candidates must be able to perform the essential functions of the job with or without a reasonable accommodation.
Languages
- English