About
- Build reliable data pipelines to clean, aggregate, and transform large volumes of data from multiple sources.
- Develop versatile software components to extract useful information from unstructured or semi-structured text data.
- Implement advanced search functionality and improve the efficiency of search indexing.
- Work closely with data scientists to develop, test, and iterate on data models and algorithms.
- Contribute to company-wide data privacy compliance efforts.

- Minimum 6 years of experience in the field.
- Extensive experience building large-scale data pipelines with a mainstream big data stack.
- Strong expertise in extracting useful information from unstructured and semi-structured text data.
- Strong software development skills; high proficiency in Java is a plus.
- Professional working experience with Elasticsearch, Apache Beam, Spark, and GCP Dataflow is a big plus.
- Strong expertise in NLP or text mining is also a big plus.
Languages
- English
Notice for Users
This job comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their site.