About
Responsibilities
- Build reliable data pipelines to clean, aggregate, and transform large volumes of data from multiple sources.
- Develop versatile software components to extract useful information from various unstructured or semi-structured text data.
- Implement advanced search functionalities and improve the efficiency of search indexing.
- Work closely with data scientists to develop, test, and iterate on data models and algorithms.
- Contribute to company-wide data privacy compliance efforts.
Required
- Minimum 6 years of experience in the field.
- Extensive experience in building large-scale data pipelines with a mainstream big data stack.
- Strong expertise in extracting useful information from unstructured and semi-structured text data.
- Strong software development skills; high proficiency in Java is a plus.
- Professional working experience with Elasticsearch, Apache Beam, Spark, and GCP Dataflow is a big plus.
- Strong expertise in NLP or text mining is also a big plus.
Language skills
- English
Note for users
This job posting comes from a TieTalent partner platform. Click "Apply Now" to submit your application directly on their website.