XX
Data Analyst - Python and SparkCitigroup Inc.United States
XX

Data Analyst - Python and Spark

Citigroup Inc.
  • US
    United States
  • US
    United States

À propos

Job Summary
We are seeking a highly motivated and skilled Data Analyst with expertise in Python and Spark to join our team. The ideal candidate will be responsible for collecting, processing, and performing statistical analyses on large datasets to provide actionable insights. This role involves working closely with various departments to understand business requirements, develop data solutions, and present findings to stakeholders. Responsibilities
Data Collection & Processing: Extract, transform, and load (ETL) data from various sources using Python, SQL, and Spark. Clean, validate, and organize raw data to ensure accuracy and completeness. Develop and maintain robust data pipelines for efficient data ingestion and processing. Data Analysis & Modeling: Perform exploratory data analysis (EDA) to identify trends, patterns, and anomalies. Apply statistical methods and machine learning techniques to develop predictive models and insights. Utilize Spark for large-scale data processing and complex analytical tasks. Reporting & Visualization: Create clear, concise, and compelling reports, dashboards, and visualizations using tools like Tableau, Power BI, or Matplotlib/Seaborn in Python. Communicate complex analytical findings to technical and non-technical stakeholders effectively. Collaboration & Strategy: Work cross-functionally with data engineers, product managers, and business stakeholders to define key performance indicators (KPIs) and analytical requirements. Contribute to the strategic development of data-driven decision-making processes within the organization. Identify opportunities for process improvement and data optimization. Documentation & Maintenance: Document data sources, methodologies, and analysis processes. Monitor data quality and integrity, troubleshooting issues as they arise. Qualifications
Education:
Bachelor\'s or Master\'s degree in Computer Science, Statistics, Mathematics, Economics, or a related quantitative field Experience: Proven experience of 3+ years as a Data Analyst or in a similar role. Strong proficiency in Python for data analysis (Pandas, NumPy, Scikit-learn). Demonstrable experience with Apache Spark for big data processing and analysis. Solid understanding of SQL for data extraction and manipulation. Technical Skills: Expertise in data manipulation, statistical analysis, and data visualization. Experience with cloud platforms (AWS, Azure, GCP) and big data technologies is a plus. Familiarity with version control systems (e.g., Git). Soft Skills:
Excellent problem-solving abilities and analytical thinking. Strong communication and presentation skills, with the ability to convey complex information clearly. Ability to work independently and collaboratively in a fast-paced environment.
Preferred Skills
Experience with data warehousing concepts and tools. Knowledge of machine learning algorithms and their practical application. Familiarity with BI tools such as Tableau, Power BI, or Looker. Experience with other programming languages (e.g., R, Scala). Understanding of data governance and data security best practices. Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
#J-18808-Ljbffr
  • United States

Compétences linguistiques

  • English
Avis aux utilisateurs

Cette offre provient d’une plateforme partenaire de TieTalent. Cliquez sur « Postuler maintenant » pour soumettre votre candidature directement sur leur site.