Über
Cloud Infrastructure & Data Warehousing (8+ yrs overall, 4+ yrs in AWS). Proficiency building and optimizing data pipelines using AWS services such as S3, Redshift, Glue, IAM, Kinesis, and EMR. Experience across GCP (BigQuery, Dataflow) and Azure (Synapse, Data Factory). Optimizing data warehouses (Redshift, Snowflake, BigQuery) and managing Data Lakes (S3, Delta Lake) for scalable, low-latency analytics. Ensuring cost efficiency, scalability, and compliance (CPRA, HIPAA) while supporting a migration toward Flink-based near real-time architecture.
Data Quality & Governance (8+ Years). Experience implementing scalable data validation, quality checks (e.g., deduplication, consistency), and error-handling mechanisms tailored for operational reporting pipelines, ensuring high-fidelity data for real-time dashboards and analytics. Proficiency in designing and enforcing data governance practices, including metadata management, lineage tracking for auditable reporting, and compliance with regulations like CPRA or HIPAA in Data Lake environments (e.g., AWS S3, Delta Lake).
Performance Optimization (3+ Years). Experience optimizing data pipelines, queries, and large-scale datasets for efficiency and scalability in operational reporting systems, with a focus on achieving low-latency delivery. Proficiency in tuning high-throughput streaming systems, including optimizing resource usage and implementing best practices for partitioning, caching, and indexing.
Security & Compliance (3+ Years). Experience implementing data security measures, including encryption, role-based access control (RBAC), and data masking, to protect sensitive data in operational reporting pipelines and Data Lakes (e.g., AWS S3, Delta Lake). Strong understanding of compliance standards such as HIPAA and CPRA, with hands-on expertise in applying these standards to streaming systems like Apache Kafka and Apache Flink. Demonstrated ability to ensure auditability and security in data workflows, supporting reliable and compliant near real-time analytics during the transition from micro-batching to a Flink-based architecture.
Collaboration & Communication (5+ Years). Strong ability to work cross-functionally with business analysts, product managers, leadership, and other stakeholders to define and deliver operational reporting requirements. Exceptional communication skills to translate complex technical concepts into clear, actionable insights for non-technical audiences. Proven adaptability to thrive in a fast-paced startup environment, collaborating effectively to support the rapid development and evolution of a near real-time data platform while aligning with Rulas mission to improve mental health care outcomes.
Preferred Qualifications While having the preferred qualifications enhances your candidacy, having all of them is not mandatory. We encourage all interested applicants to apply, even those who may not meet every preferred requirement. Hands-on experience with AWS tools like S3, Glue, EMR, SageMaker, and Lambda for building scalable ETL/ELT pipelines optimized for ML/LLM training, including feature engineering, data versioning, and handling large-scale unstructured data
Demonstrated ability to maintain data integrity and accuracy in streaming systems like Apache Kafka and Apache Flink, supporting reliable operational insights during the transition from micro-batching to a near real-time architecture.
Familiarity with infrastructure as code (IaC) tools like Terraform or CloudFormation for managing cloud resources.
Experience implementing and maintaining CI/CD pipelines for data workflows.
Demonstrated ability to enhance pipeline performance to support near real-time analytics while maintaining cost efficiency and reliability during the transition from micro-batching to a streaming architecture.
Strong ability to partner with data scientists and ML engineers to design efficient pipelines, using orchestration tools (e.g., Airflow, Dagster) for incremental loading and cost optimization, while monitoring performance metrics like latency and resource utilization in AWS environments.
We're serious about your well-being! As part of our team, full-time employees receive: 100% remote work environment (US-based only):
Working hours to support a healthy work-life balance, ensuring you can meet both professional and personal commitments
Attractive pay and benefits : Full transparency of pay ranges regardless of where you live in the United States
Comprehensive health benefits : Medical, dental, vision, life, disability, and FSA/HSA
401(k) plan access : Start saving for your future
Generous time-off policies : Including 2 company-wide shutdown weeks each year for self-care (for most employees)
Paid parental leave : Available for all parents, including birthing, non-birthing, adopting, and fostering
Employee Assistance Program (EAP) : Support for your mental and physical health
New hire home office stipend : Set up your workspace for success
Quarterly department stipend : Fund team-building activities or in-person gatherings
Wellness events and lunch & learns : Explore a variety of engaging topics
Community and employee resource groups : Participate in groups that celebrate employee identity and lived experiences, fostering a sense of community and belonging for all
Our team We believe that diversity, equity, and inclusion are fundamental to our mission of making mental healthcare work for everyone. We are dedicated to having a culture of inclusion that will support our employees in feeling safe, seen, heard, and valued.
Sprachkenntnisse
- English
Hinweis für Nutzer
Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klicken Sie auf „Jetzt Bewerben“, um Ihre Bewerbung direkt auf deren Website einzureichen.