The Role:
The Staff Data Engineer will lead the design, architecture, implementation, and delivery of data infrastructure supporting AI-powered solutions and insights for researchers, patients, and stakeholders. You'll work closely with clinicians, data scientists, engineers, and business teams to build scalable data pipelines, optimize storage and retrieval, and ensure data quality and accessibility.
Passionate about data-driven decisions, you thrive in fast-paced environments and influence key technical choices. You are open to adapting strong opinions and focused on delivering solutions quickly. We're looking for a dynamic player-coach who can excel as a senior contributor and transition into a leadership role as the data team grows.
This is a full-time, hybrid role based out of San Mateo, CA and requires 2-3 days in office.
Responsibilities:
- Design and implement scalable and reliable data infrastructure for clinical, business, and operational needs, including real-time/batch pipelines and warehousing solutions
- Optimize data systems for high-performance querying and analysis and develop data infrastructure supporting AI solutions in clinical settings
- Stay current with emerging data technologies and methodologies, and apply them strategically to ensure data quality, integrity, and accessibility
- Build and mentor a high-performing data engineering team, fostering innovation and continuous learning
- Champion a data-centric approach throughout the organization, aligning data strategies with business objectives
Must-Have Skills:
- 7+ years of experience as a data engineer with a proven track record of designing, implementing, and managing large-scale data systems.
- Deep expertise in data modeling, data architecture and modern data warehousing solutions (e.g. Clickhouse, Snowflake, Redshift, or BigQuery)
- Extensive experience with data pipeline tools and frameworks such as Apache Airflow, Spark, Kafka, and ETL technologies
- Strong proficiency in SQL and expertise in programming languages such as Python, R
Preferred Skills:
- Proven ability to optimize data systems for performance and cost-efficiency, particularly for large-scale clinical and business datasets
- Strong skills in data visualization tools and BI platforms (e.g., Superset, Tableau, Power BI)
- Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and their associated data services.
- Experience with CI/CD practices for data pipelines and infrastructure-as-code
- Excellent problem-solving skills and attention to detail
- Strong communication skills
- Proven experience in interviewing engineering candidates, mentoring team members, and fostering professional growth within a technology team
Our Preferred candidate will also have:
- Experience with real-time and batch processing of large scale data
- Experience with machine learning data pipelines and integration
- Knowledge of data security and privacy regulations and best practices a plus
- Passion for improving data systems and solving complex data challenges, particularly within the healthcare sector
At Citizen Health, we are committed to fostering a diverse and inclusive workplace. We are an equal opportunity employer and welcome candidates from all backgrounds to apply. We particularly encourage applications from individuals with personal or family experiences with rare diseases, as your insights could be invaluable to our mission.
California Base Pay Range: $200,000 - $230,000
This salary range is an estimate, and the actual salary may vary based on a wide range of factors, including your skills, qualifications, experience and location. This position is eligible for stock options and other benefits including but not limited to Medical, Dental, Vision insurance. We also offer unlimited PTO and a paid new parent leave program so you can spend time with your new child.