Senior Data Engineer
Remote (United States)
Compensation
Estimated hourly pay: $58.85 to $93.99 per hour, calculated on 2,080 hours per year.
Additional compensation: Equity and benefits.
About the Role
This opportunity is for a Senior Data Engineer to help build a modern data platform that supports personalized, ethical, secure, and scalable healthcare technology. The role focuses on designing and implementing robust data pipelines, partnering with analytics, data science, machine learning, and engineering teams, and creating a compliant, privacy-focused data foundation.
This position supports high-impact work across data infrastructure, reporting, experimentation, governance, and member-focused product experiences.
Employment Type
Full-Time
Work Eligibility
For candidates whose primary residence is in the greater San Francisco area, this role follows a hybrid model with 3 days per week in the office and remote flexibility for the remaining workdays.
What You’ll Do
- Contribute to data platform architecture and implement robust data pipelines.
- Ingest, aggregate, and index diverse data sources into the organization’s data lake.
- Help create a secure, compliant, and privacy-focused data warehousing solution designed for healthcare requirements.
- Partner with data analytics teams to deliver a data platform that supports accurate and actionable reporting on key business metrics.
- Collaborate with data science and machine learning teams to build tools and capabilities that support rapid experimentation and innovation.
- Support peers and promote a culture that treats data as a strategic asset across the organization.
Qualifications
- 4+ years of proven experience designing and implementing large-scale data systems.
- Strong understanding of core Python and Python tooling, including pip and poetry.
- Experience with popular Python libraries such as pytest and pydantic.
- Strong PySpark experience, including the DataFrame API.
- Understanding of PySpark internal architecture and optimization techniques.
- Experience with Databricks is preferred.
- Experience with Redshift and/or Snowflake is also relevant.
- Demonstrated expertise in architectural patterns for high-volume ETL pipelines.
- Experience with data modeling, Medallion architecture, pipeline design, metrics calculation, and technical documentation.
- Excellent verbal and written communication skills for effective cross-functional collaboration.
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
Preferred Qualifications
- Experience with deployment and infrastructure management using Terraform.
- Experience implementing governance, privacy, and security frameworks across a data platform.
- Familiarity with PII/PHI data, HIPAA, and biometrics data.
- Experience migrating to or working with dbt for transformations, documentation, and lineage tracking.
- Experience developing AI-ready architecture, such as semantic layers that standardize business logic for agentic AI use cases.
Benefits
- Stock awards.
- Comprehensive healthcare coverage.
- Monthly wellness stipend.
- Retirement savings match.
- Lifetime membership to a wellness platform.
- Generous parental leave.
- Competitive total rewards package.