logo inner

Vice President, Data Warehouse Engineering

HealthVerityPhiladelphia, Pennsylvania, United StatesRemote, Onsite
This job is no longer open

How you will help


The Vice President, Data Warehouse Engineering will drive the design, scalability, and optimization of HealthVerity's healthcare-focused data warehouse ecosystem—the most extensive and diverse in the United States. This role is ideal for a strategic, technical leader with deep expertise in healthcare data architecture, including the design of secure, scalable infrastructures capable of integrating and normalizing clinical, transactional, and operational data from thousands of healthcare providers and data partners.

You will design and manage innovative, standards-compliant architectures that support complex, large-scale analytics and product development, shaping the future of healthcare data solutions.

What you will do


  • Architect Scalable Data Solutions: Design and implement the enterprise-wide healthcare data warehouse architecture, integrating complex data sources such as EHR systems, claims data, lab results, and patient registries into a unified data platform.
  • Healthcare-Specific Data Modeling: Build and optimize data models to support healthcare-specific use cases, including longitudinal patient analysis, population health, and clinical research. Leverage best practices in dimensional modeling, data vault modeling, and OLAP architectures.
  • Standards-Based Integration: Ensure data interoperability by integrating data using standards like ANSI 837/835, HL7, FHIR, and ontologies like ICD-10, CPT, NDC, and LOINC. Architect solutions that facilitate smooth data exchange with external partners while ensuring data lineage and traceability.
  • Data Governance and Quality: Establish robust data governance frameworks tailored to healthcare compliance requirements (HIPAA, HITECH). Implement tools and processes to monitor, cleanse, and validate incoming data streams, ensuring accuracy and consistency across all datasets.
  • Data Pipeline Optimization: Architect high-performance ETL/ELT pipelines that efficiently ingest, transform, and load multi-terabyte datasets in near real time. Leverage Databricks DLT and workflows, Apache Airflow, dbt, or equivalent tools to automate data workflows while supporting both batch and streaming data ingestion.
  • Cloud-Native Architecture: Design cloud-native solutions using AWS services like Redshift, S3, Glue, and Lambda and Databricks services like Unity catalog, Autoloader, and Delta Live Tables to manage high-throughput data ingestion and processing at scale. Explore hybrid models when on-premise integration is needed for sensitive healthcare data.
  • Data Security & Privacy: Lead the implementation of security frameworks ensuring data confidentiality, integrity, and availability. Incorporate advanced security measures such as encryption (in transit and at rest), access controls, and secure data-sharing mechanisms.
  • Master Data Management (MDM): Design and maintain master reference data systems, establishing consistency across providers, patients, medications, diagnoses, and procedures. Implement entity resolution algorithms to deduplicate and merge patient records.
  • Advanced Query Performance: Optimize query execution for analytical workloads on large datasets using indexing, partitioning, and materialized views. Architect efficient reporting layers that provide sub-second query responses for mission-critical applications.
  • Real-Time Data Integration: Design architectures to support real-time or near real-time data integration and streaming use cases using technologies like Apache Kafka, AWS Kinesis, or Google Pub/Sub.
  • Cross-Functional Collaboration: Collaborate with data scientists, analysts, and product teams to design data marts and semantic layers that meet the needs of both operational and analytical workloads.
  • Scalable Metadata Management: Implement metadata management and data cataloging tools (e.g., Apache Iceberg, Delta Tables, and Unity Catalog) to enhance data discovery, lineage tracking, and impact analysis.

Required skills and experience


  • Extensive Data Architecture Expertise: 10+ years of hands-on experience designing, developing, and scaling large healthcare data warehouse ecosystems with multi-petabyte architectures.
  • Healthcare Data Knowledge: Deep familiarity with healthcare-specific data sources (e.g., EHRs, claims, labs), and experience designing architectures compliant with healthcare regulations such as HIPAA and HITECH.
  • Cloud-Native & Hybrid Solutions: Proven experience implementing cloud-based data platforms using AWS, Azure, or GCP. Experience with hybrid models that integrate on-premise data centers is a plus.
  • Advanced Data Modeling: Strong knowledge of OLTP, OLAP, star schema, and snowflake schema designs, with an emphasis on healthcare data models. Experience with dimensional modeling and data vault approaches.
  • ETL/ELT Pipeline Design: Expertise in building scalable and automated ETL/ELT pipelines using tools like Apache NiFi, Airflow, dbt, or Informatica to manage diverse, high-volume data sources. Experience migrating from EMR and Hive to Databricks or Snowflake is desirable.
  • Interoperability Standards: Practical experience working with healthcare messaging and data exchange standards, including ANSI x12, HL7, FHIR, and CCD.
  • Distributed Computing: Hands-on experience with distributed data processing frameworks like Apache Spark or Dask for large-scale data transformations and analytics.
  • Big Data Systems: Experience with big data ecosystems, including Hive, Presto, and NoSQL databases, to support complex analytical workloads. Additionally, experience optimizing these systems with data systems like DuckDB is desirable.
  • Data Quality & Governance: Expertise in implementing automated data quality checks and governance frameworks using tools like Great Expectations, DataOps solutions, or custom-built frameworks. Experience with Data Observability frameworks such as Anomolo or Monte Carlo is a plus.
  • Data Security & Privacy: Strong understanding of data encryption techniques, role-based access control (RBAC), and tokenization to protect sensitive healthcare information.
  • Agile Development & DevOps: Experience integrating CI/CD pipelines, automated testing, and DevOps practices into data architecture projects to ensure reliability and scalability.
  • Master Data Management: Expertise in designing and maintaining MDM solutions that ensure consistency across core healthcare domains, including patient, provider, and diagnosis records.
  • Scalable Data Storage: Familiarity with distributed storage systems optimized for healthcare data, including columnar databases (e.g., Redshift, BigQuery, Casandra) and object storage solutions (e.g., S3, GCS).

Base salary for the role is commensurate with experience and can range between $180,000 - 275,000 + annual bonus opportunity.

Hiring Locations


We prioritize hiring team members from the Philadelphia area whenever possible, as our main office is located in Center City, Philadelphia, where we operate on a hybrid model with in-office work required two days a week. However, for certain roles, we’re also open to hiring from our hub locations, where we have groups of remote workers. Please note that, due to tax and labor regulations, we can only hire from specific states, but remote work is supported in the following key hubs and approved states:Hub Locations:

  • Philadelphia, Pennsylvania
  • Boston, Massachusetts
  • New York City, New York
  • Baltimore, Maryland
  • Washington D.C.
  • Charlotte, North Carolina
  • Raleigh-Durham, North Carolina
  • Atlanta, Georgia
  • Chicago, Illinois

Approved States:

CT, DE, FL, GA, IL, IN, MA, MD, MI, NC, NJ, NY, OH, PA, RI, TN and VA.

About HealthVerity


HealthVerity synchronizes transformational technologies with the nation’s largest healthcare and consumer data ecosystem to power previously unattainable outcomes and fundamentally advance the science. We offer a comprehensive, yet flexible approach, based on the foundational elements of Identity, Privacy, Governance and Exchange (IPGE), that synchronizes unparalleled Identity management with built-in Privacy compliance and Governance, providing the ability to discover and Exchange a near limitless combination of data at a record pace.

Together with our partners in life sciences, government and insurance, we are Synchronizing the Science. To learn more about HealthVerity, visit healthverity.com.

Why you'll love working here


We are making a difference – Our technology is at the forefront of some of the biggest healthcare challenges in the world. We are one team – Our people define our culture and always will. We take time out to celebrate each other at the end of every week through company-wide shout outs, and acknowledge the value that each of us adds towards our greater mission. Come share all you have to offer.We are learners – Every team member is continually learning, no matter if we've been in a role for one year or much longer.

We are committed to learning and implementing what is best for our clients, partners, and each other.

Benefits & Perks


  • Compensation: competitive base salary & annual bonus opportunity (for non-commissioned roles)
  • Benefits: comprehensive benefits with coverage on Day 1, medical, dental, vision, 401k, stock options
  • Flexible location: our HQ is in Philadelphia. We offer both hybrid roles and those with quarterly travel.
  • Generous PTO: Take time off as needed, targeted at 4 weeks per year, including vacation, personal and sick time, plus paid parental leave.
  • Comprehensive and individualized onboarding: mentorship program, departmental talks, and a library of resources are available beginning day 1 for each new team member to minimize the stress of starting a new job
  • Professional development: biweekly 1:1s, hands-on leadership that is goal-and growth-oriented for each team member, and an annual budget to support professional development pursuits

We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. All qualified job applicants will be given consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or on the basis of disability. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together.

We can only do that if everyone has a seat at the table.If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.comRemote opportunities are not available in all areas and require team members to work from a fixed location due to tax and labor law implications - specific questions about remote positions can be discussed during the interview process with your recruiter.

This job is no longer open

Life at HealthVerity

HealthVerity, Inc., based in Philadelphia, a leader in cloud solutions for networked data, enhances the transparency, connectivity, privacy of traditional and emerging healthcare data. We leverage innovative technologies to help our clients discover, license and link data inside their Enterprise and across the widest range of top tier data providers. We empower customers to gain new perspectives on patient activity while ensuring complete privacy management and HIPAA compliance.
Thrive Here & What We Value- Making a Difference in Healthcare Challenges- One Team, Celebrating Each Other's Value- Learners and Committed to Learning and Implementing Best Practices- Comprehensive Benefits Package with coverage on Day 1- Flexible Location Options- Professional Development Opportunities- Equity Inclusion and Diversity Statement
Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2025