
Principal Data Engineer

CareDx
United States, California, Brisbane
3260 Bayshore Boulevard
Jun 20, 2025

CareDx, Inc. is a leading precision medicine solutions company focused on the discovery, development, and commercialization of clinically differentiated, high-value healthcare solutions for transplant patients and caregivers. CareDx offers products, testing services, and digital healthcare solutions along the pre- and post-transplant patient journey, and is the leading provider of genomics-based information for transplant patients.

We are transforming the future of transplant care by using data-driven insights to predict and personalize treatment options for transplant patients. Our mission is to revolutionize how care is delivered, leveraging AI, LLMs, deep learning, transplant domain knowledge, and precision medicine to make earlier, smarter decisions that improve outcomes and save lives. We're looking for a Principal Data Engineer to help power this mission by building the next generation of clinical data product infrastructure and advancing AI tools that drive real-world impact in transplant medicine.

This role will assist in designing and implementing scalable data pipelines and AI-driven tools to support our mission of improving transplant patient outcomes. The ideal candidate has foundational knowledge in data engineering, OMOP, and natural language processing (NLP), with an interest in cloud computing and precision medicine.

Key Responsibilities:

  • Scalable Data Architecture and ETL Pipelines: Design, optimize, and manage end-to-end ETL pipelines to ingest, transform, normalize, and integrate large-scale real-world datasets, including but not limited to EMR data, from diverse sources, ensuring robust and scalable data architectures.
  • Cloud & Distributed Computing: Utilize cloud platforms (Databricks/Azure) and distributed computing frameworks to deploy and automate AI agents, optimizing for scalability, cost-efficiency, and high availability in diagnostic and clinical applications.
  • Large Language Model (LLM) and Clinical Natural Language Processing (NLP): Build and deploy LLM and NLP pipelines to extract and standardize longitudinal clinical features from unstructured data, enhancing CareDx's precision medicine capabilities and enabling actionable insights.
  • AI/ML Innovation: Spearhead the development and implementation of cutting-edge AI-driven systems tailored to internal stakeholder needs, accelerating the creation of transformative healthcare AI products.
  • Genomics & Precision Medicine: Partner with bioinformaticians, data scientists, machine learning experts, and clinical teams to integrate multi-omics data into AI models, driving improved outcomes for transplant patients.
  • System Reliability: Uphold data integrity, security, and disaster recovery standards across distributed systems, ensuring operational resilience for CareDx's clinical and research initiatives.
  • Innovation: Research and implement state-of-the-art techniques to advance CareDx's leadership in transplant innovation, delivering impactful solutions at the forefront of healthcare technology.

Qualifications:

  • Education: PhD in Computer Science, Biomedical Informatics, Bioinformatics, Clinical Informatics or a related field.
  • Experience: 10+ years in data engineering, EMR, and AI, specializing in processing large-scale clinical, EMR, genomic, or molecular datasets within healthcare or diagnostic sectors.
  • Programming: Expert in Python, SQL, R, Bash, and Git, with a strong command of modern development workflows.
  • Data Engineering: Extensive experience in ETL pipelines, database management, and data modeling.
  • Cloud Platforms: 3+ years leveraging Databricks or Azure for deploying robust, cloud-based solutions.
  • Clinical NLP: Experience in NLP techniques for processing free-text clinical data. Solid domain knowledge of diseases, immunology, and biomedicine is a plus.
  • OMOP CDM: Experience with the OMOP Common Data Model and clinical data standardization. Must be familiar with the UMLS coding system (RxNorm, CUI, ICD-10, SNOMED, etc.).
  • AI Development: Demonstrated expertise in building AI models with PyTorch/TensorFlow or Scikit-learn and their application to structured/unstructured data to drive innovative solutions.
  • Real World Data: Experience with real-world evidence studies; experience with EMR QC is a must.

San Francisco Bay Area:

The anticipated base salary range for candidates who will work in Brisbane, California is $181,000 to $235,000. The final salary offered to a successful candidate will be dependent on several factors that may include but are not limited to the type and length of experience within the job, type and length of experience within the industry, education, etc. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. CareDx is a multi-state employer, and this salary range may not reflect positions that work in other states.

REMOTE: US only

The anticipated base salary range in the United States is $162,000 to $210,000. The final salary offered to a successful candidate will be dependent on several factors that may include but are not limited to the type and length of experience within the job, type and length of experience within the industry, education, etc. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. CareDx is a multi-state employer, and this salary range may not reflect positions that work in other states.

Additional Details:

Every individual at CareDx has a direct impact on our collective mission to improve the lives of organ transplant patients worldwide. We believe in taking great care of our people, so they take even greater care of our patients.

Our competitive Total Rewards package includes:

  • Competitive base salary and incentive compensation
  • Health and welfare benefits including a gym reimbursement program
  • 401(k) savings plan match
  • Employee Stock Purchase Plan
  • Pre-tax commuter benefits
  • And more!
  • For detailed benefits, please refer to our careers page at https://caredx.com/company/careers/

In addition, we have a Living Donor Employee Recovery Policy that allows up to 30 days of paid leave annually for a full-time employee who performs the selfless act of donating an organ or bone marrow.

With products that are making a difference in the lives of transplant patients today and a promising pipeline for the future, it's an exciting time to be part of the CareDx team. Join us in partnering with transplant patients to transform our future together.

CareDx, Inc. is an Equal Opportunity Employer and participates in the E-Verify program.

By proceeding with our application and submitting your information, you acknowledge that you have read our U.S. Personnel Privacy Notice and consent to receive email communication from CareDx.

We do not accept resumes from headhunters, placement agencies, or other suppliers that have not signed a formal agreement with us.

#LI-Remote
