We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results

Data Scientist

Certara USA, Inc.
United States, Pennsylvania, Wayne
4 Radnor Corporate Center (Show on map)
Nov 19, 2025
Overview

Certara is a growing company that provides a dynamic and exciting place to work. Our purpose is to assist in accelerating the development of meaningful medicines that make an impact on our society and the people that need them most. Innovation and creativity are highly valued, and everyone is given the opportunity for training and continuous development. Our portfolio spans the discovery, preclinical, clinical and post-marketing phases of drug development, working with 1,200 commercial companies, 250 academic institutions, and numerous regulatory agencies.

We are seeking an experienced Data Scientist to join our Data Science Solutions team at CertaraAI. The successful candidate will be responsible for collaborating with internal product teams as well as drive solutions for clients. They will work closely with our Data Science Engineering team and develop and utilize our burgeoning LLM platform to address use cases across the clinical space. They will utilize their background in data science to set up and execute experiments, design and produce metrics, and present progress to stakeholders. They will use their experience, the platform, and their programming abilities to come up with solutions to use cases like data extraction and normalization, CSR and other document generation, machine translation to various query and modeling languages, knowledge base mining, and new use cases as they arise.


Responsibilities

  • Work on client engagements to apply CertaraAI's LLM platform to a variety of use cases:
    • Interact with clients and work to understand their use cases and their data.
    • Work on small client teams and serve as the team's expert on LLMs and the capabilities of CertaraAI's platform.
    • Utilize the platform when able but also be able to utilize the API and program novel prototype solutions.
    • Utilize experience to design right sized experiments, manage expectations, design and produce metrics to track success.
  • Work with internal product and development teams to integrate LLM capabilities within Certara's world class clinical software suite:
    • Be a liaison of the data science team to other product teams across Certara, educating them on the CertaraAI platform as well as LLMs in general
    • Collaborate and build cross-product solutions and integrate CertaraAI's solutions into other Certara products.
    • Use the CertaraAI platform and LLMs in general to solve problems across the entire clinical lifecycle, make existing workflows more efficient, and work with best-in-class clinical products and world-renowned scientists to integrate LLMs into their workflows.

Qualifications

Requirements:

  • 3-5 years of experience in a data science role.
  • Excellent presentation skills with experience communicating complex ideas to diverse audiences.
  • Proficiency in Python, Git.
  • Experience with data extraction, data standardization, and harmonization with classical natural language approaches, as well as modern LLMs.
  • Experience with data cleaning and experimental design.
  • Experience with LLM prompt engineering, problem-solving using LLMs, and applying LLMs to real-world use cases.

Preferred Qualifications:

  • Master's degree or higher in a relevant field (e.g., data science, statistics).
  • Strong background in clinical data analysis.
  • Experience working within the clinical trial lifecycle.

Certara bases all employment-related decision on merit, taking into consideration qualifications, skills, achievement, and performance. We treat all applicants and employees without regard to personal characteristics such as race, color, ethnicity, religion, sex, sexual orientation, age, nationality, marital status, pregnancy, physical or mental condition, genetic information, military service, or other characteristic protected by law.

Applied = 0

(web-df9ddb7dc-zsbmm)