Clinical Data Science + Biomarker Discovery

An Automated Approach to Identify Responding Subpopulations

Design a robust, data-science-driven approach to analyzing clinical trial participant data in order to stratify patients, predict endpoints, and discover predictive biomarkers.

The Challenge

Our customer had run five trials in treatment-resistant depression (TRD), amassing a dataset of drug efficacy spanning different doses and treatment regimens. They first needed to standardize and unify this trial data into a single dataset. They then needed a repeatable way to apply data science techniques to stratify patients, identify cohorts of responders, and ultimately discover novel biomarkers that could predict future clinical response. The trial data included biological assays of protein levels, clinically validated self-assessments, and multiple treatment levels.

Our Solution

We developed an algorithm that learns clinically meaningful combinations of baseline lab values predictive of both primary and secondary endpoint outcomes, and that automatically identifies subpopulations with a significantly elevated likelihood of treatment benefit.
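To make the idea of searching for responder subpopulations concrete, here is a minimal sketch. It is not the algorithm described in this case study. It assumes a simple swapped-in technique: every pair of baseline markers is split at its median, each resulting subgroup is compared with the remaining patients using a one-sided Fisher exact test, and subgroups with a significantly higher response rate are reported. The field names (`responder`, the marker keys) and the parameters `min_size` and `alpha` are hypothetical.

```python
from itertools import combinations
from math import comb
from statistics import median


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    a / b = responders / non-responders inside the subgroup,
    c / d = responders / non-responders outside it.
    Returns P(at least `a` responders inside) under no association.
    """
    n_sub, n_resp, n = a + b, a + c, a + b + c + d
    upper = min(n_sub, n_resp)
    tail = sum(comb(n_resp, k) * comb(n - n_resp, n_sub - k)
               for k in range(a, upper + 1))
    return tail / comb(n, n_sub)


def find_subgroups(patients, markers, min_size=5, alpha=0.05):
    """Search two-marker median-split rules for enriched responder subgroups.

    `patients` is a list of dicts holding baseline marker values and a
    boolean "responder" flag (hypothetical schema). No multiple-testing
    correction is applied here; a real analysis would need one.
    """
    cutoffs = {m: median(p[m] for p in patients) for m in markers}
    hits = []
    for m1, m2 in combinations(markers, 2):
        for high1 in (True, False):
            for high2 in (True, False):
                flags = [((p[m1] > cutoffs[m1]) == high1)
                         and ((p[m2] > cutoffs[m2]) == high2)
                         for p in patients]
                inside = [p for p, f in zip(patients, flags) if f]
                outside = [p for p, f in zip(patients, flags) if not f]
                if len(inside) < min_size or not outside:
                    continue
                a = sum(1 for p in inside if p["responder"])
                c = sum(1 for p in outside if p["responder"])
                p_value = fisher_one_sided(a, len(inside) - a,
                                           c, len(outside) - c)
                if p_value < alpha:
                    hits.append({
                        "rule": ((m1, ">" if high1 else "<=", cutoffs[m1]),
                                 (m2, ">" if high2 else "<=", cutoffs[m2])),
                        "n": len(inside),
                        "response_rate": a / len(inside),
                        "p_value": p_value,
                    })
    # Most significant subgroup first.
    return sorted(hits, key=lambda h: h["p_value"])
```

Each reported rule is a readable pair of threshold conditions, which is one simple way to keep the output open to review by translational scientists.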

This algorithm required harmonizing data from five previously conducted TRD trials, each featuring different dose levels, regimens, and biomarker panels.
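As an illustration of what harmonizing trial data can involve, the sketch below maps trial-specific column names and dose units onto one shared schema. Everything in it is an assumption made for the example: the trial identifiers, column names, the `protein_x` marker, and the unit conversions are hypothetical and do not describe the customer's data.

```python
# Shared target schema (hypothetical field names).
COMMON_FIELDS = ("patient_id", "dose_mg", "baseline_score", "protein_x")

# Hypothetical per-trial mappings: source column -> common field,
# plus the divisor that converts the trial's dose unit to milligrams.
TRIAL_SCHEMAS = {
    "trial_a": {
        "columns": {"subj": "patient_id", "dose_mg": "dose_mg",
                    "score_bl": "baseline_score", "prot_x": "protein_x"},
        "dose_divisor": 1,      # already recorded in mg
    },
    "trial_b": {
        "columns": {"SUBJID": "patient_id", "dose_ug": "dose_mg",
                    "score0": "baseline_score"},  # no protein_x assay
        "dose_divisor": 1000,   # recorded in micrograms
    },
}


def harmonize(trial_id, rows):
    """Convert one trial's raw rows (list of dicts) to the common schema."""
    schema = TRIAL_SCHEMAS[trial_id]
    unified = []
    for row in rows:
        # Start with every common field set to None so that markers a
        # trial never measured stay explicitly missing, not imputed.
        record = {"trial_id": trial_id, **{f: None for f in COMMON_FIELDS}}
        for source, target in schema["columns"].items():
            record[target] = row.get(source)
        if record["dose_mg"] is not None:
            record["dose_mg"] = float(record["dose_mg"]) / schema["dose_divisor"]
        unified.append(record)
    return unified
```

Keeping the trial identifier on every record lets later analyses adjust for differences between trials, and leaving unmeasured markers as missing avoids inventing values for biomarker panels that differ from trial to trial.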

Our approach emphasized transparency and explainability, allowing translational scientists to validate findings and generate hypotheses about potential mechanisms of action and novel biomarkers.

Ultimately, our work provided a reusable framework to accelerate insight generation and patient stratification across central nervous system (CNS) studies.
