Probability of Success Calibration
PhaseFolio derives stage-transition probabilities from observed clinical outcomes rather than expert opinion, following Thomas et al. (2021) and Wong et al. (2019). The benchmark matrix is three-dimensional (11 indications × 8 modalities × 3 biomarker strategies = 264 cells); seven evidence-based multipliers are applied through a log-odds (logit) transformation to keep results bounded and reflect diminishing returns at high baselines.
Three-dimensional benchmark matrix
11 indications × 8 modalities × 3 biomarker strategies = 264 cells.
Rather than applying a single set of industry-average transition rates, PhaseFolio stratifies PoS by three independent classification axes that are known to materially affect clinical outcomes: therapeutic area, drug modality, and biomarker strategy.
Therapeutic Area
11 indications from oncology (solid and hematologic) through cardiovascular, neurology, metabolic, and rare disease. Oncology solid tumor has the lowest overall LoA (~2.5%); rare disease the highest (~9.4%).
Thomas et al. 2021
Drug Modality
8 modalities including small molecule, monoclonal antibody, bispecific, ADC, cell therapy, gene therapy, and peptide. Modality affects safety profiles and regulatory pathways.
Citeline 2024
Biomarker Strategy
3 levels: none, enrichment (biomarker-selected population), and companion diagnostic (required for Rx). Biomarker use drives Phase II and III success rates up to 4×.
Parker et al. 2015
The full matrix contains 11 × 8 × 3 = 264 unique indication–modality–biomarker combinations, each specifying five stage-transition probabilities (Preclinical → Phase I → Phase II → Phase III → NDA/BLA → Approval). Values are derived from meta-analysis of 12,728+ clinical-stage transitions [BIO/QLS/Informa 2021] with modality-specific adjustments from Citeline 2024 pipeline data.
| Therapeutic Area | Preclinical | Phase I | Phase II | Phase III | NDA/BLA | Overall LoA |
|---|---|---|---|---|---|---|
| Oncology (Solid) | 5.0% | 40.0% | 24.0% | 55.0% | 90.0% | 2.4% |
| Oncology (Hematologic) | 7.0% | 72.0% | 42.0% | 63.0% | 90.0% | 12.0% |
| Rare Disease | 8.0% | 56.0% | 38.0% | 64.0% | 93.0% | 9.4% |
| Neurology | 4.0% | 46.0% | 20.0% | 47.0% | 88.0% | 1.5% |
| Immunology | 6.0% | 49.0% | 30.0% | 58.0% | 91.0% | 4.6% |
| Infectious Disease | 7.0% | 52.0% | 36.0% | 62.0% | 92.0% | 7.4% |
| Cardiovascular | 5.0% | 48.0% | 28.0% | 55.0% | 90.0% | 3.3% |
| Metabolic | 6.0% | 50.0% | 32.0% | 58.0% | 91.0% | 5.1% |
| Respiratory | 5.0% | 47.0% | 26.0% | 54.0% | 90.0% | 2.9% |
| Dermatology | 6.0% | 50.0% | 34.0% | 60.0% | 91.0% | 5.6% |
| Ophthalmology | 6.0% | 50.0% | 30.0% | 58.0% | 91.0% | 4.7% |
Table 1. Baseline stage-transition probabilities by therapeutic area (small molecule, no biomarker strategy). Overall Likelihood of Approval (LoA) is the product of all five transition rates. Source: BIO/QLS/Informa 2021; Wong et al. 2019.
Multiplier adjustments
Seven evidence-based factors, each applied where the source measured them.
Several evidence-based factors are known to shift clinical success probabilities relative to the population base rate. PhaseFolio applies these multipliers via a log-odds (logit) transformation — the mathematically correct method when a multiplier is a true odds ratio. The cited sources, however, report effect sizes in different forms: relative success ratios (Minikel 2024), phase success-rate comparisons (Parker 2015), relative approval rates (Mullard 2016), and a cumulative pipeline advantage (Tufts NEWDIGS 2023).
The engine currently treats all of these through the OR-style logit path as a deliberately conservative approximation — this under-credits favorable modifiers at higher baselines, never saturates to 1.0, and avoids stacked-modifier overshoot. Per-modifier estimand declarations (_source_estimand and _applied_as) are recorded in the machine-readable source and explained in the model card. Each multiplier is applied only to the clinical phases where the underlying evidence was measured.
| Modifier | Multiplier | Source estimand | Applied as | Stages | Source |
|---|---|---|---|---|---|
| Genetic Validation | 2.6× | RR | OR (logit) | II, III | Minikel et al. 2024, Nature |
| Companion Diagnostic | 2.0× | RR | OR (logit) | II, III | Parker et al. 2015 (ASCO) |
| Orphan Designation | 1.5× | RR | OR (logit) | II, III | Mullard 2016, Nat. Rev. Drug Disc. |
| Biomarker Enrichment | 1.5× | RR | OR (logit) | II, III | Parker et al. 2015; BIO 2021 |
| First-in-Class | 0.85× | RR | OR (logit) | II, III | BIO/QLS 2021 |
| CAR-T / TCR Therapy | 1.73× / stage | RR (cum. 3×) | OR (logit) | I, II | Tufts NEWDIGS 2023 |
| Gene Therapy (Orphan) | 1.41× / stage | RR (cum. 2×) | OR (logit) | I, II | Tufts NEWDIGS 2023 |
Table 2. Evidence-based multipliers, with the source estimand (what the literature reports) separated from the applied path (how the engine treats it). Favorable multipliers (>1) increase PoS; unfavorable (<1) decrease it. CAR-T and gene-therapy-orphan per-stage values are sqrt(cumulative) splits of the source's whole-pipeline advantage.
A critical design decision: preclinical PoS is never adjusted by any multiplier. Preclinical attrition is dominated by toxicology, pharmacokinetics, and formulation failures [Sun et al. 2025] — factors orthogonal to the clinical efficacy signals that these multipliers capture. Similarly, NDA/BLA approval rates reflect regulatory filing quality rather than drug-specific clinical attributes, and are therefore held constant.
Logistic transformation method
Why naive multiplication is wrong, and what log-odds space buys you.
Applying odds ratios to bounded probabilities requires care. Naive multiplication (PoS × OR, capped at 1.0) produces mathematically unsound results: a drug with 50% base PoS and a 2.6× genetic-validation multiplier would yield 130%, capped to 100% — falsely claiming certainty. With multiple favorable multipliers stacking, this problem cascades rapidly.
PhaseFolio instead applies multipliers in log-odds (logit) space — the standard biostatistical transformation for adjusting bounded probabilities by a multiplicative factor. The three-step transformation:
This approach has three desirable mathematical properties:
- Bounded output. The result is always in (0, 1) — it can never reach 0% or 100%, regardless of how many multipliers are stacked.
- Diminishing returns. A 2.6× OR applied to a 24% base PoS yields 45.1% (+21.1pp). Applied to a 70% base, it yields 85.8% (+15.8pp). The higher the base, the harder it is to push higher — matching clinical reality.
- Composability. Multiple ORs applied sequentially produce the same result regardless of order, because multiplication in log-odds space is commutative.
Worked example: PoS derivation
Rare disease small molecule with genetic validation and orphan designation.
Consider a rare disease small molecule with genetic validation and orphan designation. We derive the Phase II PoS step-by-step.
Phase II PoS derivation — rare disease, small molecule
Base rate (from benchmark matrix)
Phase II PoS = 38.0%
Apply genetic validation (factor 2.6, applied as OR)
odds = 0.38 / (1 − 0.38) = 0.613
odds × 2.6 = 1.594
PoS = 1.594 / (1 + 1.594) = 61.5%
Factor source: Minikel et al. 2024 (RR); applied as OR — see model card.
Apply orphan designation (factor 1.5, applied as OR)
odds = 0.615 / (1 − 0.615) = 1.597
odds × 1.5 = 2.396
PoS = 2.396 / (1 + 2.396) = 70.6%
Factor source: Mullard 2016 (RR); applied as OR — see model card.
Adjusted Phase II PoS
70.6%
Note: naive multiplication would yield min(1.0, 0.38 × 2.6 × 1.5) = 100% — clearly incorrect. The logistic method produces 70.6%, reflecting appropriate diminishing returns.
References
01Thomas, D.W., Burns, J., Audette, J., Carroll, A., Dow-Hygelund, C., & Hay, M. (2021). Clinical Development Success Rates and Contributing Factors 2011–2020. BIO, QLS Advisors, Informa Pharma Intelligence.
02Wong, C.H., Siah, K.W., & Lo, A.W. (2019). Estimation of clinical trial success rates and related parameters. Biostatistics, 20(2), 273–286.
03Citeline (2024). Pharma Intelligence Global Clinical Trials Database. Modality-specific pipeline data used to calibrate transition rates for bispecifics, ADCs, cell therapy, and gene therapy.
04Minikel, E.V., Painter, J.L., Dong, C.C., & Nelson, M.R. (2024). Refining the impact of genetic evidence on clinical success. Nature, 629, 624–629.
05Sun, J., Wei, Q., & Zhou, Y. (2025). Dynamic success rates of drug clinical trials. Nature Communications, 16, 1629.
06Mullard, A. (2016). Parsing clinical success rates. Nature Reviews Drug Discovery, 15, 447.
07Parker, J.L., Zhang, Z.Y., & Buckstein, R. (2015). Clinical trial risk in hematology and oncology: the effect of biomarker use. ASCO Annual Meeting Abstracts.
08Tufts Center for the Study of Drug Development / NEWDIGS (2023). Cell and Gene Therapy Success Rates.
Methodology version: methodology@2026-04 · Last updated: 2026-04-30
PhaseFolio