Methodology · Historical Snapshot

Probability of Success Calibration

You are viewing a frozen historical snapshot

This is methodology@2026-04 as it was on 2026-04-30 — the immutable record any signed export stamped methodology@2026-04 was computed under. It is intentionally the citation prose, not the current presentation, and it will never change. The methodology has since advanced (current is methodology@2026-05-21). Current version · Full version history · Verify an export

PhaseFolio derives stage-transition probabilities from observed clinical outcomes rather than expert opinion, following Thomas et al. (2021) and Wong et al. (2019). The benchmark matrix is three-dimensional (11 indications × 8 modalities × 3 biomarker strategies = 264 cells); seven evidence-based multipliers are applied through a log-odds (logit) transformation to keep results bounded and reflect diminishing returns at high baselines.

1. Three-dimensional benchmark matrix

Rather than applying a single set of industry-average transition rates, PhaseFolio stratifies PoS by three independent classification axes that are known to materially affect clinical outcomes: therapeutic area, drug modality, and biomarker strategy.

Therapeutic Area. 11 indications from oncology (solid and hematologic) through cardiovascular, neurology, metabolic, and rare disease. Oncology solid tumor has the lowest overall LoA (~2.5%); rare disease the highest (~9.4%). Source: Thomas et al. 2021.
Drug Modality. 8 modalities including small molecule, monoclonal antibody, bispecific, ADC, cell therapy, gene therapy, and peptide. Modality affects safety profiles and regulatory pathways. Source: Citeline 2024.
Biomarker Strategy. 3 levels: none, enrichment (biomarker-selected population), and companion diagnostic (required for Rx). Biomarker use drives Phase II and III success rates up to 4×. Source: Parker et al. 2015.

The full matrix contains 11 × 8 × 3 = 264 unique indication–modality–biomarker combinations, each specifying five stage-transition probabilities (Preclinical → Phase I → Phase II → Phase III → NDA/BLA → Approval). Values are derived from meta-analysis of 12,728+ clinical-stage transitions (BIO/QLS/Informa 2021) with modality-specific adjustments from Citeline 2024 pipeline data.

Table 1 — Baseline stage-transition probabilities by therapeutic area (small molecule, no biomarker strategy; Overall LoA is the product of all five transition rates):

Oncology (Solid): Preclinical 5.0%, Phase I 40.0%, Phase II 24.0%, Phase III 55.0%, NDA/BLA 90.0%, Overall LoA 2.4%
Oncology (Hematologic): Preclinical 7.0%, Phase I 72.0%, Phase II 42.0%, Phase III 63.0%, NDA/BLA 90.0%, Overall LoA 12.0%
Rare Disease: Preclinical 8.0%, Phase I 56.0%, Phase II 38.0%, Phase III 64.0%, NDA/BLA 93.0%, Overall LoA 9.4%
Neurology: Preclinical 4.0%, Phase I 46.0%, Phase II 20.0%, Phase III 47.0%, NDA/BLA 88.0%, Overall LoA 1.5%
Immunology: Preclinical 6.0%, Phase I 49.0%, Phase II 30.0%, Phase III 58.0%, NDA/BLA 91.0%, Overall LoA 4.6%
Infectious Disease: Preclinical 7.0%, Phase I 52.0%, Phase II 36.0%, Phase III 62.0%, NDA/BLA 92.0%, Overall LoA 7.4%
Cardiovascular: Preclinical 5.0%, Phase I 48.0%, Phase II 28.0%, Phase III 55.0%, NDA/BLA 90.0%, Overall LoA 3.3%
Metabolic: Preclinical 6.0%, Phase I 50.0%, Phase II 32.0%, Phase III 58.0%, NDA/BLA 91.0%, Overall LoA 5.1%
Respiratory: Preclinical 5.0%, Phase I 47.0%, Phase II 26.0%, Phase III 54.0%, NDA/BLA 90.0%, Overall LoA 2.9%
Dermatology: Preclinical 6.0%, Phase I 50.0%, Phase II 34.0%, Phase III 60.0%, NDA/BLA 91.0%, Overall LoA 5.6%
Ophthalmology: Preclinical 6.0%, Phase I 50.0%, Phase II 30.0%, Phase III 58.0%, NDA/BLA 91.0%, Overall LoA 4.7%

Source: BIO/QLS/Informa 2021; Wong et al. 2019.

2. Multiplier adjustments

Several evidence-based factors are known to shift clinical success probabilities relative to the population base rate. PhaseFolio applies these multipliers via a log-odds (logit) transformation — the mathematically correct method when a multiplier is a true odds ratio. The cited sources, however, report effect sizes in different forms: relative success ratios (Minikel 2024), phase success-rate comparisons (Parker 2015), relative approval rates (Mullard 2016), and a cumulative pipeline advantage (Tufts NEWDIGS 2023).

The engine currently treats all of these through the OR-style logit path as a deliberately conservative approximation — this under-credits favorable modifiers at higher baselines, never saturates to 1.0, and avoids stacked-modifier overshoot. Per-modifier estimand declarations (_source_estimand and _applied_as) are recorded in the machine-readable source and explained in the model card. Each multiplier is applied only to the clinical phases where the underlying evidence was measured.

Table 2 — Evidence-based multipliers (source estimand separated from applied path; favorable multipliers >1 increase PoS; unfavorable <1 decrease):

Genetic Validation: 2.6× (RR estimand, OR-applied), Phase II/III, Minikel et al. 2024 Nature
Companion Diagnostic: 2.0× (RR, OR-applied), Phase II/III, Parker et al. 2015 ASCO
Orphan Designation: 1.5× (RR, OR-applied), Phase II/III, Mullard 2016 Nat. Rev. Drug Disc.
Biomarker Enrichment: 1.5× (RR, OR-applied), Phase II/III, Parker et al. 2015; BIO 2021
First-in-Class: 0.85× (RR, OR-applied), Phase II/III, BIO/QLS 2021
CAR-T / TCR Therapy: 1.73× per stage (RR cumulative 3×, OR-applied), Phase I/II, Tufts NEWDIGS 2023
Gene Therapy (Orphan): 1.41× per stage (RR cumulative 2×, OR-applied), Phase I/II, Tufts NEWDIGS 2023

CAR-T and gene-therapy-orphan per-stage values are sqrt(cumulative) splits of the source's whole-pipeline advantage.

A critical design decision: preclinical PoS is never adjusted by any multiplier. Preclinical attrition is dominated by toxicology, pharmacokinetics, and formulation failures (Sun et al. 2025) — factors orthogonal to the clinical efficacy signals that these multipliers capture. Similarly, NDA/BLA approval rates reflect regulatory filing quality rather than drug-specific clinical attributes, and are therefore held constant.

3. Logistic transformation method

Applying odds ratios to bounded probabilities requires care. Naive multiplication (PoS × OR, capped at 1.0) produces mathematically unsound results: a drug with 50% base PoS and a 2.6× genetic-validation multiplier would yield 130%, capped to 100% — falsely claiming certainty. With multiple favorable multipliers stacking, this problem cascades rapidly.

PhaseFolio instead applies multipliers in log-odds (logit) space — the standard biostatistical transformation for adjusting bounded probabilities by a multiplicative factor. The three-step transformation:

Equation 1 — Logistic odds-ratio adjustment. Step 1: odds = PoS_base / (1 − PoS_base). Step 2: odds_adj = odds × OR. Step 3: PoS_adj = odds_adj / (1 + odds_adj).

This approach has three desirable mathematical properties:

Bounded output. The result is always in (0, 1) — it can never reach 0% or 100%, regardless of how many multipliers are stacked.
Diminishing returns. A 2.6× OR applied to a 24% base PoS yields 45.1% (+21.1pp). Applied to a 70% base, it yields 85.8% (+15.8pp). The higher the base, the harder it is to push higher — matching clinical reality.
Composability. Multiple ORs applied sequentially produce the same result regardless of order, because multiplication in log-odds space is commutative.

4. Worked example: PoS derivation

Consider a rare disease small molecule with genetic validation and orphan designation. We derive the Phase II PoS step-by-step.

Base rate (from benchmark matrix): Phase II PoS = 38.0%.

Apply genetic validation (factor 2.6, applied as OR): odds = 0.38 / (1 − 0.38) = 0.613; odds × 2.6 = 1.594; PoS = 1.594 / (1 + 1.594) = 61.5%. Factor source: Minikel et al. 2024 (RR); applied as OR.

Apply orphan designation (factor 1.5, applied as OR): odds = 0.615 / (1 − 0.615) = 1.597; odds × 1.5 = 2.396; PoS = 2.396 / (1 + 2.396) = 70.6%. Factor source: Mullard 2016 (RR); applied as OR.

Adjusted Phase II PoS: 70.6%.

Note: naive multiplication would yield min(1.0, 0.38 × 2.6 × 1.5) = 100% — clearly incorrect. The logistic method produces 70.6%, reflecting appropriate diminishing returns.

Key facts

Benchmark matrix size	11 indications × 8 modalities × 3 biomarker = 264 cells
Source N (stage transitions)	12,728+ (BIO/QLS/Informa 2021)
Multiplier count	7 evidence-based factors
Transformation method	Log-odds (logit) — bounded, diminishing-returns, commutative
Preclinical adjustment	Never adjusted (orthogonal failure modes)

References

01Thomas, D.W., Burns, J., Audette, J., Carroll, A., Dow-Hygelund, C., & Hay, M. (2021). Clinical Development Success Rates and Contributing Factors 2011–2020. BIO, QLS Advisors, Informa Pharma Intelligence.

02Wong, C.H., Siah, K.W., & Lo, A.W. (2019). Estimation of clinical trial success rates and related parameters. Biostatistics, 20(2), 273–286.

03Citeline (2024). Pharma Intelligence Global Clinical Trials Database. Modality-specific pipeline data used to calibrate transition rates for bispecifics, ADCs, cell therapy, and gene therapy.

04Minikel, E.V., Painter, J.L., Dong, C.C., & Nelson, M.R. (2024). Refining the impact of genetic evidence on clinical success. Nature, 629, 624–629.

05Sun, J., Wei, Q., & Zhou, Y. (2025). Dynamic success rates of drug clinical trials. Nature Communications, 16, 1629.

06Mullard, A. (2016). Parsing clinical success rates. Nature Reviews Drug Discovery, 15, 447.

07Parker, J.L., Zhang, Z.Y., & Buckstein, R. (2015). Clinical trial risk in hematology and oncology: the effect of biomarker use. ASCO Annual Meeting Abstracts.

08Tufts Center for the Study of Drug Development / NEWDIGS (2023). Cell and Gene Therapy Success Rates.

Frozen snapshot · methodology version: methodology@2026-04 · Last updated: 2026-04-30 · Version history →