NYU Stern MSBAI Capstone

DriftBreaker

Credit Risk Model Drift Detection Using Survival Analysis

2.26M
Loans Analyzed
$34.0B
Total Exposure
11.61%
Default Rate
$4.07B
Total Defaults

Framework Architecture

BI Analytics

Portfolio metrics, originations, defaults, book composition, cumulative PD curves

Model Engine v24.0

Champion GBM + Challenger Logit, Isotonic calibration, EWMA macro overlay, Forensic Portfolio Engine

Strategy

Financial Confusion Matrix, NPV-based P&L, Loss Estimation (EL=EAD×PD×LGD), Decision Matrix

Project Objectives

  • Detect Model Drift: Build a framework to identify when credit risk models deviate from expected performance using survival analysis techniques.
  • Attribution Analysis: Decompose drift into macro (economic) and micro (underwriting) components to inform remediation strategies.
  • P&L Impact: Translate model drift into financial terms — margin compression, variance attribution, and required rate calculations.
  • Actionable Decisions: Provide clear EXIT / REPRICE / MONITOR recommendations by segment based on quantitative analysis.
Data Source: Lending Club 2007-2018 | Highest Default: 2015 (18.25%) | Highest Risk Segment: High Risk (30.38%)

Lending Club

Pioneer in Peer-to-Peer Lending

Lending Club was founded in 2006 and became the world's largest peer-to-peer lending platform, facilitating over $50 billion in loans before transitioning to a neobank model in 2020. The platform connected borrowers seeking personal loans with investors looking for yield, disrupting traditional banking by removing the intermediary.

Company Timeline

2006 Founded in San Francisco
2007 Launched P2P lending platform
2014 IPO on NYSE ($LCG)
2016 CEO resignation amid scandal
2020 Acquired Radius Bank, became neobank
2021 Ceased P2P operations entirely

Platform Statistics

Total Loans Issued $50B+
Borrowers Served 3M+
Average Loan Size $15,000
Interest Rate Range 6% - 36%
Loan Terms 36 or 60 months

Grading System

Lending Club assigned grades A through G to borrowers based on creditworthiness, with subgrades 1-5 within each letter grade. This risk stratification determined interest rates and was central to investor decision-making.

A
Lowest Risk
B
Low Risk
C
Medium
D
Medium-High
E
High Risk
F
Higher Risk
G
Highest Risk
Why This Dataset: Lending Club's publicly available loan data (2007-2018) provides a rich, real-world dataset with known outcomes — ideal for studying credit risk model performance, drift detection, and the consequences of underwriting changes over time.

Portfolio Summary

Lending Club data (2007-2018)

2,260,668
Total Loans
$34.00B
Total Exposure
$4.07B
Default Volume
11.61%
DR (Count)
11.98%
DR ($)

By Segment

Segment Loans Exposure Default Rate (Count) Default $
High Risk 189,607 $3,411.7M 28.71% 29.77%
Medium Risk 974,477 $14,869.6M 14.68% 14.65%
Low Risk 1,096,584 $15,722.9M 5.92% 5.60%

By Term

TermLoansExposureDefault Rate
36 months 1,609,754 $20,513.0M 10.70%
60 months 650,914 $13,491.2M 17.15%

Timing Metrics

5.13%
PD12
9.78%
PD24
Q5-Q6
Peak Hazard
14.0
Median Months to Default

Portfolio Composition Evolution

Stacked area chart showing portfolio mix shift over time. Low Risk grew from 40% (2007) to 56% (2018), while High Risk declined from 13% to 5%.

Originations vs Defaults

Annual and cumulative volume analysis

Key Insight: 2015 vintage shows highest default rate (18.25%) despite strong economic conditions, suggesting underwriting deterioration. 2018 shows artificially low rates (2.72%) due to loan immaturity.

Chart shows ultimate default rates by vintage year. Color coding: Red >18% | Yellow >16% | Green <5% | Blue = Normal

Originations vs. Default Dollars by Year

Dual-axis chart: bars show origination volume (left axis), line shows defaulted dollar amounts (right axis). Note 2015 peak in both originations ($6.4B) and defaults ($1.17B).

Annual Metrics

VintageLoansVolumeDefaultsDefault $Default Rate
2007 251 $2.2M 45 17.93%
2008 1,562 $14.4M 247 15.81%
2009 4,716 $46.4M 594 12.60%
2010 11,536 $122.1M 1,487 12.89%
2011 21,721 $261.7M 3,297 15.18%
2012 53,367 $718.4M 8,644 16.20%
2013 134,814 $1.98B 21,030 15.60%
2014 235,629 $3.50B 41,408 17.57%
2015 421,095 $6.42B 76,851 18.25%
2016 434,407 $6.40B 71,666 16.50%
2017 443,579 $6.59B 44,854 10.11%
2018 495,242 $7.94B 13,460 2.72%

Default Analysis

Comprehensive default timing, curves, and early warning indicators

Critical Findings: Peak default risk occurs in months 13-18 (20.85% of defaults). 2015-2016 vintages show elevated risk (18.25% and 16.50% ultimate DR). Early defaults (PD12) highest in 2016 at 6.68%, with High Risk segment reaching 17.93% early default rate.
13-18
Peak Hazard Window (months)
18.25%
Worst Vintage (2015)
14
Median TTD (months)
40.8%
Defaults by Month 12
2.72%
Best Vintage (2018)

Cumulative Default Curves by Vintage

Each line represents a vintage cohort's cumulative default rate over 36 months. 2015-2016 vintages show steep acceleration post-month 12.

Defaulted Dollar Amounts by Vintage

Bar chart shows actual dollar amount of defaults by origination year. 2015 had highest default dollars ($1.17B), driven by both volume and elevated default rate.

Monthly Hazard Rate (Conditional Default Probability)

Hazard rate = P(default in month t | survived to month t). Peak around months 13-16, then declining.

Default Distribution by Time Period

Histogram showing concentration of defaults by month bucket.

Median TTD: High Risk defaults 3 months earlier than Low Risk.

Cumulative Default Timing

Timeframe Defaults % of Total Cumulative %
1-3 months 11,859 4.18% 4.18%
4-6 months 28,837 10.17% 14.35%
7-9 months 35,058 12.36% 26.71%
10-12 months 39,964 14.09% 40.80%
13-18 months (Peak) 59,131 20.85% 61.65%
19-24 months 45,914 16.19% 77.84%
25-36 months 49,145 17.33% 95.17%
37+ months 11,246 3.97% 99.14%

Early Defaults (PD12) Analysis

PD12 by Vintage

2016 had worst early default rate (6.68%), 2018 best (2.57%).

Segment × Vintage Heatmap

High Risk 2016: 17.93% early default rate.

Vintage Performance Comparison

Vintage Loans Volume PD12 PD24 PD36 Ultimate DR

Drift Detection

Population Stability Index and performance monitoring

0.08
PSI Score
⚠ MINOR SHIFT
0.45x
Actual/Expected PD
⚠ UNDER-PREDICTING
2016
Peak Drift Vintage
Alert: A/E ratio of 0.45x indicates model is significantly under-predicting actual defaults. Recalibration recommended.

A/E Ratio Over Time

A/E ratio declining from 1.05 (2016Q1) to 0.45 (2018Q4) shows progressive model under-prediction as vintage matures.

MetricThresholdCurrentStatus
PSI (Overall) < 0.10 0.08 MINOR SHIFT
A/E Ratio 0.8 - 1.2 0.45x CRITICAL
KS Statistic < 0.05 0.04 STABLE
Gini Coefficient > 0.30 0.33 ACCEPTABLE
Brier Score < 0.10 0.072 GOOD

Drift Detection Techniques (Dirty Dozen)

Click on any technique to learn more about how it detects distribution shifts.

1 Population Stability Index (PSI)
0.08
2 Actual/Expected (A/E) Ratio
0.45x
3 Kolmogorov-Smirnov (KS) Statistic
0.04
4 Jensen-Shannon Divergence (JSD)
0.02
5 Wasserstein Distance (Earth Mover's)
0.03
6 Chi-Square Test
12.4
7 Gini Coefficient
0.33
8 Brier Score
0.072
9 Adversarial AUC
0.52

Status Classification: The system aggregates metrics to classify drift as STABLE, WARNING, or CRITICAL.

Macro/Micro Attribution

Decomposing drift sources into economic vs underwriting factors

Attribution Result: Of the total model drift observed, approximately 35% is attributable to macro factors (economic conditions) while 65% is attributable to micro factors (underwriting/population shifts). This suggests underwriting policy review should be prioritized.

Drift Attribution Waterfall

Waterfall shows how base default rate (11.61%) is impacted by macro and micro factors to reach observed rate. Fed-calibrated coefficients: β_unemp=+0.12/σ, β_spread=+0.16/σ.

+0.12
β Unemployment
per 1σ (2.25pp)
+0.16
β HY Spread
per 1σ (2.75%)
35%
Macro Attribution
65%
Micro Attribution

Attribution Methodology

Macro Factors (35%)

Economic conditions affecting all borrowers uniformly

  • Unemployment rate: +0.5pp
  • HY Spread: +50bps
  • Consumer sentiment: -5pts
Combined macro scalar: 1.08x

Micro Factors (65%)

Underwriting and population characteristics

  • DTI loosening: 38% → 43%
  • Grade D/E share: +8%
  • Avg income: -$5K
Combined micro scalar: 1.15x

Model Methodology

Forensic Portfolio Engine - Architecture & Key Assumptions (Grade Excluded from Features)

Three-Component Framework

Final Probability of Default = Micro Score × Vintage Multiplier × Macro Scalar

1. Micro Scorecard

  • Champion: GBM (HistGradientBoostingClassifier)
  • Challenger: Logistic Regression
  • • 9 features (Grade EXCLUDED)
  • • Isotonic + Segment-specific calibration

2. Vintage Curve

  • • Empirical default rate by quarter
  • • 3-month discrete periods
  • • Panel expansion for survival
  • • Period categories: Y1-Y5+

3. Macro Overlay

  • • 3 indicators: TDSP, UNRATE, CORCACBS
  • • EWMA smoothing (α=0.3)
  • • 1-quarter lag effect
  • • Soft cap: 0.8x - 1.25x scalar

Key Assumptions

1. Multiplicative Independence

Assumes independence between micro, vintage, and macro components. No interaction terms modeled.

2. Quarterly Discretization

Continuous time converted to discrete quarters. Months on book (MOB) mapped to quarters (q = floor(MOB/3)).

3. Fixed 1-Quarter Lag

Economic conditions affect loans issued in the next quarter. Q1 conditions impact Q2 originations.

4. Forensic Date Accuracy

Assumes `last_pymnt_d` accurately reflects true loan duration. Captures early payoffs and extended terms (1-60 months).

5. Default Assignment at Final Month

Defaults assigned only at the final month of loan duration, not throughout the loan lifecycle.

Key Facts

Feature Set (9 Features - Grade EXCLUDED)

  • Numeric (6): loan_amnt, dti, annual_inc, revol_util, emp_length_int, pti
  • Categorical (3): period_cat, home_ownership, purpose
  • Note: Grade is used for risk segmentation only, NOT as a model feature
  • Engineered: pti = installment / (monthly_income + 1)

Based on Forensic Portfolio Engine methodology

Macro Indicators (3) + Financial Parameters

  • UNRATE: Unemployment Rate (lagged 1Q)
  • TDSP: Total Debt Service Payments
  • CORCACBS: Consumer Credit (lagged 1Q)

Financial Parameters

  • • LGD: 70%
  • • Required Return: 6%
  • • Fee Servicing: 1%
  • • Fee Collection: 18%
  • • Recovery Lag: 6 months

Model Training Details

Champion: GBM (HistGradientBoostingClassifier)

  • • max_iter = 300
  • • learning_rate = 0.05
  • • max_depth = 6
  • • early_stopping = True
  • • scoring = 'neg_log_loss'
  • • n_iter_no_change = 20

Challenger: Logistic Regression

  • • max_iter = 300
  • • solver = 'saga'
  • • n_jobs = -1 (parallel)

Calibration

  • • Isotonic regression (CalibratedClassifierCV)
  • • Segment-specific calibrators (low/med/high risk)
  • • Min 1000 samples per segment

Economics & NPV Calculation

Expected Loss = EAD × PD × LGD  |  NPV = -Principal + PV(Payments) + PV(Recoveries)
70%
LGD
6%
Required Return
1%
Servicing Fee
18%
Collection Fee
6mo
Recovery Lag

Financial Confusion Matrix

TP (Good Catch)
Model warned + NPV breach occurred
FN (The Bleed)
No warning + NPV breach occurred
FP (False Alarm)
Model warned + No breach
TN (Smooth)
No warning + No breach

Warn threshold at 90th percentile of horizon PD. Only terminal loans (Fully Paid, Charged Off) evaluated - active loans excluded.

Drift Detection & Diagnostics

The system computes multiple statistical metrics to detect distribution shift and model performance:

Core Drift Metrics

  • PSI - Population Stability Index (10 bins)
  • KS Statistic - Kolmogorov-Smirnov distance
  • Mean Shift - Δ in predicted probabilities
  • Variance Ratio - σ²(curr) / σ²(base)

Performance Diagnostics

  • Horizon AUC - Loan-level discrimination
  • Horizon PR-AUC - Precision-Recall
  • A/E Ratio - Actual vs Expected defaults
  • Overprediction Factor - Calibration check

A/E Interpretation: 1.0 = Perfect | >1.2 = Underpredicting | <0.8 = Overpredicting

Leakage Audit

  • ID Disjointness: Verifies no loan IDs overlap between train/cal/test sets
  • Permutation Test: Shuffled labels should yield AUC ≈ 0.5 (no signal from leakage)

P&L Analysis

NPV-based margin analysis with time-aware discounting (LGD: 70%, Required Return: 6%)

14.00%
Avg Coupon Rate
4.0%
Cost of Funds
2.0%
OpEx
$-639,000,000
Variance vs Budget

Margin Waterfall (High Risk Segment)

Coupon
14.0%
- Cost of Funds
-4.0%
- OpEx
-2.0%
- Expected Loss
-22.3%
Net Margin
-14.3%

Hazard & Margin Analysis by Segment

SegmentQ Hazard (Base)Q Hazard (Drifted)Annual PDCouponMarginRequired
High Risk 7.14% 8.57% 30.1% 14.0% -22.1% 36.1%
Medium Risk 3.27% 3.69% 14.0% 12.0% -8.0% 20.0%
Low Risk 1.23% 1.34% 5.3% 9.0% -2.3% 11.3%

Decision Matrix

Strategic recommendations by segment

High Risk

EXIT
  • 30.38% default rate
  • -18.49% margin
  • Required: 47.20%
Action: STOP LENDING - Required rate exceeds usury cap

Medium Risk

EXIT
  • 15.96% default rate
  • -5.96% margin
  • Required: 26.96%
Action: STOP LENDING - Required rate exceeds usury cap

Low Risk

REPRICE
  • 6.47% default rate
  • -2.47% margin
  • Required: 17.47%
Action: RAISE RATE TO 17.5% - Current margin negative

DB Intelligence

AI-powered portfolio analysis & strategic recommendations

Online Gemini 3 Flash
DB Assistant
Credit Risk Intelligence
DB Assistant
Hello! I'm DB, your DriftBreaker intelligence assistant. Ask me anything about the portfolio, drift metrics, or use the Scenario Builder page to model stress scenarios!
Quick Questions

Scenario Builder

Model macro shocks, underwriting changes, and pricing strategies

Macro Shocks
+0.0%
-2%Baseline+8%
+0bps
-100bpsBaseline+500bps
+0pts
-40ptsBaseline+20pts
Underwriting Policy
43%
25% (Tight)50% (Loose)
$30K
$20K$100K
Pricing Strategy
+0bps
-200bps+500bps

Scenario Impact (Live)

Portfolio PD
11.61% 11.61%
+0.00%
Macro Scalar
1.00x 1.00x
+0.00x
Annual Losses
$4.07B $4.07B
+$0.00B
Volume Impact
100%
-0% of book
Segment Impact
High Risk
30.38% -18.49%
Medium Risk
15.96% -11.23%
Low Risk
6.47% -6.14%

Pre-built Scenarios