Fair Lending

Does Alternative Data Reduce or Amplify Disparate Impact? Examining the Evidence

By Omar Hassan · September 2, 2025 · 9 min read

Abstract representation of equitable data analysis and disparate impact measurement

Among the most important — and most contested — questions in alternative credit underwriting is whether the use of non-bureau data sources reduces or amplifies disparate impact on protected classes. Advocates argue that cash-flow and transaction data levels the playing field for groups that are disproportionately thin-file or credit-invisible. Critics raise the concern that some alternative data sources — particularly those with geographic or behavioral correlations to race or national origin — could import protected-class discrimination into underwriting through a different door.

The honest answer is that both positions can be true, depending entirely on which signals are used and how the model is validated. This is not a satisfying answer for those seeking a simple advocacy position, but it is the accurate one.

Why FICO Itself Has a Disparate Impact Problem

Before examining alternative data, it is worth establishing the baseline. FICO scores have a well-documented disparate impact on certain protected classes. Research consistently finds that Black and Hispanic Americans score lower on average than white and Asian Americans on traditional FICO models, and that this disparity is not fully explained by income or wealth differences. Part of the explanation is that the features FICO measures — length of credit history, credit mix, amounts owed — capture aspects of the financial system that have historically been less accessible to certain communities due to redlining, discriminatory lending practices, and unequal access to credit products.

This means the baseline the alternative data community is competing against is not a neutral, bias-free model. It is a model with known disparate impact that regulators have nonetheless accepted as a standard because it meets the business necessity and predictive validity tests under ECOA. Any alternative data model needs to clear a similarly structured hurdle — not beat a zero-disparate-impact baseline, because that baseline does not exist.

Cash-Flow Data: Why the Picture Is More Positive

Research on transaction account-based underwriting models — from the CFPB's own published work, academic studies using large bank datasets, and the growing body of evidence from fintechs operating in the thin-file space — has generally found that cash-flow features show lower disparate impact ratios compared to traditional FICO features when measured on thin-file populations.

The intuition behind this finding is that cash-flow signals measure actual financial behavior rather than credit system participation. A borrower's pattern of paying rent on time, maintaining consistent deposits, and avoiding overdrafts is less likely to correlate with race or national origin than their history of credit card usage, which is tied to access patterns that have historically been unequal. When you control for income level, cash-flow consistency metrics tend to show smaller race-based disparities than FICO score distributions at equivalent income levels.

Specific data points from published research suggest that approval rate ratios (the ratio of approval rate for a minority group to approval rate for the reference group) for cash-flow-underwritten thin-file populations are frequently above the EEOC's 80% threshold (four-fifths rule), whereas FICO-based approval ratios for the same thin-file population often fall below that threshold. This is not universal — it depends on the specific features used and the population — but it is a consistent directional finding.

Where Alternative Data Creates New Disparate Impact Risk

The optimistic picture above is conditional. Certain alternative data features create genuine disparate impact risk that requires active mitigation:

Geographic features. Any model feature that directly or indirectly encodes geography — zip code, census tract, even area code in some formulations — can become a proxy for race given US residential segregation patterns. Cash-flow models that include geographic adjustment factors or area-specific calibration need to be tested for geographic correlation with race before deployment.

Income level as a feature. Income level is not a protected class under ECOA, but income is correlated with race in ways that are partially attributable to historical discrimination. Using income level as a standalone feature without controls can import race-correlated disparities into the model. This does not mean income is unusable — it means it must be tested in combination with the full feature set and the disparate impact results examined at the combined model level, not just feature by feature.

Certain spending pattern features. Spending categories can correlate with protected class characteristics. Features derived from spending on particular merchant categories, geographic cash withdrawal patterns, or similar behaviorally-coded signals require careful testing. The question is not whether the feature predicts default — it may well predict default very well — but whether it does so through a mechanism that constitutes disparate impact by functioning as a proxy for protected class status.

The Validation Framework: What Responsible Disparate Impact Testing Looks Like

Under OCC Bulletin 2013-29 and CFPB supervisory expectations, model validation for fair lending purposes requires a systematic testing protocol. For a cash-flow decisioning model, that protocol should include:

Adverse action rate analysis by demographic proxy. Because lenders typically do not have race data for applicants outside of HMDA reporting contexts, testing relies on proxy methods — BISG (Bayesian Improved Surname Geocoding) or similar probabilistic demographic assignment based on name and address. Adverse action rates are computed by proxy race/ethnicity group and tested for statistical significance.
Four-fifths rule application. The EEOC's four-fifths rule requires that the approval rate for a protected group be at least 80% of the approval rate for the group with the highest approval rate. This is a threshold test, not a definitive legal standard, but falling below it triggers investigative obligations.
Regression-based decomposition. Beyond simple approval rate ratios, a regression-based analysis can decompose how much of the approval rate disparity is explained by model features versus residual demographic correlation. This supports the business necessity justification documentation that ECOA examination requires.
Feature-level correlation screening. Each feature in the model should be tested for correlation with demographic proxies. Features with high correlation that are not essential to model performance can be removed or replaced with less proxy-correlated alternatives without significant Gini loss.

We're Not Claiming Cash-Flow Models Are Bias-Free

The argument here is not that cash-flow underwriting eliminates disparate impact. It is more limited: on the specific population where FICO performs poorly — thin-file and credit-invisible borrowers — well-designed cash-flow models with proper validation tend to show lower disparate impact ratios than FICO, while maintaining meaningful predictive accuracy. That is a net improvement for financial inclusion, not a solved problem.

The risk for the field is that the positive narrative around alternative data leads to premature deployment of poorly-validated models that import new forms of discrimination while claiming to advance inclusion. A cash-flow model deployed without disparate impact testing is not a fair lending improvement over FICO — it is an untested assumption. The field's credibility, and lenders' regulatory standing, depend on treating disparate impact testing as a prerequisite for deployment, not an afterthought.

What Ongoing Monitoring Must Catch

Initial validation before deployment is necessary but not sufficient. Disparate impact patterns in deployed models can shift over time as borrower populations change, economic conditions evolve, and model drift affects feature distributions. Monthly or quarterly monitoring of adverse action rates by demographic proxy, with a pre-defined threshold for triggering investigation, is the minimum standard for responsible ongoing deployment.

Lenders evaluating cash-flow decisioning vendors should explicitly ask: what monitoring data do you provide, at what frequency, and in what format? A vendor that cannot produce regular disparate impact monitoring reports from its platform is not a vendor that can support the lender's ongoing fair lending obligations. The monitoring infrastructure is not separate from the decisioning infrastructure — it is part of it.

Does Alternative Data Reduce or Amplify Disparate Impact? Examining the Evidence

Why FICO Itself Has a Disparate Impact Problem

Cash-Flow Data: Why the Picture Is More Positive

Where Alternative Data Creates New Disparate Impact Risk

The Validation Framework: What Responsible Disparate Impact Testing Looks Like

We're Not Claiming Cash-Flow Models Are Bias-Free

What Ongoing Monitoring Must Catch

Related articles

ECOA and Fair Lending Obligations for Decisioning APIs

Thin-File vs. Credit-Invisible: Definitions That Matter

Model Explainability and Adverse Action Notices