Week 9: Factor Investing

Which Stocks Outperform, and Why?

Learning Objectives

After this session you will be able to:

  • Explain what a factor is and how long-short portfolios are constructed
  • Distinguish risk-based from behavioural explanations for factor premia
  • Interpret alpha tests using HAC standard errors
  • Identify false discovery risk in published factor research
  • Evaluate whether factor premia survive transaction costs
  • Explain how JKP UK factor data and HAC inference enter Coursework 2 Scaffold B

Where We Are

Week Question Answer
2 Is the data trustworthy? Survivorship bias, look-ahead traps
3 Can we predict returns? ARIMA: barely (R² ~ 1%)
4 Can we predict risk? GARCH: yes (R² ~ 15–40%)
5 Can we build portfolios? Mean-variance, but estimation error hurts
9 Which stocks outperform? Factor models: today

R² here is conditional on target: ARIMA predicts the return level, GARCH predicts its variance — different quantities, same diagnostic.

We have learned that predicting the level of returns is nearly hopeless. But predicting which stocks beat others is a different problem, and the evidence is stronger.

The Core Question

If two stocks have the same market exposure, why does one outperform the other?

Decades of research show that certain firm characteristics predict which stocks earn higher returns:

  • Cheap stocks tend to beat expensive ones
  • Small firms tend to beat large ones
  • Recent winners tend to keep winning

These patterns are called factors.

What Is a Factor?

A factor is a measurable characteristic of a stock that is statistically associated with differences in expected returns.

Not a single stock. Not a sector bet.

A factor is a systematic tilt across many stocks based on one characteristic:

  • Value: book-to-market ratio (book value from accounts ÷ market value = price × shares; high B/M = priced cheaply vs accounting value)
  • Size: market capitalisation
  • Momentum: recent past return
  • Profitability: operating profit margin
  • Investment: asset growth rate

From Characteristic to Return

How do researchers turn a stock characteristic into a factor return?

Step 1: Rank all stocks by the characteristic (e.g. book-to-market ratio)

Step 2: Form portfolios:

  • High-characteristic group (e.g. highest 30% B/M = cheapest stocks): the long side
  • Low-characteristic group (e.g. lowest 30% B/M = most expensive stocks): the short side

Step 3: Compute the return spread each month:

\[\text{Factor return}_t = R_{\text{long},t} - R_{\text{short},t}\]

Because the short proceeds finance the long purchase, a factor is a zero-cost (self-financing) portfolio. Its “return” is a spread, not growth in wealth.

HML: A Concrete Example

HML = High B/M Minus Low B/M (the value factor)

High B/M means book value is high relative to price — the market is pricing the business cheaply.

Each month, sort all stocks by book-to-market ratio:

Long (cheap stocks)

  • High book-to-market
  • “Boring” companies
  • Often banks, utilities, industrials
  • If these return +2.0%

Short (expensive stocks)

  • Low book-to-market
  • “Glamorous” companies
  • Often tech, biotech, luxury
  • … and these return +1.0%

Sector mapping is typical, not definitional — a tech stock can be value if it falls far enough.

HML return this month = +2.0% − 1.0% = +1.0%

A positive HML means value stocks beat growth stocks.

Part I: Why Factors Exist

CAPM: One Factor

Week 5 recap: the textbook claim that a stock’s expected return depends only on its sensitivity to the market.

The Capital Asset Pricing Model (1960s) says expected returns depend on one thing only:

\[E[R_i] - R_f = \beta_i \times (E[R_m] - R_f)\]

  • \(R_f\): the risk-free rate (1-month T-bill return, the “safe” benchmark)
  • \(E[R_m]\): the expected market return; \(E[R_m] - R_f\) is the equity risk premium
  • \(\beta_i\): slope from regressing stock \(i\)’s excess returns on the market’s excess returns — how much stock \(i\) moves with the market
  • High beta → high expected return (compensation for risk)
  • Low beta → low expected return

Implication: The only way to earn more is to take more market risk. No other characteristic should matter.

This is testable. And the tests failed.

Where CAPM Breaks

Fama and French (1992) tested CAPM’s predictions on US stocks (1963–1990) using cross-sectional regressions (regressing average returns across firms on β, size, and B/M):

What CAPM predicts

  • Beta should explain all return differences
  • No other characteristic should matter
  • High-beta stocks should earn more

What the data shows

  • Once size and B/M are included, β no longer adds marginal explanatory power
  • Size and book-to-market predict returns
  • High-beta stocks do not reliably earn more

This paper is one of the most cited in finance. It motivated the shift from one factor to many.

Three Factors (1993)

Fama and French (1993) proposed a model with three long-short portfolios:

Factor Full name Construction Captures
MKT Market Market return minus risk-free rate Overall equity premium
SMB Small Minus Big Small-cap minus large-cap returns Size premium
HML High Minus Low Value minus growth returns Value premium

On the 25 US size × B/M–sorted portfolios (1963–1991), this model explains ~90% of time-series return variation, versus ~70% for CAPM alone. R² depends on the test assets and sample — it is not a universal number.

Five Factors (2015)

Fama and French (2015) added two more factors:

Factor Full name What it captures
RMW Robust Minus Weak Profitable firms beat unprofitable
CMA Conservative Minus Aggressive Low-investment firms beat high-investment

Why these two? Firms that are profitable and invest conservatively tend to earn higher returns. Both a risk story (profitable, low-investment firms have more durable cash flows, hence lower distress risk) and a behavioural story (the market underprices boring, profitable companies) fit the evidence — we revisit this tension on the next slide.

And one important factor that Fama-French deliberately excluded:

Momentum (MOM): buy recent winners, sell recent losers (Jegadeesh and Titman 1993)

  • 12-month formation period, skip the most recent month (to avoid short-term reversal contaminating the signal)
  • Strongest short-term predictor, but also the most volatile

Why exclude it? No coherent risk story — momentum pays off for a purely behavioural reason (investor under-reaction), which doesn’t fit Fama-French’s rational-risk framework. They called it the “main embarrassment of the three-factor model” in 1996 and have treated it as an anomaly, not a factor, ever since.

Why Do Premia Exist?

Risk explanation

  • Value stocks are distressed → riskier → require higher return
  • Small stocks are illiquid → harder to sell in a crisis (the size risk story is the most contested of the three)
  • Factors compensate for systematic risk
  • If true: premia should persist (risk doesn’t go away)

Behavioural explanation

  • Investors overreact to bad news → value stocks become too cheap
  • Investors chase glamour → growth stocks become too expensive
  • Factors exploit investor mistakes
  • If true: premia may shrink as investors learn

The honest answer: probably both. Some factors look like risk compensation, others look like mispricing. The distinction matters because it affects whether you expect the premium to survive.

Part II: What the Data Shows

The JKP Dataset

For this module we use the Jensen, Kelly, and Pedersen (2023) global factor dataset:

  • 71 countries, monthly frequency
  • 14 factor return series (including MKT, SMB, HML, MOM, RMW, CMA)
  • US sample: January 1926 to December 2023 (1,176 months)
  • UK sample: January 1986 to December 2023 (456 months)
  • Within portfolios: value-weighted (bigger firms get more weight)
  • Breakpoints: NYSE-capitalisation quantiles (so microcaps don’t dominate the sort)

This is the dataset you will use in Lab 9 and Coursework 2 Scaffold B. Understanding what it contains (and what construction choices it embeds) is essential.

Factor Performance: US

Factor Sharpe Ratios

Sharpe ratio = mean excess return ÷ standard deviation of excess return. Annualised here by × √12. Read the Sharpe column — nothing above 0.5, market included.

US Factor Performance, 1963–2023 (JKP dataset)

     Mean (% monthly)  Std (% monthly)  Sharpe (annualised)
MKT             0.641            5.007                0.444
SMB             0.110            2.272                0.167
HML             0.343            3.225                0.368
MOM             0.360            3.179                0.393
RMW             0.225            2.049                0.380
CMA             0.277            2.113                0.454

Notice: Sharpe ratios are modest (0.2–0.4). No factor delivers a “free lunch.” The premia are real but small and volatile.

US vs UK Factors

Annualised Sharpe ratios, 1986–2023

     US Sharpe  UK Sharpe
HML      0.297      0.356
SMB      0.008     -0.014
MOM      0.312      0.491
RMW      0.457      0.514
CMA      0.380      0.322

US observations: 456, UK observations: 453

The UK sample has only 456 months versus 1,176 for the US, so UK premium estimates carry roughly 1.6× the standard error of their US equivalents. Coursework 2 uses UK data, so you will work with wider confidence intervals. This is not a bug; it is the reality of international factor research.

Bloomberg Context

This is where Bloomberg helps: it anchors factor discussion in market context. If UK equities structurally underperform US equities over a period, UK factor estimates should be interpreted within that backdrop, not in isolation.

Academic vs Investable

Academic vs investable (monthly, overlapping sample)

Correlation (SPY total vs JKP MKT excess): 0.950
Observations: 71

Note: SPY is a TOTAL return; JKP MKT is an EXCESS return.
Level means are therefore NOT comparable without adjusting for R_f.
Correlation is the robust comparison here.

Even for the market factor (the easiest case), an investable proxy and an academic factor are not directly interchangeable: SPY is a total return while JKP MKT is an excess return (market minus risk-free). Their monthly co-movement is very high (correlation ≈ 0.99), but level comparisons require adding \(R_f\) back or subtracting it. For style factors (value, momentum), the gap between academic and investable versions is usually larger still — differences come from construction conventions, investability frictions, and data handling.

Gray: The Practitioner Lens

Wesley Gray (Alpha Architect, Quantitative Value, Quantitative Momentum) bridges academic research and practice:

  • Backtested returns ≠ implementable returns
  • Transaction costs, execution timing, and capacity constraints all erode the premium
  • The academically optimal strategy and the investable strategy are often different

Gray’s three rules of thumb for implementable factor investing (Gray and Vogel 2016):

  1. Rebalance less frequently than the literature suggests
  2. Accept some tracking error versus the academic benchmark
  3. Prefer low-turnover factors (value, quality) over high-turnover factors (momentum)

Part III: Testing Factor Models

What Is Alpha?

Before any equations, the intuition:

Alpha is the return that remains after you account for a portfolio’s exposure to known factors.

  • If a fund returns 12% and its factor exposures explain 11.7% (both annualised), the alpha is 0.3%.
  • Alpha measures skill (or luck): the part of performance that factors cannot explain.

Most active fund managers have negative alpha after fees (Fama and French 2010). This is why factor exposure analysis matters: it separates what you can get cheaply (factors) from what you are paying for (alpha).

Building the Attribution Equation

We build up in layers:

Layer 1 (CAPM): \(R_p - R_f = \alpha + \beta_{MKT}(R_m - R_f) + \epsilon\)

One factor: market. Alpha is everything the market doesn’t explain. (ε = residual — the random part each period that the factors cannot explain.)

Layer 2 (Fama-French 3): Add size and value:

\(R_p - R_f = \alpha + \beta_{MKT}(R_m - R_f) + \beta_{SMB} \cdot SMB + \beta_{HML} \cdot HML + \epsilon\)

Alpha often shrinks: some “skill” was really factor exposure.

Layer 3 (Five-factor): Add profitability and investment:

\(R_p - R_f = \alpha + \beta_{MKT}MKT + \beta_{SMB}SMB + \beta_{HML}HML + \beta_{RMW}RMW + \beta_{CMA}CMA + \epsilon\)

Alpha shrinks further. What remains is genuine outperformance (or noise).

Worked Example

All figures below are annualised percentage returns. Loadings are betas (unitless). MKT in the table is already the market excess return, so the MKT factor return is \(R_m - R_f\).

A fund returned 12% last year. Market returned 8%. Risk-free rate: 2%.

Step 1: Excess return = 12% − 2% = 10%

Step 2: Suppose factor loadings are:

Factor Loading (β) Factor return Contribution
MKT 1.10 6.0% 6.60%
SMB 0.25 1.5% 0.375%
HML 0.40 2.0% 0.80%
RMW 0.15 1.8% 0.27%
CMA 0.10 1.0% 0.10%
Total explained 8.145% ≈ 8.2%

Step 3: Alpha = 10.0% − 8.2% = 1.8% (annualised)

But is 1.8% statistically significant, or could it be noise? That depends on the standard error, which is where HAC inference comes in.

Your Turn

A different fund returned 9% last year. Risk-free rate: 2%. Factor returns same as before.

Factor loadings: β_MKT = 0.95, β_SMB = −0.10, β_HML = 0.30, β_RMW = 0.20, β_CMA = 0.05

Before I show you: estimate the alpha on paper. You have 60 seconds.

Factor β × Return Contribution
MKT 0.95 × 6.0% 5.70%
SMB −0.10 × 1.5% −0.15%
HML 0.30 × 2.0% 0.60%
RMW 0.20 × 1.8% 0.36%
CMA 0.05 × 1.0% 0.05%
Total 6.56%

Alpha = (9% − 2%) − 6.56% = 0.44%

Why Standard Errors Matter

An alpha of 0.44% per year sounds small. Is it statistically different from zero?

To answer this, we need a t-statistic: \(t = \frac{\hat{\alpha}}{SE(\hat{\alpha})}\)

(Rough rule from Week 3 stats: |t| > 2 ≈ “probably not zero” at the 5% level.)

The problem: residuals from factor-model regressions violate OLS’s iid assumption. They exhibit persistent volatility (Week 4’s GARCH story) and, for some factors, serial dependence from overlapping formation windows (momentum especially).

If we ignore this and use OLS standard errors, we get standard errors that are too small → t-statistics that are too large → false significance.

The fix: HAC (Heteroskedasticity- and Autocorrelation-Consistent) standard errors — also known as Newey–West — widen the error bars to reflect both issues (Newey and West 1987). HAC standard errors are larger, and the t-statistics are more honest.

OLS vs HAC: The Numbers

Setup: regress US HML on US MKT, 1926–2023. The intercept is HML’s CAPM-alpha — the part of HML’s return that isn’t just market exposure. We use 6 monthly lags in HAC, a common choice for monthly equity factors; Lab 9 explores sensitivity.

HML alpha test (regress HML on MKT, US 1926–2023)

                          OLS   HAC (NW-6)
Alpha (monthly)        0.0041       0.0041
Std Error              0.0008       0.0010
t-statistic              4.96         4.04

HAC SE is 1.2x larger than OLS SE.
The t-statistic drops — significance becomes less certain.

Lab 9 walks through this in detail with simulated and real data. For your CW2 report, always use HAC standard errors and discuss what changes.

Statistical vs Economic Significance

Statistical significance

  • t > 1.96 (or t > 3.0 after Harvey correction: Harvey, Liu & Zhu 2016 argued that with 300+ factors already tested, the conventional |t| > 1.96 threshold produces too many false positives — a multiple-testing penalty, not a stricter stats convention)
  • “Alpha is probably not zero”
  • Says nothing about magnitude

Economic significance

  • Is the alpha large enough to cover costs?
  • Typical round-trip cost drag for a monthly-rebalanced long-short factor: 0.5–1.5% annually (Detzel, Novy-Marx, and Velikov 2023)
  • After fees, is anything left?

An alpha that is statistically significant at 0.3% monthly (3.6% annual) sounds attractive. But if implementation costs 1.5%, net alpha is 2.1%. If the strategy requires monthly rebalancing, costs may be higher. Always ask: can this be captured in practice?

Part IV: The Factor Zoo

300+ Published Factors

Researchers have documented over 300 characteristics that predict stock returns (Harvey, Liu, and Zhu 2016):

  • Size, value, momentum, profitability, investment, quality, accruals, share issuance, asset growth, earnings surprise, analyst revisions, short interest, idiosyncratic volatility, …

The problem: with 300 tests and a 5% significance level, you expect 15 false discoveries — significant t-statistics that happen by chance rather than because the factor is real — assuming independent tests. Real tests are correlated, which can make this worse, not better.

Harvey, Liu, and Zhu (2016) argue: the standard t > 1.96 threshold is far too low for published factors. After accounting for multiple testing, you need t > 3.0 to be confident a factor is real.

Most published factors fail this higher bar.

Post-Publication Decay

McLean and Pontiff (2016) tested 97 published factors after their papers appeared:

In-sample (paper’s period)

  • Factors work as published
  • Average premium: strong

Out-of-sample (post-publication)

  • Premium declines by ~32% on average
  • Some factors disappear entirely

Two explanations:

  1. Data mining: the original result was partly luck (inflated by selection)
  2. Arbitrage: investors learned about the factor and traded it away

Both are probably at work. For your coursework, this means: always compare in-sample to out-of-sample performance.

Jensen, Kelly & Pedersen

Jensen, Kelly, and Pedersen (2024) systematically replicated factors across 93 countries:

  • 153 factors tested with consistent methodology
  • Standardised construction: same breakpoints, same weighting
  • Result: many factors replicate, but premia are smaller than originally published

Key finding: about half of published factors replicate reliably across countries. The other half are fragile, sample-specific, or sensitive to construction choices.

This is the dataset (JKP) that you use in labs and coursework. The construction choices embedded in JKP (value-weighting, capitalisation breakpoints) affect the factor returns you observe.

Which Factors Survive?

After multiple testing corrections, out-of-sample testing, and international replication:

Factor Survives? Evidence
MKT (market) Yes Robust across all samples
MOM (momentum) Yes Strong but volatile; crashes
RMW (profitability) Yes Robust, low turnover
HML (value) Weakened Indistinguishable from zero 2007–2020; partial recovery since
SMB (size) Fragile Sensitive to breakpoints and weighting — always ask how SMB was built
CMA (investment) Moderate Robust in US, weaker internationally

For your coursework: choose a factor and evaluate its strength honestly. A strong report will not claim the factor “works” or “doesn’t work” but will discuss the evidence with appropriate uncertainty.

Implications for Your Analysis

When you interpret factor results in Lab 9 or CW2, remember:

  1. Use HAC standard errors — OLS overstates significance
  2. Compare in-sample to out-of-sample — split your sample or use rolling windows
  3. Check multiple testing — a single significant result is not enough
  4. Report Sharpe ratios and economic magnitude — not just t-statistics
  5. Discuss construction choices — JKP’s methodology affects your numbers

Strong CW2 reports demonstrate critical awareness of these issues. Weak reports treat the data as ground truth.

Part V: Implementation

Transaction Costs

Detzel, Novy-Marx, and Velikov (2023) show that accounting for transaction costs can reverse the ranking of factor models:

Factor Turnover (annual) Estimated cost impact
MOM 100–200% (often higher in practice) Severe (can eliminate premium)
HML 20–40% Moderate
RMW 15–25% Mild
CMA 10–20% Mild

High-turnover factors (momentum) are the most vulnerable. A model that looks superior on gross returns may be inferior net of costs. Low-turnover factors (quality, value) fare better, but are not immune.

This is why Gray recommends accepting tracking error in exchange for lower costs.

Factor Crowding

When a factor strategy becomes popular, more capital chases the same stocks:

  1. Prices of “long” stocks are bid up → future returns decline
  2. Prices of “short” stocks are pushed down → future short returns decline
  3. The premium compresses

Smart beta ETFs have gathered $2.5 trillion globally (2024). This concentration creates systemic risk: when everyone holds the same factor tilt, a reversal can trigger forced selling across many portfolios simultaneously.

The value premium’s disappearance from 2007–2020 is consistent (though not proof) of crowding effects.

When Factors Fail

Momentum crash of 2009:

  • Momentum lost ~73% over roughly three months (March–May 2009) as beaten-down stocks surged in the recovery
  • Stocks that had been losing (the “short” side) rebounded violently
  • This is a systematic risk, not a data error

Value drought (2007–2020):

  • Value stocks underperformed growth by roughly 40 percentage points cumulative over the period
  • Tech dominance (FAANG) rewarded growth characteristics
  • Tested investor patience severely

Factors are not risk-free arbitrages. They can underperform for years. Investing in factors requires discipline, patience, and realistic expectations.

Honest Conclusions

What the evidence supports:

  1. Factors are real: systematic return differences associated with firm characteristics exist
  2. Premia are modest: Sharpe ratios of 0.2–0.4, not lottery tickets
  3. Not all survive: multiple testing, publication bias, and decay reduce the “zoo” to 5–7 robust factors
  4. Costs matter: transaction costs can eliminate premia, especially for high-turnover strategies
  5. Patience required: factors can underperform for years; discipline is essential

For your analysis: the goal is not to prove a factor “works” or “fails.” It is to evaluate the evidence honestly, with appropriate uncertainty, and to discuss what an investor would actually face.

Assessment Connection

CW2 Scaffold B (Tree-Based Factor Investing):

  • Uses JKP UK monthly factors
  • Walk-forward OLS, Ridge, tree-based models
  • SHAP feature importance
  • 2,500-word reflective report

What the report must cover:

Component What to discuss
Method rationale Why walk-forward? Why Ridge?
Data quality JKP construction, UK sample limits
Results OOS R², Sharpe, model comparison
Limitations Data mining, estimation risk, costs
Regulatory FCA guidance on model risk and suitability
Ethics Overfitting in production as a client-harm issue

Walk-forward, Ridge, and SHAP are previewed in Week 10 — don’t worry if the names are unfamiliar today.

Lab 9 Preview

Standard lab (60–90 min):

  • Part 1: Simulate autocorrelation, compare OLS vs HAC
  • Part 2: Alpha tests and economic interpretation
  • Part 3: Robustness (sample split, rolling windows)
  • Part 4: Transaction costs and selection bias

Advanced extension (optional, ~30 min):

  • Build a momentum factor from individual S&P 500 stock prices
  • Compare your DIY factor to JKP MOM
  • Discover how construction choices affect results
  • Directly relevant to CW2 “Data quality” section

Key Takeaways

  1. Factors are long-short portfolios built from firm characteristics; they represent systematic tilts, not stock picks

  2. Premia are real but modest — Sharpe ratios of 0.2–0.4, and many published factors fail replication

  3. Honest inference requires HAC standard errors — OLS understates uncertainty in autocorrelated data

  4. Costs can eliminate premia — especially for high-turnover strategies like momentum, even though momentum survives gross-of-cost tests

  5. Critical evaluation matters — your CW2 report should discuss limitations, not just results

Reading

Chapter: Chapter 9: Factor Investing

Essential reading:

  • Jensen, Kelly, and Pedersen (2024) — Is there a replication crisis in finance?
  • Fama and French (2015) — Five-factor model
  • Harvey, Liu, and Zhu (2016) — … and the cross-section of expected returns
  • McLean and Pontiff (2016) — Post-publication decay (directly relevant to CW2 out-of-sample discussion)

Practitioner:

  • Gray and Vogel (2016) — Quantitative Momentum (implementability)
  • Gray and Carlisle (2012) — Quantitative Value

Further:

  • Detzel, Novy-Marx, and Velikov (2023) — Transaction costs and factor models

Next Week: Backtesting

Week 10 moves from factor models to backtesting and production ML:

  • How to test trading strategies without cheating
  • Combinatorial purged cross-validation
  • Probability of backtest overfitting
  • From research prototype to production system

Factor models provide the benchmark: machine-learning strategies in Week 10 are evaluated against the factor framework you learned today.

Questions?

📧 b.quinn1@ulster.ac.uk 🏢 Office hours by appointment

Resources:

References

Detzel, Andrew, Robert Novy-Marx, and Mihail Velikov. 2023. “Model Comparison with Transaction Costs.” Journal of Finance 78 (3): 1743–75. https://doi.org/10.1111/jofi.13225.
Fama, Eugene F., and Kenneth R. French. 1992. “The Cross-Section of Expected Stock Returns.” Journal of Finance 47 (2): 427–65. https://doi.org/10.1111/j.1540-6261.1992.tb04398.x.
———. 1993. “Common Risk Factors in the Returns on Stocks and Bonds.” Journal of Financial Economics 33 (1): 3–56. https://doi.org/10.1016/0304-405X(93)90023-5.
———. 2010. “Luck Versus Skill in the Cross-Section of Mutual Fund Returns.” Journal of Finance 65 (5): 1915–47. https://doi.org/10.1111/j.1540-6261.2010.01598.x.
———. 2015. “A Five-Factor Asset Pricing Model.” Journal of Financial Economics 116 (1): 1–22. https://doi.org/10.1016/j.jfineco.2014.10.010.
Gray, Wesley R., and Tobias E. Carlisle. 2012. Quantitative Value: A Practitioner’s Guide to Automating Intelligent Investment and Eliminating Behavioral Errors. Wiley.
Gray, Wesley R., and Jack R. Vogel. 2016. Quantitative Momentum: A Practitioner’s Guide to Building a Momentum-Based Stock Selection System. Wiley.
Harvey, Campbell R., Yan Liu, and Heqing Zhu. 2016“... And the Cross-Section of Expected Returns.” Review of Financial Studies 29 (1): 5–68. https://doi.org/10.1093/rfs/hhv059.
Jegadeesh, Narasimhan, and Sheridan Titman. 1993. “Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency.” Journal of Finance 48 (1): 65–91. https://doi.org/10.1111/j.1540-6261.1993.tb04702.x.
Jensen, Theis I., Bryan T. Kelly, and Lasse Heje Pedersen. 2024. “Is There a Replication Crisis in Finance?” Journal of Finance. https://doi.org/10.1111/jofi.13249.
McLean, R. David, and Jeffrey Pontiff. 2016. “Does Academic Research Destroy Stock Return Predictability?” Journal of Finance 71 (1): 5–32. https://doi.org/10.1111/jofi.12365.
Newey, Whitney K., and Kenneth D. West. 1987. “A Simple, Positive Semi-Definite, Heteroskedasticity and Autocorrelation Consistent Covariance Matrix.” Econometrica 55 (3): 703–8. https://doi.org/10.2307/1913610.