Week 5: Robo-Advisors & Portfolio Optimisation

Learning Objectives

Contrast traditional advisory and robo‑advisory models, including the economics of access and cost
Implement and interpret portfolio optimisation using Modern Portfolio Theory
Explain why MPT breaks down at scale and in the presence of real-world constraints
Describe how evolutionary algorithms extend portfolio optimisation to large-scale, multi-objective problems
Evaluate recent frontier research connecting algorithmic innovation to robo-advisory applications

🎯 Opening frame: “Welcome to Week 5. We start with why robo-advisers exist, build to how they work algorithmically, then follow the research thread to where the frontier sits today.”
❓ Big question: “What happens when you scale a beautiful 1952 theory to 2025 reality: thousands of securities, real-world constraints, and limited data on new assets?”
🔗 Connect to Week 1: Fee compression and the persistence puzzle. Robo-advisers are a key driver of that compression.
🔗 Connect to Week 4: GARCH volatility estimates feed directly into the covariance matrix that portfolio optimisers need.
⏱️ Timing: Parts I–II are economics and MPT; Parts III–IV are where theory breaks and research responds; Part V is Python.
⚠️ Assessment: CW1 (Presentation) is due next week; robo-adviser backtests are one of the four topic options.

Agenda

Part I : Economics of traditional vs. robo-advisory services
Part II : Technology and algorithms: Modern Portfolio Theory in practice
Part III : When theory meets reality: the hard problems in portfolio construction
Part IV : The algorithmic revolution: from classical methods to frontier research
Part V : Python implementation and directed learning

Part I : Economics of Traditional vs. Robo-Advisory

Traditional Wealth Management: The Service

What financial advisers provide:

Portfolio management: asset allocation, security selection, rebalancing
Financial planning: retirement, tax, estate planning advice
Relationship management: ongoing consultations, market updates
Bespoke service tailored to individual circumstances

High-touch, human-centred model built on trust and expertise

Traditional model foundations: Wealth management evolved as a bespoke, high-touch service for affluent clients. Think private banks, boutique advisers, independent RIAs (Registered Investment Advisers).
Portfolio management: Advisers construct diversified portfolios, select individual securities or funds, monitor performance, and rebalance periodically. This requires market knowledge, analytical tools, and time.
Financial planning: Beyond investment management, advisers provide holistic planning: retirement income projections, tax optimisation strategies, estate planning coordination with lawyers/accountants, insurance needs analysis. This is the value-added service that justifies fees.
Relationship management: Advisers meet clients quarterly or semi-annually, provide market commentary, manage behavioural issues (preventing panic selling in downturns), adjust plans as life circumstances change (marriage, children, job loss, inheritance).
Human element: The model relies on trust, empathy, and communication skills. Clients value having “someone to call” when markets crash or life events occur. This human capital is expensive.
Bespoke nature: Each client gets a customised plan. No two portfolios are identical. Advisers account for unique constraints: concentrated stock positions, ESG preferences, tax situations, liquidity needs. This customisation requires time and expertise.
Historical context: This model dates back to the 1970s-1980s with the rise of fee-based (vs. commission-based) advisers. It replaced brokerage-driven models and became the gold standard for affluent clients.
Pedagogical point: Before discussing costs, establish what the service entails. Students should understand why this model is expensive: it’s labour-intensive, requires high skills, and doesn’t scale easily.
Engagement: “Have any of your families used a financial adviser? What did they value most about the service?” (Likely: peace of mind, behavioural coaching, holistic planning: not just returns.)
Transition: “This service model has an economic structure that creates exclusion. Let’s see why.”

Traditional Model: Cost Components

Why is advice expensive?

Human Capital: High salaries for credentialed staff (CFA, CFP)
Technology: Expensive per-seat licenses (Bloomberg, Aladdin)
Compliance: Regulatory filings, audits, liability insurance
Overhead: Physical offices, support staff

Hilpisch Insight (Hilpisch 2019):
“Investments of $25-36 million just for derivatives analytics libraries”: high fixed costs favour large scale.

Traditional Model: The Capacity Constraint

The Binding Constraint: Time

Max Capacity: 100–150 relationships per adviser
Service requirements: Quarterly meetings, bespoke planning, emotional support
Scaling: To double clients, you must double advisers (Linear scaling)

Unit Economics:
Adviser Cost (£100k) / 100 Clients = £1,000 per client cost
Before overheads or profit.

The Access Barrier: Economics of Exclusion

Why Minimums Exist

Break-even Point: ~£100k assets (at 1.5% fee)
Profit Margin: Firms need 2-3x break-even to be viable
Result: Minimums set at $250,000 – $500,000
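
The break-even logic is simple division: the assets a client must bring for the annual fee to cover the cost of serving them. A minimal sketch, using the £1,000 adviser cost per client from the unit economics and a hypothetical 50% overhead loading (the overhead figure is an illustrative assumption, not taken from the slides):

```python
def break_even_aum(cost_per_client, fee_rate):
    """Assets under management at which the annual fee just covers cost."""
    return cost_per_client / fee_rate

adviser_cost = 1_000        # £ per client, before overheads (from the unit economics)
overhead_loading = 0.50     # ASSUMPTION: overheads add 50% on top of adviser cost
fee_rate = 0.015            # 1.5% of AUM per year

bare = break_even_aum(adviser_cost, fee_rate)
loaded = break_even_aum(adviser_cost * (1 + overhead_loading), fee_rate)

print(f"Break-even, adviser cost only: £{bare:,.0f}")
print(f"Break-even, with overheads:    £{loaded:,.0f}")
print(f"Viable minimum (2-3x):         £{2 * loaded:,.0f} to £{3 * loaded:,.0f}")
```

Under that assumed loading the break-even lands at about £100k, in line with the figure quoted above; with adviser cost alone it would be closer to £67k.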

Household Finance Reality (Campbell 2006):
The vast majority of households fall below this threshold.
Median UK financial wealth: ~£15k.

The Access Barrier: Who is Left Out?

The “Advice Gap”

Young Professionals: High income, low assets (building wealth)
Middle Class: £20k–£150k savings (too small for advisers, too complex for DIY)
Mass Market: <£20k savings (relies on cash/deposits)

Consequences:

Poor diversification
High fees in mutual funds
Behavioural errors (panic selling)
Exacerbated wealth inequality

Robo-Advisory: Automating the Value Chain

Task	Traditional	Robo-Adviser
Risk Assessment	1-hour interview	5-min questionnaire
Allocation	Manual / spreadsheet	Mean-Variance Optimisation
Rebalancing	Quarterly (manual)	Daily (automated drift check)
Tax Harvesting	Rare (too complex)	Continuous (algorithmic)

Why Automation Raises the Stakes

Tax-loss harvesting at scale
Algorithms scan every holding daily, sell losers, and immediately buy a correlated substitute to capture the tax benefit without disrupting the portfolio. A human adviser, limited to 100–150 client relationships, cannot do this for every holding every day; a robo-adviser can do it for 500,000 clients at near-zero marginal cost.
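
The daily scan can be sketched in a few lines. This is an illustrative toy, not any platform's actual logic: the tickers, prices, substitute pairs, and the 5% loss threshold are all hypothetical, and real implementations must also respect tax rules on repurchasing similar securities, which are ignored here.

```python
from dataclasses import dataclass

@dataclass
class Holding:
    ticker: str
    units: float
    cost_basis: float   # purchase price per unit
    price: float        # current price per unit

# HYPOTHETICAL substitute map: correlated but not identical funds
SUBSTITUTES = {"EQ_FUND_A": "EQ_FUND_B", "BOND_FUND_A": "BOND_FUND_B"}

def harvest_candidates(holdings, min_loss_pct=0.05):
    """Return (sell, buy, realised_loss) for holdings down more than min_loss_pct."""
    trades = []
    for h in holdings:
        loss_pct = (h.cost_basis - h.price) / h.cost_basis
        if loss_pct > min_loss_pct and h.ticker in SUBSTITUTES:
            realised_loss = (h.cost_basis - h.price) * h.units
            trades.append((h.ticker, SUBSTITUTES[h.ticker], realised_loss))
    return trades

portfolio = [
    Holding("EQ_FUND_A", 100, cost_basis=50.0, price=44.0),    # down 12%
    Holding("BOND_FUND_A", 200, cost_basis=25.0, price=24.5),  # down 2%
]
trades = harvest_candidates(portfolio)
for sell, buy, loss in trades:
    print(f"Sell {sell}, buy {buy}, realised loss £{loss:,.0f}")
```

Only the equity fund crosses the threshold, so only that position is swapped; the bond fund is left alone.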

Millisecond rebalancing
Drift from target allocation is corrected continuously, not quarterly. The portfolio always reflects the optimised weights rather than drifting away through price movements.

The catch

Both advantages depend entirely on the quality of those target weights. If the underlying optimisation is unstable (producing extreme, erratic allocations), daily automated rebalancing amplifies the problem rather than solving it. This is the central challenge we examine for the rest of today.

Robo-Advisory: Platform Scalability

Software Economics (Zero Marginal Cost)

Infrastructure: Cloud-based (AWS), API-driven trade execution
Capacity: 1 million clients served as easily as 1,000
Cost Curve: High fixed cost (dev), near-zero marginal cost

Comparison to Platforms (Week 5):
Robo-advisers have weak network effects (my return doesn’t depend on you) but extreme scale economies.

Robo-Advisory: Fee Compression

The New Price Point

Traditional: 1.0% – 1.5% AUM
Robo-Adviser: 0.15% – 0.35% AUM
Savings: ~80% reduction

Access for All: Minimums drop from $250,000 to $0 – $500.

Robo-Advisory: The Power of Compounding

Impact of Fees on £100k Portfolio (30 Years, 6% Return)

Fee Model	Annual Fee	Final Wealth	Wealth Lost
Traditional (1.5%)	£1,500+	£328,000	£246,000
Robo (0.25%)	£250+	£505,000	£69,000

Result:
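The traditional client ends up with about 35% less wealth than the robo client (£328,000 vs £505,000), and gives up about 43% of the no-fee outcome (£574,000) to fees.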
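
The compounding arithmetic can be sketched as follows, assuming the fee simply reduces the 6% gross return each year and there are no contributions or withdrawals. With only the headline fee applied, the fee-bearing outcomes come out higher than those in the table, whose fee column is marked "+" and so presumably includes costs beyond the headline rate. The zero-fee benchmark of about £574,000 does match the sum of "Final Wealth" and "Wealth Lost" in each row.

```python
def final_wealth(initial, gross_return, annual_fee, years):
    """Compound at (gross_return - annual_fee) for `years`, with no cash flows."""
    return initial * (1 + gross_return - annual_fee) ** years

initial, gross, years = 100_000, 0.06, 30
no_fee = final_wealth(initial, gross, 0.0, years)

scenarios = [("No fee", 0.0), ("Robo (0.25%)", 0.0025), ("Traditional (1.5%)", 0.015)]
for label, fee in scenarios:
    wealth = final_wealth(initial, gross, fee, years)
    print(f"{label:<20} final £{wealth:>9,.0f}   lost to fees £{no_fee - wealth:>9,.0f}")
```

The function is deliberately simple; changing `annual_fee` to an all-in cost (platform fee plus fund charges) is the natural next experiment.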

Cost Comparison: UK Market (2024)

Provider	Type	Annual fee	Minimum
Vanguard Personal Pension	Robo / passive	0.15%	£500
Nutmeg	Robo / managed	0.45–0.75%	£500
Wealthify	Robo / managed	0.60%	£1
St James’s Place	Traditional IFA	~1.5–2.0%	£20,000+
Typical IFA (independent)	Traditional	1.0–1.5% + initial 1–3%	£50,000–£250,000

Sources: provider websites, FCA Retail Investments Data (2024), Reher and Sokolinski (2024).

The structural point is straightforward: robo fees sit at 0.15–0.75%, traditional fees at 1.0–2.0%, with traditional minimums of £20,000 to £250,000 that exclude most households. Fee differences of 1–1.5 percentage points compound dramatically over a 30-year accumulation horizon.

Philippon’s Puzzle: the Robo Exception

Finance unit costs: stuck at ~2% for 130 years (Philippon 2016)

Why didn’t technology reduce costs, as in every other industry?

Labour per client cannot be easily automated away
Regulatory barriers limit new entrants
Consumers can’t observe quality → price competition is weak

Robo-advisers break the mechanism

Replacing human labour per client with software that scales at near-zero marginal cost is a structural change, not just cheaper service.

Access Expansion or Two-Tier System?

What the evidence shows (Reher and Sokolinski 2024)

15–25% of new robo-adviser clients were previously unadvised: genuine access expansion, not substitution
Largest gains for households with £10k–£100k in investable assets, precisely the segment excluded by traditional IFAs
Diversification and asset allocation improve substantially for new entrants

What automation cannot replace

Behavioural coaching during market panics (the adviser who stops you selling in March 2020)
Holistic planning: debt, pension, property, insurance, tax together
Adaptation to genuinely complex life situations

The open question

Does this create better access to a good product? Or algorithmic advice for those who cannot afford human advice, whilst the wealthy retain personalised service? The evidence on welfare is the subject of Part III.

Part II : Technology and Algorithms

Modern Portfolio Theory

Harry Markowitz (1952): Investors care about return and risk; optimal portfolios balance these.

Key insights:

Diversification reduces risk: Combining assets that are less than perfectly correlated lowers portfolio volatility without sacrificing return
Efficient frontier: Set of portfolios with highest return for each risk level
Optimal portfolio: Depends on investor risk aversion (risk-return trade-off)

Robo-adviser application: Automate Markowitz optimisation using client risk tolerance, expected returns, and covariance matrix.

🎯 Markowitz foundation: 1952 paper “Portfolio Selection”; Nobel Prize 1990. Key idea: don’t just look at individual returns; look at how assets move together.
⏱️ Math intuition: Portfolio return = weighted average. Portfolio risk = function of covariances. If assets are less than perfectly correlated, portfolio risk < weighted-average asset risk. This is diversification.
🔗 Efficient frontier: Plot all portfolios in return-risk space. The upper edge = efficient portfolios. Any portfolio below the frontier is dominated.
❓ Investor choice: Where on the frontier? Depends on risk aversion. Risk-averse → left (low risk). Risk-tolerant → right (high return).
🎯 Robo-adviser automation: (1) Assess risk tolerance, (2) Estimate returns/covariances, (3) Solve optimisation, (4) Execute trades.
⚠️ Limitations: MPT assumes normal returns (false: fat tails). Assumes static parameters (returns/covariances change). Robo-advisers inherit these limits.

The Risk-Return Trade-Off

The most fundamental idea in all of finance: you cannot expect higher returns without accepting higher risk. This single principle underpins the Capital Asset Pricing Model, the Efficient Market Hypothesis, and every portfolio construction method we study.

So why not simply pick the highest-returning asset?

A fund returning 15% with 40% volatility is worse, on a risk-adjusted basis, than one returning 10% with 8% volatility. You are taking on far more uncertainty per pound of expected gain. Return alone is not the right objective.

The Sharpe Ratio (Sharpe 1994)

\[\text{SR} = \frac{\bar{r} - r_f}{\sigma}\]

$\bar{r}$ = portfolio return, $r_f$ = risk-free rate (e.g. UK gilts), $\sigma$ = annualised volatility.

SR	Interpretation
< 0	Worse than holding cash
0 – 0.5	Modest: typical of passive equity over long horizons
0.5 – 1.0	Good: competitive with a diversified index fund
> 1.0	Excellent: rare and likely fragile out-of-sample
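
Applied to the two funds compared earlier (15% return with 40% volatility versus 10% return with 8% volatility), the formula makes the gap explicit: roughly 0.3 against 1.0. The 2% risk-free rate below is an assumption for illustration.

```python
def sharpe_ratio(mean_return, risk_free_rate, volatility):
    """Excess return per unit of volatility (all inputs annualised)."""
    return (mean_return - risk_free_rate) / volatility

rf = 0.02  # ASSUMPTION: illustrative risk-free rate

fund_a = sharpe_ratio(0.15, rf, 0.40)   # 15% return, 40% volatility
fund_b = sharpe_ratio(0.10, rf, 0.08)   # 10% return,  8% volatility

print(f"Fund A Sharpe: {fund_a:.3f}")
print(f"Fund B Sharpe: {fund_b:.3f}")
```

The higher-returning fund earns about a third as much excess return per unit of risk.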

Portfolio Optimisation: The Problem

Three inputs. One output.

Symbol	Meaning	Source
$w$	Weight vector: how much to allocate to each asset (what we choose)	Optimiser output
$\mu$	Expected return vector: one forecast per asset	Historical data / model
$\Sigma$	Covariance matrix: how assets move together	Historical data
$r_f$	Risk-free rate (e.g. UK gilt yield)	Market

\[\max_{w} \; \frac{w^\top \mu \; - \; r_f}{\sqrt{w^\top \Sigma \, w}}\]

Numerator $w^\top \mu - r_f$: weighted average return above the risk-free rate, i.e. $w_1\mu_1 + w_2\mu_2 + \cdots - r_f$
Denominator $\sqrt{w^\top \Sigma w}$: portfolio volatility. $\Sigma$ is the covariance matrix capturing how assets move together. Whenever assets are less than perfectly correlated, this is smaller than the weighted average of individual volatilities. That reduction is diversification.

Subject to: weights sum to 1, no short selling ($w_i \geq 0$), no single position above 40%.

Portfolio Return: The Easy Part

Portfolio return is a weighted average, nothing more:

\[r_p = w_1\mu_1 + w_2\mu_2 + w_3\mu_3 = w^\top \mu\]

Concrete example: 40% in Equities (8%), 30% in Bonds (4%), 30% in Real Estate (6%):

\[r_p = 0.40 \times 0.08 + 0.30 \times 0.04 + 0.30 \times 0.06 = 6.2\%\]

Show code

def portfolio_return(weights, expected_returns):
    return np.dot(weights, expected_returns)   # w'mu

Portfolio Volatility: Where Diversification Lives

Portfolio volatility is not a weighted average of individual volatilities. For three assets it expands to:

\[\sigma_p = \sqrt{w_1^2\sigma_1^2 + w_2^2\sigma_2^2 + w_3^2\sigma_3^2 + 2w_1w_2\text{Cov}_{12} + 2w_1w_3\text{Cov}_{13} + 2w_2w_3\text{Cov}_{23}}\]

The cross-terms ($2w_iw_j\text{Cov}_{ij}$) are what matter. When stocks and bonds have low or negative covariance, holding both shrinks the portfolio’s volatility below the weighted average of the individual volatilities. That is diversification.

Show code

def portfolio_volatility(weights, cov_matrix):
    return np.sqrt(weights.T @ cov_matrix @ weights)   # sqrt(w'Sigma w)

The matrix multiplication $w^\top\Sigma w$ computes all $N^2$ variance and covariance terms in one operation, however many assets you have.
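
To see the diversification effect numerically, the sketch below compares portfolio volatility with the weighted average of individual volatilities for a 40/30/30 allocation. The covariance matrix is the one from this lecture's 3-asset example; pairing it with these weights is an illustrative choice.

```python
import numpy as np

weights = np.array([0.40, 0.30, 0.30])            # Stocks, Bonds, Real Estate
cov_matrix = np.array([[0.040, 0.010, 0.020],
                       [0.010, 0.010, 0.005],
                       [0.020, 0.005, 0.030]])

asset_vols = np.sqrt(np.diag(cov_matrix))          # individual volatilities
weighted_avg_vol = weights @ asset_vols             # volatility if all correlations were 1
portfolio_vol = np.sqrt(weights @ cov_matrix @ weights)

print(f"Weighted-average volatility: {weighted_avg_vol:.1%}")
print(f"Portfolio volatility:        {portfolio_vol:.1%}")
```

The portfolio's volatility (about 13.5%) sits below the weighted average (about 16.2%); the gap is entirely due to correlations below one.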

Objective Function: A Coding Trick

scipy.optimize.minimize only minimises. To maximise the Sharpe ratio, we flip the sign:

Show code

def negative_sharpe(weights, expected_returns, cov_matrix, risk_free_rate):
    ret = portfolio_return(weights, expected_returns)
    vol = portfolio_volatility(weights, cov_matrix)
    return -(ret - risk_free_rate) / vol   # minimise this = maximise Sharpe

The solver finds the minimum of $-\text{SR}$, which is identical to finding the maximum of $\text{SR}$.

Constraints: Keeping the Solution Sensible

Left unconstrained, the solver will make extreme bets, sometimes putting 80% in one asset. Three rules, passed to the solver as one equality constraint and a set of bounds, prevent that:

Show code

constraints = {'type': 'eq', 'fun': lambda w: np.sum(w) - 1}  # fully invested
bounds      = [(0, 0.40) for _ in range(n_assets)]            # no shorts, max 40%

Constraint	What it enforces	Why it matters
Weights sum to 1	Every pound is allocated	No cash drag or leverage
$w_i \geq 0$	No short selling	Retail investors cannot short
$w_i \leq 0.40$	No single position above 40%	Regulators and common sense

3-Asset Example: Inputs

Expected returns and volatilities

Asset	Return	Volatility
Stocks	8%	20%
Bonds	4%	10%
Real Estate	6%	17.3%

Show code

exp_returns = np.array([0.08, 0.04, 0.06])

Covariance matrix

Show code

cov_matrix = np.array([
    [0.04,  0.010, 0.020],
    [0.010, 0.01,  0.005],
    [0.020, 0.005, 0.030]
])

Diagonal = variance ($\sigma^2$).

Off-diagonal = covariance. This is the key to diversification.

Convert to correlation: $\rho_{12} = \frac{\text{Cov}_{12}}{\sigma_1 \sigma_2}$

Stocks-Bonds: $\frac{0.010}{0.20 \times 0.10} = 0.50$

Bonds-RE: $\frac{0.005}{0.10 \times 0.173} = 0.29$ (better for diversification)
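
The same conversion can be applied to every pair at once by dividing the covariance matrix by the outer product of the volatilities. A minimal sketch using the matrix above:

```python
import numpy as np

cov_matrix = np.array([[0.040, 0.010, 0.020],
                       [0.010, 0.010, 0.005],
                       [0.020, 0.005, 0.030]])

vols = np.sqrt(np.diag(cov_matrix))               # approx. [0.20, 0.10, 0.173]
corr_matrix = cov_matrix / np.outer(vols, vols)   # Cov_ij / (sigma_i * sigma_j)

labels = ["Stocks", "Bonds", "Real Estate"]
for i in range(3):
    for j in range(i + 1, 3):
        print(f"{labels[i]}-{labels[j]}: {corr_matrix[i, j]:.2f}")
```

This also gives the pair not shown above: stocks and real estate have a correlation of about 0.58, the highest of the three.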

Optimisation Results

Show full code

import numpy as np
from scipy.optimize import minimize

def portfolio_return(weights, expected_returns):
    return np.dot(weights, expected_returns)

def portfolio_volatility(weights, cov_matrix):
    return np.sqrt(np.dot(weights.T, np.dot(cov_matrix, weights)))

def negative_sharpe(weights):
    ret = portfolio_return(weights, exp_returns)
    vol = portfolio_volatility(weights, cov_matrix)
    return -(ret - risk_free_rate) / vol if vol > 0 else np.inf

exp_returns    = np.array([0.08, 0.04, 0.06])
cov_matrix     = np.array([[0.04, 0.010, 0.020],
                            [0.010, 0.01, 0.005],
                            [0.020, 0.005, 0.030]])
risk_free_rate = 0.02
n_assets       = len(exp_returns)
constraints    = {'type': 'eq', 'fun': lambda w: np.sum(w) - 1}
bounds         = [(0, 0.4)] * n_assets

result  = minimize(negative_sharpe, np.ones(n_assets)/n_assets,
                   method='SLSQP', bounds=bounds, constraints=constraints)
weights = result.x
ret     = portfolio_return(weights, exp_returns)
vol     = portfolio_volatility(weights, cov_matrix)
sharpe  = (ret - risk_free_rate) / vol

print(f"Stocks:       {weights[0]:.1%}")
print(f"Bonds:        {weights[1]:.1%}")
print(f"Real Estate:  {weights[2]:.1%}")
print(f"Return:  {ret:.2%}  |  Vol: {vol:.2%}  |  Sharpe: {sharpe:.2f}")

Stocks:       40.0%
Bonds:        35.5%
Real Estate:  24.5%
Return:  6.09%  |  Vol: 13.07%  |  Sharpe: 0.31

Why not 100% stocks? Stocks alone: 8% return, 20% vol, Sharpe = $(8\%-2\%)/20\% = 0.30$. The optimal portfolio gives up about 1.9pp of return but cuts volatility from 20% to about 13%, producing a slightly higher Sharpe (0.31). That is the entire point of diversification.

Note

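❓ Ask: “The 40% cap is binding on stocks but not on bonds or RE. What does that tell you about what the solver wants to do?”

The solver would put more than 40% in stocks if allowed (about 48% without the cap). Binding means the constraint is actively holding it back from a more concentrated allocation. Bonds (35.5%) and real estate (24.5%) sit below the cap, so it has no effect on them.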
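
One way to check which cap is actually binding is to compute the maximum-Sharpe portfolio without the 40% limit. When only the budget constraint applies, the weights are proportional to $\Sigma^{-1}(\mu - r_f)$. The sketch below uses that closed form (a standard textbook result, not code from the lecture); it remains valid under the no-short-selling rule here because all the resulting weights are positive.

```python
import numpy as np

exp_returns = np.array([0.08, 0.04, 0.06])        # Stocks, Bonds, Real Estate
cov_matrix = np.array([[0.040, 0.010, 0.020],
                       [0.010, 0.010, 0.005],
                       [0.020, 0.005, 0.030]])
risk_free_rate = 0.02

# Solve Sigma z = (mu - rf), then rescale so the weights sum to 1
z = np.linalg.solve(cov_matrix, exp_returns - risk_free_rate)
w_uncapped = z / z.sum()

for name, w in zip(["Stocks", "Bonds", "Real Estate"], w_uncapped):
    flag = "  <- above the 40% cap" if w > 0.40 else ""
    print(f"{name:<12} {w:.1%}{flag}")
```

Only stocks exceed 40% in this calculation (roughly 48/30/22), so only the stock cap binds in the constrained solution.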

What Is the Efficient Frontier?

We just found the best portfolio for a Sharpe-seeking investor. But what about a client who is 52, not 22?

The efficient frontier answers: for every level of volatility a client will accept, what is the highest return achievable? It is the full menu of optimal portfolios, not just one.

A portfolio on the frontier cannot be improved without accepting more risk
A portfolio below the frontier is dominated: diversification can do better at the same volatility

The robo-adviser questionnaire doesn’t find a different portfolio for each client. It finds a different point on the same curve.

The Capital Allocation Line

Once we identify the frontier, one portfolio stands out: the tangency portfolio, the point where a line drawn from the risk-free rate just touches the curve.

That line is the Capital Allocation Line (CAL). Its slope is the Sharpe ratio of the tangency portfolio: the steepest achievable.

Every investor should hold the tangency portfolio as their risky allocation, then adjust overall risk by mixing it with the risk-free asset:

Conservative client: 30% tangency portfolio + 70% gilts
Moderate client: 70% tangency portfolio + 30% gilts
Risk-tolerant client: 100% tangency portfolio (most robo-advisers stop here)

This is why robo-advisers produce the same underlying portfolio for all clients, then scale exposure up or down, not a different portfolio for each risk profile.
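
The mixing arithmetic is linear: with a fraction $y$ in the risky portfolio and $1-y$ in the risk-free asset, expected return is $r_f + y(r_T - r_f)$ and volatility is $y\,\sigma_T$. A minimal sketch, using the return and volatility of the optimised 3-asset portfolio from earlier as stand-ins for the tangency portfolio (an illustrative assumption):

```python
risk_free_rate = 0.02
tangency_return, tangency_vol = 0.0609, 0.1307    # ASSUMPTION: stand-in values

def complete_portfolio(y):
    """Return (expected return, volatility) for a fraction y in the risky portfolio."""
    ret = risk_free_rate + y * (tangency_return - risk_free_rate)
    vol = y * tangency_vol
    return ret, vol

results = {}
for label, y in [("Conservative", 0.30), ("Moderate", 0.70), ("Risk-tolerant", 1.00)]:
    ret, vol = complete_portfolio(y)
    results[label] = (ret - risk_free_rate) / vol  # Sharpe ratio of the mix
    print(f"{label:<14} y={y:.0%}  return={ret:.2%}  vol={vol:.2%}  Sharpe={results[label]:.2f}")
```

Every mix has the same Sharpe ratio; only the level of risk changes, which is the sense in which all clients hold the same risky portfolio.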

Efficient Frontier Visualisation

Show code

import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import minimize   # used inside generate_frontier

# Generate efficient frontier
def generate_frontier(expected_returns, cov_matrix, risk_free_rate=0.02, n_points=50):
    frontier_returns, frontier_vols, frontier_sharpes = [], [], []
    
    # Range of target returns
    min_ret = expected_returns.min()
    max_ret = expected_returns.max()
    target_returns = np.linspace(min_ret, max_ret, n_points)
    
    for target in target_returns:
        # Minimise volatility subject to target return
        n_assets = len(expected_returns)
        
        def portfolio_volatility(weights):
            return np.sqrt(np.dot(weights.T, np.dot(cov_matrix, weights)))
        
        constraints = [
            {'type': 'eq', 'fun': lambda x: np.sum(x) - 1},
            {'type': 'eq', 'fun': lambda x: np.dot(x, expected_returns) - target}
        ]
        bounds = [(0, 1) for _ in range(n_assets)]
        initial = np.array([1/n_assets] * n_assets)
        
        result = minimize(portfolio_volatility, initial, method='SLSQP',
                          bounds=bounds, constraints=constraints)
        
        if result.success:
            vol = portfolio_volatility(result.x)
            frontier_returns.append(target)
            frontier_vols.append(vol)
            frontier_sharpes.append((target - risk_free_rate) / vol if vol > 0 else 0)
    
    return np.array(frontier_vols), np.array(frontier_returns), np.array(frontier_sharpes)

# Generate and plot
vols, rets, sharpes = generate_frontier(exp_returns, cov_matrix)

# Find optimal portfolio (max Sharpe ratio)
optimal_idx = np.argmax(sharpes)
optimal_vol = vols[optimal_idx]
optimal_ret = rets[optimal_idx]
optimal_sharpe = sharpes[optimal_idx]

plt.figure(figsize=(10, 6))
scatter = plt.scatter(vols*100, rets*100, c=sharpes, cmap='viridis', s=50)
plt.colorbar(scatter, label='Sharpe Ratio')

# Add tangent line from risk-free rate to optimal portfolio
risk_free_rate = 0.02
# Extend line beyond optimal portfolio for visualization
max_vol = vols.max() * 1.1
plt.plot([0, max_vol*100], 
         [risk_free_rate*100, risk_free_rate*100 + optimal_sharpe * max_vol*100],
         'r--', linewidth=2, label='Capital Allocation Line')

# Highlight optimal portfolio
plt.scatter([optimal_vol*100], [optimal_ret*100], 
            color='red', s=200, marker='*', 
            label=f'Optimal Portfolio (Sharpe={optimal_sharpe:.2f})', zorder=5)

plt.xlabel('Portfolio Volatility (% per year)', fontsize=12)
plt.ylabel('Expected Return (% per year)', fontsize=12)
plt.title('Efficient Frontier: Risk-Return Trade-off', fontsize=13, pad=15)
plt.legend(loc='lower right')
plt.grid(alpha=0.3, linestyle=':')
plt.tight_layout()
plt.show()

Visualisation overview: This plot shows all possible efficient portfolios using our three assets. Each point represents a different portfolio allocation.
Axes explanation: X-axis is portfolio volatility (risk measured as standard deviation), Y-axis is expected return. Both expressed as percentages per year.
Color coding: The viridis colormap shows Sharpe ratio: lighter yellow indicates higher risk-adjusted returns, darker purple indicates lower risk-adjusted returns.
Capital Allocation Line (red dashed): The tangent line from the risk-free rate (0% volatility, 2% return) to the efficient frontier. The point where it touches the frontier is the optimal (maximum Sharpe ratio) portfolio, marked with a red star.
Optimal portfolio interpretation: This is the single best risky portfolio: it offers the steepest slope (highest risk-adjusted return). All investors should hold this portfolio, then adjust their overall risk by mixing it with the risk-free asset or using leverage.
What to emphasise: The upward curve shows the fundamental risk-return trade-off. The tangent line shows there’s one optimal portfolio that dominates all others in risk-adjusted terms.
Setup for next slides: “Now let’s interpret what this frontier tells us mathematically, understand the optimal portfolio, and explore how robo-advisers use it in practice.”

Reading the Efficient Frontier

What the curve tells us:

Each point = a different portfolio allocation
Curve slopes upward: higher return requires higher risk
Left side is steep: diversification is powerful here
Right side flattens: diminishing returns to risk-taking
Color: lighter yellow = higher Sharpe ratio

Red star = optimal portfolio (max Sharpe)

Visualisation basics: X-axis is portfolio volatility (standard deviation), Y-axis is expected return. Each point represents a feasible portfolio that minimises risk for its return level.
Upward slope: The fundamental risk-return trade-off. No free lunch: you cannot get higher returns without accepting higher volatility. This comes directly from the covariance structure of assets.
Changing slope interpretation: The slope (change in return / change in volatility) tells us how much extra return we get per unit of additional risk. The slope changes because of how covariances interact in the portfolio volatility formula: $\sigma_p = \sqrt{w^\top \Sigma w}$.
Left side steep: Starting from low-risk portfolios (heavy in bonds), adding small amounts of higher-return assets (stocks, real estate) boosts returns significantly. The covariances are modest, so diversification is powerful: combining assets with low correlation reduces overall portfolio risk.
Right side flat: When already holding high-return assets, further increases in expected return require concentrating even more in risky assets. Diversification benefits are exhausted; we’re just adding volatility. Eventually we’re 100% in the highest-return asset.
Color gradient: The viridis colormap shows Sharpe ratio across the frontier. Lighter yellow indicates higher risk-adjusted returns. Notice the best Sharpe ratios are in the middle portion of the frontier, not at the extremes.
Pedagogical point: This visualisation makes abstract portfolio theory concrete. Students can see that optimal portfolios balance return gains against risk costs: it’s not just about maximising return.
Engagement: “Why can’t we just invest 100% in the highest-return asset?” (Because volatility matters; we care about risk-adjusted returns, not just raw returns.)

The Optimal Portfolio

Finding the best risk-adjusted return:

Tangency portfolio: where Capital Allocation Line (red dashed) touches the frontier
Maximises Sharpe ratio: (Return - Rf) / Volatility
Red star marks the optimal point
All investors should hold this portfolio (then adjust risk via mix with risk-free asset)

Separation theorem: Everyone holds the same risky portfolio, differing only in risk exposure

The catch

This is only true if expected returns and covariances are estimated correctly. We will see in Part III why that assumption is deeply problematic in practice.

Client Choice and Risk Tolerance

Mapping questionnaires to portfolios:

Robo-advisers assess risk tolerance via questionnaire
Responses map to a position along the Capital Allocation Line
Conservative (blue): more in risk-free assets (left)
Aggressive (orange): more in tangency portfolio (right)
Example: 25-year-old vs. 65-year-old retiree

All hold the same risky portfolio and differ only in how much of it they hold

Risk tolerance assessment: Questionnaires ask scenario-based questions like “What would you do if your portfolio lost 20% in a year?” Responses reveal loss aversion, time horizon, and financial capacity to bear risk. These are psychometrically designed instruments validated against actual investment behaviour.
Mapping to allocations: Platforms like Betterment and Wealthfront convert questionnaire scores into a quantitative risk tolerance parameter. This determines the percentage allocated to the tangency portfolio vs. risk-free assets. High tolerance → 90-100% risky assets. Low tolerance → 30-50% risky assets.
Life-cycle investing: Younger investors with long time horizons can tolerate short-term volatility and typically choose high-risk allocations (right side). Older investors near retirement need stability and capital preservation, preferring low-risk allocations (left side). This aligns with standard financial planning advice.
Dynamic adjustment (glide paths): Some robo-advisers automatically shift allocations over time. Target-date approaches gradually reduce equity exposure as retirement approaches. This automates age-appropriate risk-taking without requiring client intervention.
Client choice vs. optimal choice: Not all clients choose rationally. Behavioural biases (overconfidence, loss aversion, recency bias) can lead to suboptimal selections. Good robo-advisers provide guardrails: “Given your age and goals, your selected allocation seems too conservative/aggressive. Are you sure?”
Fiduciary responsibility: Robo-advisers have a duty to ensure suitability: the recommended portfolio must align with client circumstances. The questionnaire and allocation algorithm are central to meeting this obligation.
Real-world example: A 25-year-old software engineer with stable income, no dependents, and 40 years to retirement might score 8/10 on risk tolerance. The robo-adviser allocates 90% to the tangency portfolio (global stocks/bonds mix) and 10% to cash. This position lies far right on the capital allocation line.
Assessment (if applicable): You should understand that risk tolerance drives allocation choice, and that robo-advisers use questionnaires to operationalise this mapping. A good critique asks whether questionnaires adequately capture client circumstances (e.g., hidden liquidity needs, behavioural biases).
Engagement: “Where would you position yourself today? Would that change in 20 years?” (Provokes thinking about life-cycle investing and how risk tolerance evolves.)

Implementation: Rebalancing and Tax-Loss Harvesting

Maintaining and enhancing the portfolio:

Rebalancing: automatically sell winners, buy losers to maintain target allocation
Prevents drift from chosen risk level
Typical triggers: quarterly, or when asset drifts >5% from target
Tax-loss harvesting: sell losing positions to generate tax deductions
Buy similar (not identical) assets to maintain exposure
Adds ~50-100 basis points to after-tax returns annually
Can exceed robo-adviser fees (0.25-0.50%)
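
The drift trigger for rebalancing can be sketched as follows. The holdings are hypothetical, and reading ">5%" as an absolute five-percentage-point deviation in any asset's weight is an assumption; platforms define their thresholds differently.

```python
import numpy as np

def rebalance_trades(values, target_weights, drift_threshold=0.05):
    """Trades (in currency) that restore target weights, or zeros if within the band."""
    values = np.asarray(values, dtype=float)
    target_weights = np.asarray(target_weights, dtype=float)
    current_weights = values / values.sum()
    drift = current_weights - target_weights
    if np.max(np.abs(drift)) <= drift_threshold:
        return np.zeros_like(values)                 # inside the band: do nothing
    return target_weights * values.sum() - values    # positive = buy, negative = sell

# HYPOTHETICAL portfolio: stocks have rallied and are now overweight
holdings = [52_000, 30_000, 18_000]                  # Stocks, Bonds, Real Estate (£)
targets = [0.40, 0.35, 0.25]

trades = rebalance_trades(holdings, targets)
for name, t in zip(["Stocks", "Bonds", "Real Estate"], trades):
    print(f"{name:<12} {'buy' if t > 0 else 'sell'} £{abs(t):,.0f}")
```

The trades sum to zero, so rebalancing moves money between assets without adding or withdrawing cash: the winner is sold and the laggards are bought.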

Estimation Error: MPT’s Achilles Heel

MPT assumes we know expected returns and covariances. We don’t. We estimate them from noisy historical data, and the optimiser treats those estimates as facts.

What a 2pp revision to one asset’s return estimate does to the optimal weights:

Show code

def optimize_portfolio(exp_ret, cov_mat, rf=0.02):
    n = len(exp_ret)
    def neg_sharpe(w):
        r = portfolio_return(w, exp_ret)
        v = portfolio_volatility(w, cov_mat)
        return -(r - rf) / v if v > 0 else np.inf
    result = minimize(neg_sharpe, np.ones(n)/n, method='SLSQP',
                      bounds=[(0, 1)]*n,
                      constraints={'type': 'eq', 'fun': lambda w: np.sum(w) - 1})
    return result.x

base_returns = np.array([0.08, 0.10, 0.12])
sens_cov = np.array([[0.04, 0.01, 0.02],
                     [0.01, 0.06, 0.03],
                     [0.02, 0.03, 0.09]])

perturbed = base_returns.copy()
perturbed[0] += 0.02  # Asset 1: 8% → 10%

w_base = optimize_portfolio(base_returns, sens_cov)
w_pert = optimize_portfolio(perturbed, sens_cov)

delta = (w_pert - w_base) * 100
print("Revising Asset 1 return estimate by just 2pp causes:")
for i, d in enumerate(delta, 1):
    print(f"  Asset {i}: {d:+.0f}pp weight change")

Revising Asset 1 return estimate by just 2pp causes:
  Asset 1: +14pp weight change
  Asset 2: -6pp weight change
  Asset 3: -8pp weight change

The optimiser is a magnifying glass for estimation error, not a filter for it.

Solution 1: Bootstrap Weight Uncertainty

Point estimates for optimal weights give no sense of how reliable those weights are. Bootstrapping the return history shows the full range of “optimal” portfolios consistent with the data.

Show bootstrap portfolio code

import pandas as pd

# Load real Bloomberg daily returns (SPY, BND, VNQ — 1,760 obs, 2018-2024)
_bdf = load_bloomberg()
_bdf['date'] = pd.to_datetime(_bdf['date'])
boot_daily = (_bdf[_bdf['ticker'].isin(['SPY','BND','VNQ'])]
              .drop_duplicates(subset=['date','ticker'])
              .pivot(index='date', columns='ticker', values='return')
              .dropna().sort_index())
boot_assets = boot_daily.columns.tolist()   # alphabetical: BND, SPY, VNQ
asset_labels = {'BND': 'BND (US Bonds)', 'SPY': 'SPY (US Stocks)', 'VNQ': 'VNQ (Real Estate)'}

# Baseline optimal — unconstrained (0–100% per asset, no max position limit)
ann_ret = boot_daily.mean().values * 252
ann_cov = boot_daily.cov().values * 252
w_base = optimize_portfolio(ann_ret, ann_cov, rf=0.04)
print("Baseline optimal (unconstrained, 0-100% per asset):")
for asset, w in zip(boot_assets, w_base):
    print(f"  {asset_labels[asset]}: {w:.0%}")

# Bootstrap 1,000 resamples of the full daily history
np.random.seed(42)
n_obs = len(boot_daily)
boot_weights = []
for _ in range(1000):
    idx = np.random.choice(n_obs, size=n_obs, replace=True)
    s = boot_daily.iloc[idx].values
    cov_est = np.cov(s.T) * 252
    if np.all(np.linalg.eigvalsh(cov_est) > 0):
        w = optimize_portfolio(s.mean(axis=0) * 252, cov_est, rf=0.04)
        if w is not None:
            boot_weights.append(w)

bw = np.array(boot_weights)
w_mean  = bw.mean(axis=0)
w_lower = np.percentile(bw, 2.5, axis=0)
w_upper = np.percentile(bw, 97.5, axis=0)

print(f"\n95% Bootstrap CIs ({len(bw)} runs, real Bloomberg data):")
for i, asset in enumerate(boot_assets):
    width = (w_upper[i] - w_lower[i]) * 100
    print(f"  {asset_labels[asset]}: {w_mean[i]:.0%}  [{w_lower[i]:.0%}, {w_upper[i]:.0%}]  (±{width/2:.0f}pp)")

Baseline optimal (unconstrained, 0-100% per asset):
  BND (US Bonds): 0%
  SPY (US Stocks): 100%
  VNQ (Real Estate): 0%

95% Bootstrap CIs (1000 runs, real Bloomberg data):
  BND (US Bonds): 0%  [0%, 0%]  (±0pp)
  SPY (US Stocks): 95%  [10%, 100%]  (±45pp)
  VNQ (Real Estate): 4%  [0%, 82%]  (±41pp)

The baseline says 100% SPY. But the bootstrap says the SPY allocation could be anywhere from 10% to 100%. A 90pp-wide confidence interval on the “optimal” weight is not a rounding error; it is a warning about how much we should trust the point estimate.

Solution 2: Out-of-Sample Validation

In-sample optimisation always looks good: the solver overfits to whatever history it sees. Walk-forward validation tests honestly by never letting the model see the data it is evaluated on.

The rolling-window procedure:

Estimate optimal weights on the first year of data
Hold that portfolio for the next quarter (out-of-sample)
Roll forward one quarter, re-estimate, repeat

Show rolling backtest code

import pandas as pd

# Load Bloomberg data: SPY, BND, VNQ (daily, 2018-2024)
_df = load_bloomberg()
_df['date'] = pd.to_datetime(_df['date'])
returns_history = (_df[_df['ticker'].isin(['SPY','BND','VNQ'])]
                   .drop_duplicates(subset=['date','ticker'])
                   .pivot(index='date', columns='ticker', values='return')
                   .dropna().sort_index())

train_window = 252   # 1 year of trading days
test_window  = 63    # 1 quarter

optimized_rets, equal_rets = [], []
for start in range(0, len(returns_history) - train_window - test_window, test_window):
    train = returns_history.iloc[start:start + train_window]
    test  = returns_history.iloc[start + train_window:start + train_window + test_window]
    cov_est = train.cov().values * 252
    if not np.all(np.linalg.eigvalsh(cov_est) > 0):
        continue
    w_opt = optimize_portfolio(train.mean().values * 252, cov_est)
    optimized_rets.append((test.values @ w_opt).mean())
    equal_rets.append(test.mean(axis=1).mean())

print(f"Out-of-Sample Performance (Bloomberg 2018-2024, {len(optimized_rets)} quarterly windows):")
print(f"  Optimised portfolio:  {np.mean(optimized_rets)*252:.1%} annualised")
print(f"  Equal-weight (1/N):   {np.mean(equal_rets)*252:.1%} annualised")
print(f"  Difference:           {(np.mean(optimized_rets)-np.mean(equal_rets))*252:+.1%}")

Out-of-Sample Performance (Bloomberg 2018-2024, 23 quarterly windows):
  Optimised portfolio:  9.5% annualised
  Equal-weight (1/N):   7.6% annualised
  Difference:           +1.9%

Over longer samples, optimised portfolios often underperform simple equal-weight. This sample shows the opposite, but 23 quarterly windows over 7 years are too few to separate skill from luck. DeMiguel, Garlappi, and Uppal (2009) document the underperformance of optimised portfolios across 50 years and 7 datasets.

Solution 3: Bayesian Shrinkage

Problem: Sample means are noisy estimates of true expected returns

Bayesian solution: Shrink extreme estimates toward the grand mean (Week 1, §0.4)

James-Stein estimator: \[\hat{\mu}_{i,\text{JS}} = \bar{\mu} + (1 - \lambda)(\hat{\mu}_i - \bar{\mu})\]

Notation: $\hat{\mu}_i$ = sample mean return for asset $i$, $\bar{\mu}$ = grand mean (average across assets). $\lambda$ = shrinkage intensity (0 = no shrinkage, 1 = full shrinkage to grand mean).

Show James-Stein shrinkage code

# Sample estimates with one outlier (noisy estimation from a short history)
sample_returns = np.array([0.09, 0.18, 0.08])  # Bonds suspiciously high
js_cov = np.array([[0.04, 0.01, 0.02],
                    [0.01, 0.03, 0.005],
                    [0.02, 0.005, 0.03]])

grand_mean = sample_returns.mean()
shrinkage_lambda = 0.4
shrunk_returns = grand_mean + (1 - shrinkage_lambda) * (sample_returns - grand_mean)

print("Sample vs Shrunk Returns:")
assets_js = ['Stocks', 'Bonds', 'Real Estate']
for name, s, sh in zip(assets_js, sample_returns, shrunk_returns):
    print(f"  {name:12s}: {s:.2%} → {sh:.2%}")

w_sample = optimize_portfolio(sample_returns, js_cov)
w_shrunk = optimize_portfolio(shrunk_returns, js_cov)
print("\nOptimal Weights:")
print(f"  {'':12s}  {'Sample':>8}  {'Shrunk':>8}")
for name, ws, wsh in zip(assets_js, w_sample, w_shrunk):
    print(f"  {name:12s}  {ws:>8.0%}  {wsh:>8.0%}")

Sample vs Shrunk Returns:
  Stocks      : 9.00% → 10.07%
  Bonds       : 18.00% → 15.47%
  Real Estate : 8.00% → 9.47%

Optimal Weights:
                  Sample    Shrunk
  Stocks              0%        2%
  Bonds              82%       69%
  Real Estate        18%       29%

Shrinkage pulls every estimate toward the grand mean: the high one (Bonds) down, the low ones (Stocks, Real Estate) up. That evens out the optimal weights and stops the solver from concentrating in a single asset.

Summary: When “Optimal” Breaks

What we saw, using illustrative inputs and real SPY/BND/VNQ data:

Sensitivity: A 2pp revision to one asset’s return estimate moved the optimal weights by up to 14pp (sensitivity slide).
Bootstrap: “Optimal” SPY weight had a 90pp-wide 95% CI (10%–100%). Point estimates are not trustworthy.
Rolling backtest: In our 7-year window the optimiser beat equal-weight; over many datasets and decades it usually loses (DeMiguel, Garlappi, and Uppal (2009)).
Shrinkage: Pulling estimates toward the grand mean (Bonds down, Stocks/RE up) stabilised weights and avoided concentration.

Robo-advisers respond by constraining weights (e.g. 5–40% per asset), using shrinkage or equal-weight, and rebalancing to targets. They do not show clients that “optimal” weights have 90pp-wide uncertainty.

The paradox

MPT assumes we know expected returns and covariances. We only have noisy estimates, and optimisation amplifies that noise (the bias–variance trade-off). The bootstrap and the walk-forward backtest showed the damage; Bayesian shrinkage showed one remedy. Practical fix: simple, robust rules (constraints, equal-weight, risk parity) instead of unconstrained MPT.
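
Two of the robust rules named above can be sketched in a few lines. This is a minimal illustration, assuming the simplest ("naive") form of risk parity that weights each asset by inverse volatility and ignores correlations; the function names are hypothetical. Neither rule uses expected returns, so there is no return estimate for the rule to amplify.

```python
import numpy as np

def equal_weight(n_assets):
    """1/N rule: no estimates required at all."""
    return np.full(n_assets, 1.0 / n_assets)

def inverse_vol_weights(cov_mat):
    """Naive risk parity: weight by 1/volatility, ignoring correlations."""
    inv_vol = 1.0 / np.sqrt(np.diag(cov_mat))
    return inv_vol / inv_vol.sum()

# Same illustrative covariance matrix as the shrinkage example
js_cov = np.array([[0.04, 0.01, 0.02],
                   [0.01, 0.03, 0.005],
                   [0.02, 0.005, 0.03]])

w_eq = equal_weight(3)
w_iv = inverse_vol_weights(js_cov)
for name, a, b in zip(['Stocks', 'Bonds', 'Real Estate'], w_eq, w_iv):
    print(f"  {name:12s}  1/N: {a:.0%}   inverse-vol: {b:.0%}")
```

Both rules keep every asset in the portfolio, in contrast to the corner solutions the unconstrained optimiser produced.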

Real Correlation Structure

Show code

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

df = load_bloomberg()
df["date"] = pd.to_datetime(df["date"])
etfs = ["SPY", "TLT", "GLD", "QQQ", "EFA", "BND", "IWM", "VNQ"]
# Ticker → short description for axis labels
etf_labels = {"SPY": "SPY (US large cap)", "QQQ": "QQQ (US tech)", "IWM": "IWM (US small cap)",
              "EFA": "EFA (ex-US developed)", "BND": "BND (US agg bonds)", "TLT": "TLT (long Treas)",
              "GLD": "GLD (gold)", "VNQ": "VNQ (real estate)"}
pivot = (df[df["ticker"].isin(etfs)]
         .drop_duplicates(subset=["date", "ticker"])
         .pivot_table(index="date", columns="ticker", values="log_return", aggfunc="mean")
         .dropna())
corr = pivot.corr().rename(index=etf_labels, columns=etf_labels)
fig, ax = plt.subplots(figsize=(10, 8))
sns.heatmap(corr, annot=True, fmt=".2f", cmap="RdBu_r",
            center=0, vmin=-1, vmax=1, square=True, linewidths=0.5, ax=ax,
            annot_kws={"size": 11})
ax.set_title("Bloomberg ETF correlations (2018–2024), 8-asset universe")
plt.xticks(rotation=45, ha="right")
plt.yticks(rotation=0)
plt.tight_layout()
plt.show()

8 ETFs, 2018–2024. Red = positive, blue = negative. Which pairs surprise you?

What the Correlation Data Shows

Equity cluster: SPY, QQQ, IWM, EFA correlate strongly with each other (about 0.76–0.93 in the heatmap); diversification among them is limited.
“Diversifiers” (TLT, GLD): In this full-sample matrix, TLT is negative with equities (e.g. SPY), GLD is low positive; neither is a uniform safe haven. (Time variation comes on the next slide.)
8 assets → 36 distinct entries in the covariance matrix (8 variances + 28 covariances); these estimates are noisy.
Scale up: 500 assets → 125,250 parameters (500×501/2) from the same kind of history.
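
The parameter counts above follow from $N(N+1)/2$ distinct entries in a symmetric $N \times N$ matrix. A short check (the function name `n_cov_params` is illustrative):

```python
def n_cov_params(n_assets: int) -> int:
    """Distinct entries in a symmetric covariance matrix: N variances + N(N-1)/2 covariances."""
    return n_assets * (n_assets + 1) // 2

for n in (8, 100, 500, 1000):
    print(f"{n:>5} assets -> {n_cov_params(n):>7,} covariance parameters")
```

The count grows with the square of the universe size, while the amount of history available grows only with time.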

Regime Change: When Diversification Fails

Show code

import pandas as pd
import matplotlib.pyplot as plt

_df2 = load_bloomberg()
_df2["date"] = pd.to_datetime(_df2["date"])
pivot = (_df2[_df2["ticker"].isin(["SPY", "TLT"])]
         .drop_duplicates(subset=["date", "ticker"])
         .pivot(index="date", columns="ticker", values="log_return")
         .dropna())
rolling_corr = pivot["SPY"].rolling(60).corr(pivot["TLT"]).dropna()
fig, ax = plt.subplots(figsize=(10, 3.5))
ax.plot(rolling_corr.index, rolling_corr.values, color="steelblue", linewidth=1.5)
ax.axhline(0, color="black", linewidth=0.8, linestyle="--")
ax.fill_between(rolling_corr.index, rolling_corr.values, 0,
                where=(rolling_corr.values > 0), alpha=0.3, color="red", label="Positive (risky)")
ax.fill_between(rolling_corr.index, rolling_corr.values, 0,
                where=(rolling_corr.values <= 0), alpha=0.3, color="green", label="Diversifying")
ax.set_title("SPY–TLT 60-day rolling correlation (Bloomberg, 2018–2024)")
ax.legend(loc="lower left")
ax.set_ylim(-0.6, 0.6)
plt.tight_layout()
plt.show()

The 2022 lesson every optimiser missed:

Pre-2022: bonds and equities were negatively correlated, so the standard diversification logic worked
2022: the Federal Reserve raised rates aggressively; stocks and bonds fell simultaneously; the correlation flipped positive
Any MPT portfolio “optimised” using pre-2022 data suffered: the efficient frontier itself shifted
Markowitz optimisation assumes a stationary covariance matrix, and the market will eventually violate that assumption

Part III : When Theory Meets Reality

MPT’s Hidden Assumptions

The gap between elegance and reality

MPT requires us to know:

Expected returns for every asset: estimated from noisy historical data
The complete covariance matrix: N(N+1)/2 parameters for N assets
That returns are approximately normally distributed
That constraints are simple (weights sum to 1, no shorts)
That the investment universe is small enough to solve exactly

Reality: None of these conditions reliably hold at scale

The Scale Problem

When size changes everything

Asset management at scale:

Global AUM reached $128 trillion in 2024 (12% growth); industry reports put it at a large share of global financial assets (Boston Consulting Group 2025)
Large managers hold portfolios of hundreds or thousands of individual securities
For 1,000 assets: MPT requires estimating 500,500 covariance parameters from ~250 daily observations per year of history

The Scale Problem

Why small-scale algorithms fail at large scale:

Classical quadratic programming solvers work well for ~100 assets
At 1,000+ assets: computational cost explodes, estimation error dominates inputs
Algorithms that reliably solve small problems often converge to poor solutions at large scale

The curse of dimensionality is not just a theoretical concern: it is the central practical challenge of modern asset management

How Much of the Covariance Matrix Is Noise?

When we estimate a covariance matrix from historical data, not all of it is signal. Random Matrix Theory tells us exactly where the noise ends.

The Marcenko-Pastur Law gives the eigenvalue bounds expected from purely random returns:

\[ \lambda_{\pm} = \sigma^2 \!\left(1 \pm \sqrt{\tfrac{M}{T}}\right)^{\!2} \]

where $M$ = number of assets, $T$ = number of observations, $\sigma^2$ = variance of the returns (1 for a correlation matrix), and $Q = T/M$ is the Q-ratio.

Any eigenvalue inside $[\lambda_{-}, \lambda_{+}]$ is statistically indistinguishable from noise.
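
The bound can be checked by simulation. The sketch below assumes purely random, uncorrelated returns with $\sigma^2 = 1$ (so the true covariance matrix is the identity and every true eigenvalue equals 1); the sizes $M = 100$ and $T = 1{,}000$ are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
M, T = 100, 1000                        # hypothetical: 100 assets, 1,000 observations
noise = rng.standard_normal((T, M))     # i.i.d. returns, no true correlation

# Eigenvalues of the sample covariance matrix of pure noise
eigs = np.linalg.eigvalsh(np.cov(noise, rowvar=False))

lam_minus = (1 - np.sqrt(M / T)) ** 2   # Marcenko-Pastur lower bound (sigma^2 = 1)
lam_plus = (1 + np.sqrt(M / T)) ** 2    # Marcenko-Pastur upper bound

print(f"MP bounds:        [{lam_minus:.3f}, {lam_plus:.3f}]")
print(f"Sample eig range: [{eigs.min():.3f}, {eigs.max():.3f}]")
```

Even though every true eigenvalue is 1, the sample eigenvalues spread across roughly the whole Marcenko-Pastur interval. That spread is pure estimation noise, which is why eigenvalues inside the interval cannot be trusted.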

Why the Q-Ratio Changes Everything

$Q = T/M$ determines how much of the covariance matrix survives as genuine signal.

Setting	$M$ assets	$T$ obs	$Q$	Consequence
Bloomberg ETF lab	8	~2,000	250	Very few noise eigenvalues
Typical fund	100	250	2.5	Most eigenvalues are noise
Large-scale AM	1,000	250	0.25	Matrix is rank-deficient: uninvertible

At $Q < 1$ there are fewer observations than assets and the sample covariance matrix is singular, so it cannot be inverted at all. Even at higher $Q$, only one to three eigenvalues carry almost all of the genuine signal in most estimated matrices.
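
The rank deficiency is easy to demonstrate. A minimal sketch with synthetic returns, assuming a hypothetical universe of 20 assets observed for only 10 periods ($Q = 0.5$):

```python
import numpy as np

rng = np.random.default_rng(1)
M, T = 20, 10                                  # more assets than observations
returns = rng.standard_normal((T, M)) * 0.01   # synthetic daily returns

cov = np.cov(returns, rowvar=False)            # M x M sample covariance
rank = np.linalg.matrix_rank(cov)

print(f"Matrix size: {cov.shape}, rank: {rank}, Q = {T / M}")
```

The sample covariance matrix has rank at most $T - 1$ (one degree of freedom is used by the mean), so with $T < M$ some eigenvalues are exactly zero and $\Sigma^{-1}$ does not exist.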

The Same Mathematics Powers AI

The noise-signal decomposition from portfolio theory reappears throughout modern machine learning.

Word embeddings: latent semantic analysis factorises a word co-occurrence matrix via SVD, the rectangular analogue of eigendecomposition, and GloVe and Word2Vec can be read as implicit factorisations of similar matrices. Keeping only the top-$k$ singular values is the same logic applied to language.
Transformer fine-tuning (LoRA): weight updates are written as $\Delta W = AB$ with rank $r \ll d$, confining learning to a low-dimensional subspace so the model can adapt efficiently from limited data.
Attention heads: loosely, each head picks out a different dominant direction of the token-interaction matrix $QK^\top$ (here $Q$ is the query matrix, not the Q-ratio), capturing a distinct semantic relationship.

We will return to this in the sequential learning week. For now, the core intuition is already in your hands.

Denoising the Covariance Matrix: Three Steps

The RMT noise bound gives us a principled way to clean the covariance matrix rather than just shrinking it blindly.

Decompose: $\Sigma = V \Lambda V^\top$ (eigendecompose the sample covariance matrix)
Classify: eigenvalues above $\lambda_{+}$ carry genuine signal; those below are statistically noise
Reconstruct: replace noise eigenvalues with their mean, rebuild $\hat{\Sigma} = V \hat{\Lambda} V^\top$

The eigenvectors $V$ are preserved throughout: only the magnitude of the noise components is shrunk, not the direction of the correlations.
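
The three steps can be sketched as follows. This is a minimal illustration on synthetic one-factor data, not a production routine: it follows the lecture code's convention of setting $\sigma^2$ to the mean eigenvalue, and the function name `denoise_covariance` is hypothetical.

```python
import numpy as np

def denoise_covariance(cov, n_obs):
    """Flatten eigenvalues at or below the MP upper bound to their mean."""
    m = cov.shape[0]
    eigvals, eigvecs = np.linalg.eigh(cov)             # step 1: decompose
    sigma2 = eigvals.mean()                            # convention used in the lecture code
    lam_plus = sigma2 * (1 + np.sqrt(m / n_obs)) ** 2
    noise = eigvals <= lam_plus                        # step 2: classify
    cleaned = eigvals.copy()
    if noise.any():
        cleaned[noise] = eigvals[noise].mean()         # step 3: replace, preserving the trace
    clean_cov = eigvecs @ np.diag(cleaned) @ eigvecs.T
    return clean_cov, int((~noise).sum())

# Synthetic data: one common market factor plus idiosyncratic noise
rng = np.random.default_rng(0)
T, M = 500, 20
factor = rng.normal(0, 0.01, size=(T, 1))
returns = factor + rng.normal(0, 0.01, size=(T, M))    # every asset has beta 1

sample_cov = np.cov(returns, rowvar=False)
clean_cov, n_signal = denoise_covariance(sample_cov, T)

print(f"Signal eigenvalues kept: {n_signal} of {M}")
print(f"Condition number: {np.linalg.cond(sample_cov):.1f} -> {np.linalg.cond(clean_cov):.1f}")
```

Replacing the noise eigenvalues with their mean keeps the total variance (the trace) unchanged while removing the smallest eigenvalues, which are the ones that destabilise $\Sigma^{-1}$.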

The Eigenvalue Spectrum: Bloomberg 8-ETF Data

Show denoising code

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

_df3 = load_bloomberg()
_df3["date"] = pd.to_datetime(_df3["date"])
etfs = ["SPY", "TLT", "GLD", "QQQ", "EFA", "BND", "IWM", "VNQ"]
pivot = (_df3[_df3["ticker"].isin(etfs)]
         .drop_duplicates(subset=["date", "ticker"])
         .pivot(index="date", columns="ticker", values="log_return")
         .dropna())
M, T = pivot.shape[1], pivot.shape[0]
cov = pivot.cov().values
eigenvalues, eigenvectors = np.linalg.eigh(cov)

sigma2 = np.mean(eigenvalues)
lam_plus = sigma2 * (1 + 1 / np.sqrt(T / M)) ** 2

ev_sorted = sorted(eigenvalues, reverse=True)
ranks = range(1, M + 1)   # ranks 1–8, not 0–7
colors = ["steelblue" if ev > lam_plus else "lightcoral" for ev in ev_sorted]

from matplotlib.patches import Patch
from matplotlib.lines import Line2D

fig, ax = plt.subplots(figsize=(9, 4.5))
ax.bar(ranks, ev_sorted, color=colors)
ax.axhline(lam_plus, color="red", linestyle="--", linewidth=1.5)
legend_handles = [
    Patch(facecolor="steelblue", label="Signal (above MP bound)"),
    Patch(facecolor="lightcoral", label="Noise (below MP bound)"),
    Line2D([0], [0], color="red", linestyle="--", linewidth=1.5,
           label=f"MP upper bound λ+ = {lam_plus:.4f}"),
]
ax.legend(handles=legend_handles)
ax.set_xticks(list(ranks))
ax.set_xlabel("Eigenvalue rank (1 = largest)")
ax.set_ylabel("Eigenvalue magnitude")
ax.set_title(f"Bloomberg 8-ETF eigenvalue spectrum (M={M}, T={T}, Q={T/M:.0f})")
plt.tight_layout()
plt.show()

What the Denoised Matrix Achieves

Replacing noise eigenvalues with their mean produces a matrix that:

Is better conditioned (no near-zero eigenvalues that blow up the inverse)
Has the same eigenvector structure as the sample matrix (correlations are preserved)
Is theoretically justified, unlike ad hoc shrinkage toward the grand mean

This is the same principle as James-Stein shrinkage, but with a mathematically grounded decision about which components to shrink rather than applying a uniform pull.

The result feeds directly into the optimiser. With Q ≈ 250, our 8-ETF universe already has a well-conditioned matrix; the denoising matters far more at scale (a typical fund has Q ≈ 2.5).

Beyond Variance: Skewness Matters

MPT treats a +20% and a -20% return as identical risk. Investors do not.

Equity returns are negatively skewed: crashes are more common than symmetry implies (2008, 2020).
Positive skewness commands a premium: venture capital and options buyers pay for upside optionality.
Loss aversion amplifies the problem: behavioural research shows losses hurt roughly twice as much as equivalent gains feel good.

Adding skewness as a third objective (maximise return, minimise variance, maximise skewness) converts the problem from a tractable quadratic programme into something far harder, with no single optimal solution.
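
To see what variance alone misses, the sketch below builds two synthetic return series that are mirror images of each other around the same mean. The data are simulated (an assumption for illustration only): one series has small steady gains with occasional large losses, the other the reverse.

```python
import numpy as np

def skewness(x):
    """Sample skewness: third central moment divided by the cubed standard deviation."""
    d = x - x.mean()
    return (d ** 3).mean() / (d ** 2).mean() ** 1.5

rng = np.random.default_rng(0)
crashy = 0.04 - rng.exponential(0.03, size=5000)   # frequent small gains, rare large losses
lottery = 2 * crashy.mean() - crashy               # mirror image around the same mean

for name, r in [("crash-prone", crashy), ("lottery-like", lottery)]:
    print(f"{name:13s} mean={r.mean():.4f}  std={r.std():.4f}  skew={skewness(r):+.2f}")
```

A mean-variance optimiser sees the two series as identical, because they share the same mean and standard deviation. Only the third moment separates them.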

The Newly-Listed Securities Problem

MPT assumes every asset has an estimable return distribution. Large portfolios always include IPOs, recent listings, and newly opened markets where weeks or months of data offer little more than noise.

Excluding them misses high-growth opportunities. Including them requires expert judgement: formal beliefs about future performance drawn from fundamentals, sector knowledge, and analyst views.

The challenge is how to combine probability-based estimates and judgement-based estimates in a single coherent optimisation. This is precisely the problem Black-Litterman was designed to address.

Real-World Constraints: From Quadratic to NP-Hard

Standard MPT needs only two constraints: weights sum to 1, no short-selling. Real portfolios add:

Cardinality: hold between $K_{\min}$ and $K_{\max}$ securities; no infinitesimal positions
Minimum transaction lots: invest in whole units (e.g. 100 shares per lot)
Boundary limits: each security has a floor and a cap on allocation
Multiple objectives: maximise return, minimise variance, and maximise skewness simultaneously

Adding cardinality and lot-size constraints converts the smooth quadratic programme into an NP-hard combinatorial problem. For $N = 500$ assets and $K = 50$ to hold, there are $\binom{500}{50} \approx 2.3 \times 10^{69}$ possible asset subsets (far more than $10^{62}$), before any weights are chosen: exhaustive search is infeasible.
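
The size of the search space can be computed exactly with the standard library. The evaluation speed of one billion portfolios per second is a hypothetical figure, chosen only to give a sense of scale.

```python
import math

n_subsets = math.comb(500, 50)            # ways to choose which 50 of 500 assets to hold
print(f"Subsets to consider: {n_subsets:.2e}")

evals_per_second = 1e9                    # hypothetical brute-force speed
seconds_per_year = 365 * 24 * 3600
years = n_subsets / evals_per_second / seconds_per_year
print(f"Years to enumerate:  {years:.2e}")
```

This counts only which assets are held. Choosing the weights (in whole lots) for each subset multiplies the search space further.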

The Gap: Textbook vs Reality

	Textbook MPT	Real portfolio construction
Universe	Small, all well-established	Hundreds of assets, many newly listed
Constraints	Weights sum to 1, no shorts	Cardinality, lot sizes, boundary limits
Objectives	Minimise variance	Return, variance, skewness simultaneously
Solver	Exact quadratic programme	Approximation; NP-hard in general
Estimation	Assumed known	Noisy; some assets have no history

Seventy years of research have been devoted to bridging this gap. Part IV traces that journey.

Part IV : The Algorithmic Revolution

Seventy Years of Portfolio Technology

Generation	Era	Core approach	Where it breaks
1st: Classical	1952–1990	Markowitz mean-variance	Estimation error; breaks at scale
2nd: Robust	1990–2010	Shrinkage, Black-Litterman, factor models	Still single-objective, small universe
3rd: Computational	2010–	Evolutionary algorithms, ML	Computationally intensive; active research

Most retail robo-advisers today use 1st and 2nd generation methods. The research frontier, and the gap between theory and deployed products, sits in the 3rd.

Black-Litterman: Market Consensus + Expert Views

Black-Litterman was developed at Goldman Sachs in 1990. Its intuition is Bayesian: start from what the market already believes, then layer on where you have genuine conviction.

Prior: reverse-engineer expected returns from current market-cap weights via CAPM. This is the market’s collective forecast, not noisy sample means.
Update: “I believe UK equities will outperform by 2% over the next year.” That view shifts the prior mainly in the assets it touches, and in assets correlated with them.
Result: stable, well-diversified allocations; views only move weights where the manager has something to say.

The fix is elegant but still limited to small-to-medium portfolios with a single objective. It does not solve cardinality or lot-size constraints.
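
A minimal sketch of the prior-plus-view update, using the standard Black-Litterman posterior mean. Everything numeric here is an assumption for illustration: the three-asset universe, the covariance matrix, the market weights, the risk-aversion coefficient `delta`, the prior-uncertainty scalar `tau`, and the choice of view uncertainty `omega` (a commonly used default proportional to the prior variance of the view). The view is expressed as UK equities outperforming US equities by 2%.

```python
import numpy as np

def black_litterman(cov, w_mkt, P, q, delta=2.5, tau=0.05):
    """Return (implied prior returns, posterior returns) for views P @ mu = q."""
    pi = delta * cov @ w_mkt                           # prior: reverse-optimised from market weights
    tau_cov = tau * cov
    omega = np.diag(np.diag(P @ tau_cov @ P.T))        # view uncertainty (assumed default)
    # Posterior mean: pi + tau*Sigma*P' (P tau*Sigma P' + Omega)^-1 (q - P pi)
    gain = tau_cov @ P.T @ np.linalg.inv(P @ tau_cov @ P.T + omega)
    return pi, pi + gain @ (q - P @ pi)

assets = ["UK equities", "US equities", "Gilts"]       # hypothetical universe
cov = np.array([[0.040, 0.024, 0.002],
                [0.024, 0.036, 0.001],
                [0.002, 0.001, 0.004]])
w_mkt = np.array([0.15, 0.60, 0.25])                   # illustrative market-cap weights
P = np.array([[1.0, -1.0, 0.0]])                       # view portfolio: long UK, short US
q = np.array([0.02])                                   # view: UK beats US by 2%

pi, mu_bl = black_litterman(cov, w_mkt, P, q)
for name, prior, post in zip(assets, pi, mu_bl):
    print(f"  {name:12s} prior {prior:.2%} -> posterior {post:.2%}")
```

With this choice of `omega` the posterior sits halfway between the market-implied spread and the stated view, and the asset the view does not mention (gilts) moves only slightly, through its correlation with equities.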

When One Frontier Is Not Enough

The classical efficient frontier is two-dimensional: return versus variance. Add skewness as a third objective and the “frontier” becomes a surface.

On that surface there is no single “best” portfolio; there are only Pareto-optimal portfolios: ones where you cannot improve any objective without worsening at least one other. Think of a restaurant menu: the best value dish depends on what you care about (price, calories, taste).

The three objectives pull in different directions:

Higher return generally requires more variance.
Positive skewness (limited downside) often means accepting a lower expected return.
An investor’s personal weighting over the three determines which point on the surface is right for them.

The next generation of robo-advisers will match clients to a point on this Pareto surface, not just a single risk-tolerance band.
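
Pareto optimality is straightforward to test in code. The sketch below filters a set of hypothetical candidate portfolios, scored on expected return, variance, and skewness, down to the non-dominated ones; the candidates and the function name `pareto_mask` are illustrative.

```python
import numpy as np

def pareto_mask(scores):
    """scores[i, j] = objective j for portfolio i, oriented so that larger is better."""
    keep = np.ones(scores.shape[0], dtype=bool)
    for i in range(scores.shape[0]):
        # i is dominated if another row is >= on every objective and > on at least one
        at_least = (scores >= scores[i]).all(axis=1)
        strictly = (scores > scores[i]).any(axis=1)
        keep[i] = not (at_least & strictly).any()
    return keep

# Hypothetical candidates: (expected return, variance, skewness)
candidates = np.array([
    [0.10, 0.040, -0.5],   # A: highest return, but high variance and negative skew
    [0.06, 0.010,  0.0],   # B: lowest variance
    [0.07, 0.030,  0.8],   # C: highest skewness
    [0.05, 0.020, -0.2],   # D: worse than B on all three objectives
])
scores = candidates * np.array([1, -1, 1])   # flip variance so larger is better
mask = pareto_mask(scores)

for label, keep in zip("ABCD", mask):
    print(f"  Portfolio {label}: {'Pareto-optimal' if keep else 'dominated'}")
```

A, B, and C all survive because each is best on one objective. Choosing among them is a question of investor preference, not of optimisation.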

Evolutionary Algorithms: Computing What Calculus Cannot

Classical optimisers need smooth, differentiable objectives. Cardinality and lot-size constraints destroy smoothness; gradient methods cannot navigate integer steps. The search space is too vast for exhaustive enumeration.

Evolutionary algorithms borrow from biology instead: maintain a population of candidate portfolios, score each against all objectives, then apply selection, crossover, and mutation. Better portfolios survive; the population evolves toward the Pareto surface.

Multi-Objective Evolutionary Algorithms (MOEAs) extend this idea to multiple simultaneous objectives. Rather than collapsing return, variance, and skewness into one number (like the Sharpe ratio), a MOEA maintains a diverse set of non-dominated solutions across the full Pareto frontier. No gradients are needed; integer constraints are handled naturally.
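
The selection, crossover, and mutation loop can be sketched with a toy example. This is a deliberately simplified single-objective genetic algorithm, not a MOEA and not the algorithm from Liu et al.: it picks which $K$ of $N$ assets to hold (equally weighted) so as to maximise a return-to-volatility ratio. All data are synthetic and all names and parameter values are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
N, K = 30, 5                                    # hypothetical universe and cardinality
mu = rng.uniform(0.02, 0.12, N)                 # synthetic expected returns
A = rng.normal(size=(N, N))
cov = 0.04 * (A @ A.T) / N + 0.01 * np.eye(N)   # synthetic positive-definite covariance

def fitness(subset):
    """Return/volatility of an equal-weight portfolio on the chosen assets."""
    idx = np.array(sorted(subset))
    w = np.full(len(idx), 1.0 / len(idx))
    return (w @ mu[idx]) / np.sqrt(w @ cov[np.ix_(idx, idx)] @ w)

def crossover(a, b):
    """Child draws K assets from the union of its parents' holdings."""
    pool = np.array(sorted(a | b))
    return set(rng.choice(pool, size=K, replace=False).tolist())

def mutate(s, rate=0.3):
    """With probability `rate`, swap one held asset for one not held."""
    s = set(s)
    if rng.random() < rate:
        out = int(rng.choice(sorted(s)))
        inn = int(rng.choice(sorted(set(range(N)) - s)))
        s.remove(out)
        s.add(inn)
    return s

pop = [set(rng.choice(N, size=K, replace=False).tolist()) for _ in range(40)]
best_initial = max(fitness(s) for s in pop)

for _ in range(50):
    pop.sort(key=fitness, reverse=True)
    parents = pop[:20]                          # selection: the fitter half survives
    children = []
    while len(children) < 20:
        i, j = rng.choice(20, size=2, replace=False)
        children.append(mutate(crossover(parents[i], parents[j])))
    pop = parents + children

best = max(pop, key=fitness)
print(f"Best initial fitness: {best_initial:.3f}")
print(f"Best final fitness:   {fitness(best):.3f}  holding assets {sorted(best)}")
```

The cardinality constraint is enforced by the representation itself (every candidate is a set of exactly $K$ assets), so no gradient or penalty term is needed. A MOEA replaces the single `fitness` score with a non-dominated ranking over several objectives.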

Liu et al. (2024) argue that MOEAs are the only class of algorithm that can handle NP-hard portfolio problems at scale.

Why Scale Breaks Standard MOEAs

MOEAs work well at small scale (30 assets), but performance collapses at large scale (1,000 assets). Liu et al. (2024) (Fig. 1) show that all standard algorithms fail to approximate the Pareto frontier once the portfolio exceeds a few hundred securities.

The problem is exploration versus convergence: in thousands of dimensions, a population that searches broadly never converges; one that converges quickly gets stuck in a poor region. Standard algorithms cannot hold both.

Three research directions have emerged to fix this:

Group decision variables: exploit the fact that assets in the same sector move together
Reduce the decision space: use dimensionality reduction before optimising
Novel search strategies: guide exploration toward promising regions of the Pareto surface

Liu et al. (2025): Three Problems, One Framework

IEEE Transactions on Evolutionary Computation, Vol. 29, Feb 2025 (ABS 4) (Liu et al. 2024)

The paper takes the three open problems we have just covered and solves them simultaneously:

Newly-listed securities: an uncertain random variable framework blends probability-based estimates (established assets) with expert-judgement-based estimates (IPOs, recent listings) in one mathematically consistent model.
Constraint handling: an encoder-decoder mechanism transforms the NP-hard constrained problem (cardinality, lot sizes, boundaries) into an unconstrained one that any MOEA can address.
Scale: the LSWOEA algorithm combines decision space reduction with a dispersed target-guided search to maintain exploration-convergence balance at 1,000 assets.

Results: Statistically Dominant at Every Scale

Tested against 9 benchmark MOEAs across 6 portfolio sizes (30 to 1,000 securities), LSWOEA achieves higher hypervolume on all six datasets. The performance advantage grows with scale, precisely where other algorithms fail most.

Hypervolume measures how much of the objective space is dominated by the Pareto frontier: unlike a single Sharpe ratio, it captures the quality of the entire trade-off surface. Statistical significance: Mann-Whitney U, Bonferroni-corrected, $p \ll 0.001$.
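
A two-objective sketch makes the hypervolume idea concrete. The paper works with three objectives; the version below is a simplified two-dimensional illustration in which both objectives are maximised and measured against a reference point, and the function name and sample frontiers are hypothetical.

```python
def hypervolume_2d(points, ref=(0.0, 0.0)):
    """Area dominated by `points` relative to `ref` (both objectives maximised)."""
    area, best_y = 0.0, ref[1]
    # Sweep from the largest first objective downwards, adding each new strip
    for x, y in sorted(points, key=lambda p: p[0], reverse=True):
        if y > best_y:                       # dominated points add no area
            area += (x - ref[0]) * (y - best_y)
            best_y = y
    return area

frontier_a = [(1, 3), (2, 2), (3, 1)]        # a well-spread frontier
frontier_b = [(1, 2), (2, 1)]                # a frontier that is dominated by A

print(hypervolume_2d(frontier_a))
print(hypervolume_2d(frontier_b))
```

A larger hypervolume means the frontier pushes further out and covers the trade-off more completely, which is why it is used to compare whole sets of solutions rather than single portfolios.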

From any point on the resulting frontier, the investor can choose a strategy matching their preference:

Return preference: higher expected return, accepting more variance and lower skewness
Risk preference: minimum variance, larger allocation to the risk-free asset
Skewness preference: positive asymmetry, accepting lower expected return

Running time: roughly 15 seconds for a 1,000-security portfolio on standard hardware.

From Today’s Robo to Tomorrow’s Algorithmic Engine

Dimension	Today’s robo-adviser	Tomorrow’s system
Universe	6–12 ETFs	Thousands of securities
Objectives	Sharpe ratio (2D)	Return, variance, skewness (3D Pareto)
New assets	Excluded or ad hoc	Uncertain random variable framework
Solver	Constrained MPT	MOEA at scale
Client interface	Risk band (1–10)	Point on Pareto surface

The shift is from automating a 1952 model to operationalising 2025 research. The robo-adviser of tomorrow is an evolutionary engine running on the Pareto frontier.

Part V : Python Implementation and Directed Learning

Lab: Build Your Own Robo-Adviser

Five tasks, each mirroring a section of today’s lecture:

Recreate the fee comparison and visualise access expansion
Build the portfolio optimiser; test on SPY, BND, VNQ
Generate the efficient frontier; identify the tangency portfolio
Perturb expected returns and see how weights respond
Reflect: what would you need to add to handle 1,000 assets and three objectives?

Deliverable: notebook with code, plots, and 300–400 word interpretation connecting code to theory.

Lab structure: Five tasks mirroring the lecture content. Each has code, visualisation, and interpretation.
Task 1 (cost comparison): Recreate the fee charts. Explore break-even points. Calculate savings over time.
Task 2 (portfolio optimiser): Implement the optimisation algorithm. Test on 3–5 asset portfolios.
Task 3 (efficient frontier): Generate the frontier, identify the max Sharpe portfolio.
Task 4 (sensitivity): Vary expected returns, risk aversion, and constraints. See how allocations change.
Task 5 (reflection): How would the three-moment model change your optimiser? What would you need to add to handle 1,000 assets? This connects the lab to Parts III–IV.
Timing: 60–90 minutes in lab; 60–90 minutes directed learning extensions.
Assessment link: Lab 5 prepares for CW1 (robo-adviser backtests) and CW2 (portfolio scaffold).

Quick Check-In Questions

Q1: Why do robo-advisers have lower minimum account sizes than traditional advisers?
Q2: What is the efficient frontier, and how do robo-advisers use it?
Q3: Name one reason why standard portfolio optimisation algorithms struggle with large-scale portfolios of 1,000+ securities.

Interactive check: Pause and ask students to answer (verbally, chat, or poll).
A1: Automation reduces marginal cost per client to near-zero; fixed costs spread across millions of accounts; no capacity constraint from human advisers.
A2: The efficient frontier is the set of portfolios with highest return for each risk level. Robo-advisers match client risk tolerance (from questionnaire) to an optimal portfolio on the frontier.
A3: Estimation error dominates inputs (500,000+ parameters from ~250 data points); cardinality and lot-size constraints make the problem NP-hard; the high-dimensional decision space causes standard MOEAs to lose the exploration-convergence balance and converge to poor solutions.
Why ask these?: Reinforces key concepts, checks understanding, engages students.
Timing: 2-3 minutes total.

Assessment Touchpoints

CW1: Business Analysis Presentation (Week 6)

Task: Analyse a FinTech business model (e.g., Nutmeg, Wealthfront).
Relevance: Apply the economics of robo-advisory to explain their value proposition.
- How do they lower costs?
- Who is their target market?
- What are the risks?

CW2: Technical Implementation (Week 13)

Task: Build a portfolio optimisation tool.
Relevance: The Python code we write today (Part V) is the core engine of a robo-adviser.
Key Skill: Implementing MPT and backtesting performance.

Directed Learning Plan (≈3 hours)

Reading (60 min):

Hilpisch (2019) Chapter 13 (portfolio analytics and optimisation)
Reher and Sokolinski (2024) (empirical evidence on access and welfare)
Gu, Kelly, and Xiu (2020) Sections 1-2 (ML in asset pricing, optional but valuable)

Practical (75 min):

Complete Lab 5 Tasks 1-5
Include code, plots, and interpretations

Reflection (45 min):

Write 300-400 words: “Who benefits most from robo-advisers? What are the main risks? How should governance evolve?”
Use at least one citation

Reading strategy: Hilpisch Ch. 13 is technical: focus on portfolio optimisation code (pp. 300-320). Reher & Sokolinski: read intro, methodology, results (skip technical appendices). Gu et al.: optional; read if interested in ML extensions.
Practical time: 75 minutes includes coding, running variations, creating plots, writing interpretations. Budget ~15 min per task.
Reflection prompt: This is practice for coursework essays. Answer in clear prose (not bullet points). Support claims with evidence (cite Reher findings). Discuss trade-offs (not just benefits).
Total time: 60 + 75 + 45 = 180 minutes ≈ 3 hours. This is expected directed learning.
Support resources: Lab 5 Colab notebook, chapter text, slides, reading papers. TAs in lab sessions. Office hours by appointment.
Assessment preparation: Directed learning directly prepares you for coursework.

Project preview: factor-based investing

Preview of an end-of-semester project pathway:

You’ll replicate or extend research on factor-based investing using professional factor data

Today’s primer task (optional, 20-30 min):

Explore the Jensen-Kelly-Pedersen (JKP) factor dataset (resources/jkp-sample.csv)
Understand what factors like MKT, SMB, HML, MOM represent
Compute basic summary statistics and visualize cumulative returns
Connect factors to robo-advisor portfolio construction

Why connect this to robo-advisors?

Modern robo-advisors don’t just diversify across assets: they target systematic factor exposures (value, momentum, quality). Understanding factors is essential for evaluating algorithmic investment strategies.

Why Week 5 is the right time:

Conceptual foundation: Students now understand portfolio optimization and risk-return tradeoffs from today’s lecture
Sufficient lead time: Coursework 2 due end of semester: introducing now gives 8+ weeks for students to work on it gradually
Natural connection: Robo-advisors use factor tilts in their algorithms; JKP factors measure these systematic return patterns
Not overwhelming: Positioned as optional primer task, not a requirement this week

JKP Factor Dataset Context:

What it is: Research-quality factor returns maintained by Jensen, Kelly, and Pedersen
Why it matters: Similar to Fama-French factors but with methodological improvements; widely used in academic research
Key factors:
- MKT (Market): Overall market return minus risk-free rate
- SMB (Small-Minus-Big): Small-cap stocks minus large-cap (size effect)
- HML (High-Minus-Low): Value stocks minus growth stocks (value effect)
- MOM (Momentum): Winners minus losers (momentum effect)
Connection to robo-advisors: Many platforms tilt portfolios toward factors believed to generate long-run excess returns

Sample CSV Structure:

date,MKT,SMB,HML,MOM
2020-01-31,0.021,-0.005,0.008,0.012
2020-02-29,-0.085,-0.012,-0.003,-0.004
...

Primer Task Walkthrough (optional):

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

# Load JKP sample data (monthly factor returns in decimal form)
jkp = pd.read_csv('resources/jkp-sample.csv', parse_dates=['date']).set_index('date')
factors = ['MKT', 'SMB', 'HML', 'MOM']

# 1. Summary statistics
print("Factor Summary Statistics:")
print(jkp[factors].describe())

# 2. Annualized returns and volatility, in percent
annual_ret = jkp[factors].mean() * 12 * 100          # arithmetic monthly mean x 12
annual_vol = jkp[factors].std() * np.sqrt(12) * 100  # monthly std x sqrt(12)
# Factors are excess or long-short returns, so no risk-free rate is subtracted
sharpe = annual_ret / annual_vol

print("\nAnnualized Metrics:")
for factor in factors:
    print(f"{factor}: {annual_ret[factor]:.1f}% return, {annual_vol[factor]:.1f}% vol, Sharpe: {sharpe[factor]:.2f}")

# 3. Cumulative returns (show momentum performance)
cum_returns = (1 + jkp['MOM']).cumprod() - 1
plt.figure(figsize=(10,6))
plt.plot(cum_returns)
plt.title('Momentum Factor (MOM) Cumulative Return')
plt.ylabel('Cumulative Return')
plt.grid(alpha=0.3)
plt.show()

Expected insights students should gain:

Factor returns are volatile: Even systematic factors have periods of underperformance
Different risk-return profiles: Some factors (MKT) have high returns but high vol; others have moderate returns with lower vol
Long-run patterns: Over decades, certain factors show positive average returns (the “factor premium”)
Portfolio implications: Robo-advisors that tilt toward value or momentum are making systematic bets on these factor premiums
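
One way to quantify "periods of underperformance" is the maximum drawdown of a factor's compounded return path. The sketch below uses a handful of made-up monthly returns rather than the JKP sample; `max_drawdown` is a hypothetical helper name.

```python
import numpy as np

def max_drawdown(returns):
    """Largest peak-to-trough loss of the compounded path, as a negative fraction."""
    wealth = np.cumprod(1.0 + np.asarray(returns, dtype=float))
    wealth = np.concatenate([[1.0], wealth])  # start at 1 so a first-period loss counts
    running_peak = np.maximum.accumulate(wealth)
    drawdowns = wealth / running_peak - 1.0
    return drawdowns.min()

# Toy monthly returns, made up for illustration
toy = [0.10, -0.20, -0.10, 0.05, 0.30]
print(f"Max drawdown: {max_drawdown(toy):.1%}")
```

With real data you could pass `jkp['MOM']` (or any other factor column) to the same function to see how deep the worst losing stretch was.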

Possible scope (full version):

Complete JKP dataset (1963-present, 60+ years)
Multiple factor portfolios and combinations
HAC (heteroskedasticity and autocorrelation consistent) standard errors
Robustness checks (subperiods, alternative specifications)
You’ll replicate findings from a Journal of Portfolio Management paper OR propose an extension
Due end of semester; detailed brief provided Week 8-10
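
The HAC item in the scope list can be sketched as follows: a hand-rolled Newey-West (Bartlett kernel) t-statistic for a factor's mean return. This is a minimal illustration, not the coursework specification. The lag choice of 6 is an assumption, and the autocorrelated series is synthetic with deliberately exaggerated persistence so that the effect is visible.

```python
import numpy as np

def newey_west_tstat(returns, max_lags):
    """t-statistic for the mean using a Newey-West (Bartlett kernel) HAC variance."""
    r = np.asarray(returns, dtype=float)
    n = len(r)
    mean = r.mean()
    e = r - mean
    # Lag-0 term: ordinary variance of the demeaned series
    long_run_var = e @ e / n
    # Add autocovariances with Bartlett weights 1 - k / (L + 1)
    for k in range(1, max_lags + 1):
        weight = 1.0 - k / (max_lags + 1)
        gamma_k = e[k:] @ e[:-k] / n
        long_run_var += 2.0 * weight * gamma_k
    return mean / np.sqrt(long_run_var / n)

# Synthetic AR(1) returns with strong positive autocorrelation (illustrative only)
rng = np.random.default_rng(1)
n = 600
shocks = rng.normal(0.0, 0.03, n)
r = np.zeros(n)
for t in range(1, n):
    r[t] = 0.5 * r[t - 1] + shocks[t]
r += 0.004

t_naive = newey_west_tstat(r, max_lags=0)  # reduces to the usual t-statistic
t_hac = newey_west_tstat(r, max_lags=6)
print(f"naive t = {t_naive:.2f}, HAC t = {t_hac:.2f}")
```

With positively autocorrelated returns the HAC t-statistic is smaller in magnitude than the naive one, which is why ignoring serial correlation can overstate the significance of a factor premium.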

Why this preview is useful:

It shows how factor data connects back to robo-adviser portfolio construction, and it gives you a concrete dataset to practise careful measurement, validation, and interpretation.

Why optional this week?

Core learning this week is robo-advisor economics and MPT implementation
JKP task helps students who want to get ahead on Coursework 2
Not required for Lab 04 or immediate assessments
But strongly encouraged for building familiarity with factor data early

Student support:

Lab 04 (separate from today) will include JKP data orientation
Office hours available for Coursework 2 questions (starting Week 8)
Additional guidance provided Week 8-10 when formal brief released

Timing: 3-4 minutes for this slide. It’s a preview and encouragement to explore, not a technical deep-dive.

Transition: “That’s a glimpse of your major project. Now let’s look at where the frontier of portfolio construction is heading with embeddings…”

Looking Forward: From Characteristics to Embeddings

Traditional approach:
Robo-advisers use hand-crafted characteristics (size, value, momentum) to build portfolios

Emerging approach:
Learn asset relationships from portfolio holdings data: “asset embeddings”

Key insight from recent research (Gabaix et al. 2025):
Portfolio holdings encode rich information about which assets belong together

Analogy: Just as words appearing in similar contexts have related meanings, assets appearing in similar portfolios share investment characteristics
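
The analogy can be made concrete with a toy example. The sketch below builds asset vectors from a truncated SVD of a small fund-by-asset holdings matrix and compares them by cosine similarity. This is a simplified stand-in for illustration, not the estimation method of Gabaix et al. (2025); the funds and their holdings are invented.

```python
import numpy as np

assets = ["AAPL", "MSFT", "XOM", "CVX"]

# Hypothetical holdings: rows are funds, columns are assets (1 = fund holds the asset)
holdings = np.array([
    [1, 1, 0, 0],  # tech fund A
    [1, 1, 0, 0],  # tech fund B
    [0, 0, 1, 1],  # energy fund A
    [0, 0, 1, 1],  # energy fund B
    [1, 1, 1, 1],  # broad market fund
], dtype=float)

def asset_embeddings(holdings, dim):
    """Low-rank asset vectors from a truncated SVD of the fund-by-asset matrix."""
    _, s, vt = np.linalg.svd(holdings, full_matrices=False)
    return vt[:dim].T * s[:dim]  # one row per asset

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

emb = asset_embeddings(holdings, dim=2)
idx = {name: i for i, name in enumerate(assets)}
sim_tech = cosine(emb[idx["AAPL"]], emb[idx["MSFT"]])
sim_cross = cosine(emb[idx["AAPL"]], emb[idx["XOM"]])
print(f"AAPL vs MSFT: {sim_tech:.2f}")
print(f"AAPL vs XOM:  {sim_cross:.2f}")
```

Assets held by the same funds end up with nearly identical vectors, while assets that only share the broad market fund are much less similar. Real applications use thousands of portfolios and far more dimensions, which is where the validation concerns come in.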

Context: We’ve covered traditional robo-adviser algorithms (MPT, factor models). Now let’s preview an emerging technique that’s pushing the frontier.
Traditional limitations: Hand-crafted factors require economic intuition. We need to think of size, value, momentum. What if we could learn relationships directly from data?
Embeddings intuition: Think of Netflix recommendations. “People who liked Movie A also liked Movie B” → movies have similar embeddings. Same idea for assets: “Funds that held Apple also held Microsoft” → assets have similar investment profiles.
Portfolio holdings as data: Institutional investment managers in the US disclose their holdings quarterly (Form 13F), and mutual funds and ETFs report holdings as well. Thousands of institutional portfolios reveal which assets professionals group together.
What embeddings capture: Industry relationships (tech stocks cluster), risk profiles (high-volatility assets cluster), investment styles (growth vs. value), supply chain connections (suppliers and customers cluster).
Research finding (Gabaix et al. 2025): Asset embeddings predict return comovement and cross-sectional valuations better than traditional characteristics. This suggests institutions’ revealed preferences encode information not in standard factor models.
Three advantages for robo-advisers:
1. Automatic adaptation: Embeddings update as holdings data arrives: no manual factor engineering
2. Rich information: Captures relationships too complex to express as simple rules
3. Recommendation systems: “If you hold these assets, you might also want…” (like Amazon/Netflix)
Welfare implications: If embeddings enable better portfolio construction at lower cost (learning from professionals without hiring them), they could extend sophisticated strategies to middle-class investors: furthering the democratization story.
Challenges:
1. Interpretability: Hard to explain “we recommend Tesla because of high embedding similarity to Apple.” Risk committees want intuitive explanations.
2. Data frequency: Holdings updated quarterly with 45-day lag. Not real-time like prices/characteristics.
3. Overfitting risk: With 50-300 embedding dimensions, it is easy to overfit. Requires rigorous validation (combinatorial purged cross-validation, CPCV, and multiple-testing corrections; see Week 10).
4. Regulatory concerns: Will regulators accept “black box” recommendations? Or require transparent, rule-based approaches?
Connection to Week 1: Remember ML in asset pricing (Gu, Kelly, and Xiu 2020)? Embeddings are another ML technique: representation learning applied to portfolio holdings.
Connection to Week 10: Production ML pipelines. If robo-advisers adopt embeddings, they need: retraining schedules (quarterly), drift detection (when holdings distributions shift), validation (out-of-sample testing).
Optional extension: Lab 10 includes an embeddings extension. You’ll implement a lightweight version using synthetic holdings data, validate predictions, and discuss governance tradeoffs.
Current adoption: Still primarily academic research (2024-2025). Some quantitative hedge funds experimenting. Not yet in retail robo-advisers. But watch this space: 5-10 year horizon for mainstream adoption if research continues validating performance.
Paradigm shift: From “economists design factors” to “algorithms learn patterns from data.” This mirrors NLP’s shift from hand-crafted features to word embeddings (Word2Vec, BERT). Same transformation reaching finance.
Realistic expectations: Won’t replace traditional approaches immediately. More likely hybrid: use embeddings to augment factor models, not replace them. Combine economic intuition with data-driven discovery.
Student engagement: “This is cutting-edge research (2024-2025 papers). You’re learning techniques at the frontier. If you pursue quantitative finance, you’ll see embeddings in job interviews and industry applications.”
Assessment relevance (if applicable): Embeddings can be a strong optional enhancement in a research-style project, demonstrating engagement with frontier research and technical sophistication.
Timing: 6-7 minutes for this slide. It’s conceptual preview, not technical deep-dive. The deep-dive is in Lab 10 and Chapter 4 reading.
Transition to summary: “That’s a preview of where the field is heading. Now let’s summarize today’s core takeaways.”

Key Takeaways

1. Robo-advisers automate MPT, reducing marginal cost to near-zero and expanding access to the $25K–$150K wealth band (Reher and Sokolinski 2024).

2. But estimation error is severe: a 90pp-wide 95% confidence interval on the “optimal” SPY weight (our Bloomberg data) is not a rounding error.

3. Solutions exist: rolling-window validation, Bayesian shrinkage, RMT denoising. Each addresses a specific failure mode.

4. At scale, NP-hard constraints and the exploration-convergence breakdown require a new generation of evolutionary algorithms (MOEAs).

5. The frontier (Liu et al. 2024): 15 seconds to optimise a 1,000-security universe. The gap between academic research and deployed products is narrowing fast.
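
Takeaway 2 can be reproduced in spirit with a small bootstrap. The sketch below resamples 60 months of synthetic two-asset returns and recomputes the unconstrained max-Sharpe weight each time. The return parameters are assumptions chosen for illustration; this does not use the Bloomberg data from the lecture and is not meant to reproduce the 90pp figure.

```python
import numpy as np

def tangency_weights(returns, rf=0.0):
    """Unconstrained max-Sharpe weights: inv(Sigma) @ (mu - rf), scaled to sum to 1."""
    mu = returns.mean(axis=0)
    sigma = np.cov(returns, rowvar=False)
    raw = np.linalg.solve(sigma, mu - rf)
    return raw / raw.sum()

# Synthetic monthly returns for two assets (assumed parameters, illustrative only)
rng = np.random.default_rng(42)
n_months = 60
vols = np.array([0.04, 0.03])
corr = np.array([[1.0, 0.6], [0.6, 1.0]])
cov = np.outer(vols, vols) * corr
returns = rng.multivariate_normal([0.006, 0.005], cov, size=n_months)

# Bootstrap: resample months with replacement and re-estimate the weight of asset 1
boot = np.empty(2000)
for b in range(boot.size):
    sample = returns[rng.integers(0, n_months, n_months)]
    boot[b] = tangency_weights(sample)[0]

lo, hi = np.percentile(boot, [2.5, 97.5])
print(f"95% bootstrap interval for asset-1 weight: [{lo:.2f}, {hi:.2f}]")
```

The interval is wide because five years of monthly data pin down expected returns very poorly. This is the failure mode that shrinkage and denoising in takeaway 3 are designed to address.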

Recap structure: Six key takeaways mapping to learning objectives.
Takeaway 1: Cost structure is the economic foundation: automation changes the game.
Takeaway 2: Algorithms aren’t magic: they implement well-known theory (MPT). The innovation is scale and accessibility.
Takeaway 3: Evidence is mixed: gains are real but modest. Not everyone benefits equally. Overhyped claims should be questioned.
Takeaway 4: Governance matters as much as algorithms. Suitability, bias, and transparency are critical for trust and sustainability.
Takeaway 5: Inclusion is partial, not universal. We’ve expanded the tent, but many remain outside.
Takeaway 6: The future is likely hybrid: robo-advisers for routine tasks, human advisers for complex planning. This combines efficiency and empathy.
Return to learning objectives: “Look back at the objectives. Can you explain each one?”
Next week preview: “Week 7 introduces cross-sectional machine learning: extending regression to ensemble methods for financial prediction.”
Final engagement: “Would you use a robo-adviser? Why or why not?”
Closing: “Robo-advisers are a success story in FinTech: they’ve genuinely expanded access and reduced costs. But they’re not a panacea. Critical evaluation is essential.”

References

See chapter bibliography for full citations.

Core readings:

Hilpisch (2019) Chapter 13 : Portfolio analytics and optimisation
Reher and Sokolinski (2024) : Empirical evidence on robo-adviser access and welfare
Gu, Kelly, and Xiu (2020) : Machine learning in empirical asset pricing
Vives (2019) : Digital disruption and FinTech taxonomy

Hilpisch 2019: Textbook for technical implementation. Chapter 13 covers portfolio optimisation using scipy.optimize. Code examples in Python.
Reher & Sokolinski 2024: Recent empirical paper (published in the Journal of Financial Economics). Uses quasi-experimental methods to estimate causal effects. Key evidence for this week.
Gu, Kelly & Xiu 2020: Foundational ML paper (published in Review of Financial Studies). Shows how ML improves asset pricing models. Optional but valuable for understanding algorithmic extensions.
Vives 2019: Review article on digital disruption in banking (Annual Review of Financial Economics). Provides taxonomy (process vs. product innovation). Good for framing robo-advisers in broader context.
Additional reading: Markowitz (1952) for original MPT; Fama-French for factor models; Black-Litterman for Bayesian portfolio optimisation.
Accessing papers: All are in the reading list on Moodle or available via the library. The Hilpisch book is the course textbook. Reher & Sokolinski and Gu, Kelly & Xiu are open-access on SSRN or the authors’ websites.
Student task: “Start with Reher & Sokolinski (empirical evidence), then Hilpisch Ch. 13 (technical implementation).”

Slides Bibliography

Boston Consulting Group. 2025. “Global Asset Management 2025: The Industry Hit a New Record High in 2024.” BCG report and press release. https://www.bcg.com/press/29april2025-global-asset-management-record-high-critical-turning-point.

Campbell, John Y. 2006. “Household Finance.” Journal of Finance 61 (4): 1553–1604. https://doi.org/10.1111/j.1540-6261.2006.00883.x.

DeMiguel, Victor, Lorenzo Garlappi, and Raman Uppal. 2009. “Optimal Versus Naive Diversification: How Inefficient Is the 1/n Portfolio Strategy?” The Review of Financial Studies 22 (5): 1915–53. https://doi.org/10.1093/rfs/hhm075.

Gabaix, Xavier, Ralph S. J. Koijen, Robert Richmond, and Motohiro Yogo. 2025. “Asset Embeddings.” Working Paper. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.4507511.

Gu, Shihao, Bryan Kelly, and Dacheng Xiu. 2020. “Empirical Asset Pricing via Machine Learning.” Review of Financial Studies. https://doi.org/10.1093/rfs/hhaa009.

Hilpisch, Yves. 2019. Python for Finance. 2nd ed. O’Reilly Media. https://www.oreilly.com/library/view/python-for-finance/9781492024330/.

Liu, Weilong, Yong Zhang, Kailong Liu, Barry Quinn, Xingyu Yang, and Qiao Peng. 2024. “Evolutionary Multi-Objective Optimisation for Large-Scale Portfolio Selection with Both Random and Uncertain Returns.” IEEE Transactions on Evolutionary Computation.

Philippon, Thomas. 2016. “The FinTech Opportunity.” Working Paper w22476. National Bureau of Economic Research. https://www.nber.org/system/files/working_papers/w22476/w22476.pdf.

Reher, Michael, and Stanislav Sokolinski. 2024. “Robo-Advisors and Access to Wealth Management.” Journal of Financial Economics 155: 103829. https://doi.org/10.1016/j.jfineco.2024.103829.

Sharpe, William F. 1994. “The Sharpe Ratio.” Journal of Portfolio Management 21 (1): 49–58. https://doi.org/10.3905/jpm.1994.409501.

Vives, Xavier. 2019. “Digital Disruption in Banking.” Annual Review of Financial Economics. https://doi.org/10.1146/annurev-financial-100719-120854.

Symbol	Meaning	Source
\(w\)	Weight vector: how much to allocate to each asset (what we choose)	Optimiser output
\(\mu\)	Expected return vector: one forecast per asset	Historical data / model
\(\Sigma\)	Covariance matrix: how assets move together	Historical data
\(r_f\)	Risk-free rate (e.g. UK gilt yield)	Market
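
As a quick check on the notation, the sketch below computes the three quantities the optimiser works with: expected return \(w^\top \mu\), volatility \(\sqrt{w^\top \Sigma w}\), and the Sharpe ratio \((w^\top \mu - r_f) / \sqrt{w^\top \Sigma w}\). The numbers are invented for illustration, and `portfolio_stats` is a hypothetical helper name.

```python
import numpy as np

def portfolio_stats(w, mu, Sigma, r_f):
    """Expected return, volatility and Sharpe ratio of a portfolio with weights w."""
    w = np.asarray(w, dtype=float)
    exp_ret = w @ mu                 # w' mu
    vol = np.sqrt(w @ Sigma @ w)     # sqrt(w' Sigma w)
    sharpe = (exp_ret - r_f) / vol   # excess return per unit of risk
    return exp_ret, vol, sharpe

# Illustrative annual inputs for two assets (made-up numbers)
w = np.array([0.6, 0.4])
mu = np.array([0.08, 0.03])
Sigma = np.array([[0.04, 0.002],
                  [0.002, 0.0025]])
r_f = 0.02

exp_ret, vol, sharpe = portfolio_stats(w, mu, Sigma, r_f)
print(f"return {exp_ret:.2%}, vol {vol:.2%}, Sharpe {sharpe:.2f}")
```

An optimiser searches over \(w\) to maximise the Sharpe ratio (or minimise volatility for a target return), so every input in the table other than \(w\) is an estimate whose error feeds straight into the output.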