ai/gateway/knowledge/trading/strategies/stocks/alpha-combos.md

---
description: "Combines hundreds of thousands of weak individual alpha signals into a single tradeable mega-alpha by optimizing combination weights using a structured 11-step procedure based on demeaned returns and regression residuals."
tags: [stocks, machine-learning, alpha-combination, quantitative]
---

# Alpha Combos

**Section**: 3.20 | **Asset Class**: Stocks | **Type**: Machine Learning / Quantitative / Alpha Combination

## Overview
Alpha combo strategies combine a large number of weak quantitative trading signals ("alphas") into a single tradeable "mega-alpha" portfolio. Each individual alpha is too faint to trade profitably on its own after costs, but a sufficiently large combination can generate a viable signal. The combination weights are optimized using a structured procedure that handles serial correlation, cross-sectional demeaning, and risk normalization.

## Construction / Signal
Assume N alphas (possibly hundreds of thousands), all trading the same universe of ~2,500 liquid U.S. stocks. Each alpha produces desired stock holdings at a sequence of times t_1, t_2, .... The procedure for fixing combination weights w_i [Kakushadze and Yu, 2017b]:

1. Start with time series of realized alpha returns `R_is`, i=1,...,N, s=1,...,M+1.
2. Calculate serially demeaned returns:
   ```
   X_is = R_is - (1/(M+1)) * sum_{s=1}^{M+1} R_is
   ```
3. Calculate sample variances of alpha returns:
   ```
   sigma_i^2 = (1/M) * sum_{s=1}^{M+1} X_is^2
   ```
4. Calculate normalized demeaned returns:
   ```
   Y_is = X_is / sigma_i
   ```
5. Keep only the first M columns: Y_is, s=1,...,M.
6. Cross-sectionally demean Y_is:
   ```
   Lambda_is = Y_is - (1/N) * sum_{j=1}^{N} Y_js
   ```
7. Keep only the first M-1 columns: Lambda_is, s=1,...,M-1.
8. Compute expected alpha returns E_i and normalize:
   ```
   E_i = (1/d) * sum_{s=1}^{d} R_is                     (360)
   E_tilde_i = E_i / sigma_i
   ```
   (d-day moving average; d need not equal T)
9. Calculate residuals `epsilon_tilde_i` of regression (no intercept, unit weights) of `E_tilde_i` over `Lambda_is`.
10. Set alpha portfolio weights:
    ```
    w_i = eta * epsilon_tilde_i / sigma_i
    ```
11. Set normalization coefficient eta such that:
    ```
    sum_{i=1}^{N} |w_i| = 1
    ```

## Entry / Exit Rules
- **Entry**: At each rebalance time, recompute combination weights w_i using the 11-step procedure and establish positions accordingly.
- **Exit**: Positions are updated at each rebalance; individual alpha positions change according to the alpha's own signals, and the overall mega-alpha weight adjusts.
- **Holding period**: Determined by individual alpha holding periods (typically daily, from close to close).

## Key Parameters
- **Number of alphas N**: Can be hundreds of thousands or millions
- **Return history M+1**: Number of time periods used for variance estimation
- **Expected return window d**: Number of days for moving average of alpha returns (Eq. 360; d need not equal M)
- **Universe**: Typically ~2,500 most liquid U.S. stocks
- **Alpha returns R_is**: Daily alpha returns from close to close

## Variations
- **Fewer alphas**: The procedure scales from a few dozen to millions of alphas
- **Different expected return estimator**: Instead of d-day moving average, other estimators for E_i can be used
- **Multiple universes**: Extend to different stock universes or asset classes

## Notes
- Individual alphas are "ubiquitous, faint, and ephemeral" — their signal is too weak to trade profitably alone due to transaction costs.
- The 11-step procedure handles: serial correlation (steps 1-2), scale normalization (steps 3-4), cross-sectional neutrality (steps 5-7), expected return estimation (step 8), residualization (step 9), and final weight normalization (steps 10-11).
- "Alpha" here follows the practitioner definition: any reasonable expected return signal, not necessarily Jensen's alpha.
- 101 explicit examples of such quantitative alphas are given in Kakushadze (2016).
- This is a cross-sectional multi-stock strategy requiring significant data infrastructure.
- Typical holding period: daily (overnight or close-to-close).