Hostname: page-component-77f85d65b8-g98kq Total loading time: 0 Render date: 2026-03-30T09:33:31.828Z Has data issue: false hasContentIssue false

Optimal Portfolio Size Under Parameter Uncertainty

Published online by Cambridge University Press:  04 December 2025

Nathan Lassance
Affiliation:
UCLouvain LFIN/LIDAM nathan.lassance@uclouvain.be
Rodolphe Vanderveken*
Affiliation:
UCLouvain LFIN/LIDAM
Frédéric Vrins
Affiliation:
UCLouvain LFIN/LIDAM Decision Science Department, HEC Montréal frederic.vrins@uclouvain.be
*
rodolphe.vanderveken@uclouvain.be (corresponding author)
Rights & Permissions [Opens in a new window]

Abstract

We introduce a method to determine the investor’s optimal portfolio size that maximizes the expected out-of-sample utility under parameter uncertainty. This portfolio size trades off between accessing investment opportunities and limiting the number of estimated parameters. Unlike sparse methods such as lasso, which exclude assets during the optimization step, our approach fixes the optimal number of assets before optimizing the portfolio weights, which improves robustness and provides greater flexibility in practical implementations. Empirically, our size-optimized portfolios outperform their counterparts applied to all available assets. Our methodology renders portfolio theory valuable even when the data-set dimension and sample size are comparable.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of the Michael G. Foster School of Business, University of Washington
Figure 0

FIGURE 1 Utility of the Mean–Variance Portfolio as a Function of $ N $ and$ \rho $Figure 1 depicts $ U\left({\boldsymbol{w}}^{\star}\right) $ in equation (4) as a function of the portfolio size $ N $ under the assumption that asset returns are equicorrelated with a correlation $ \rho \in \left\{\mathrm{0.2,0.5,0.8}\right\} $. We calibrate the assets’ monthly Sharpe ratios to a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023. Starting with $ N=1 $ asset chosen randomly, we compute $ U\left({\boldsymbol{w}}^{\star}\right) $. Then, we add a randomly selected asset not previously selected and compute $ U\left({\boldsymbol{w}}^{\star}\right) $ again. We continue this procedure until $ N=M $. We repeat this procedure $ \mathrm{10,000} $ times and depict the average $ U\left({\boldsymbol{w}}^{\star}\right) $ over all draws. We consider a risk-aversion coefficient $ \gamma =1 $.

Figure 1

FIGURE 2 Optimal Portfolio Size for the Sample Mean–Variance (SMV) portfolio and the Optimal Two-Fund Rule (2F)Figure 2 depicts the optimal portfolio size for the SMV portfolio, $ {N}_{smv}^{\star } $ in (17) (Graph A), and for the optimal 2F, $ {N}_{2f}^{\star } $ in (18) (Graph B), as a function of the sample size $ T $. Each line represents a different choice of the correlation, $ \rho =\left\{\mathrm{0.2,0.5,0.8}\right\} $, and the degrees of freedom of the $ t $-distribution, $ \nu =\left\{6,\infty \right\} $. We calibrate $ {\overline{\boldsymbol{\theta}}}_M=\left(\mathrm{0.125,0.0169}\right) $ to a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023.

Figure 2

FIGURE 3 Estimated Optimal Portfolio Size in Simulated DataFigure 3 depicts the boxplots of the estimated optimal portfolio size $ N $ for the sample mean–variance portfolio $ \left({\hat{N}}_{smv}^{\star}\right) $, the two-fund rule $ \Big({\hat{N}}_{2f}^{\star } $), the GMV-three-fund rule $ \left({\hat{N}}_{3f,g}^{\star}\right) $, and the EW-three-fund rule $ \left({\hat{N}}_{3f, ew}^{\star}\right) $. These are the estimated counterparts of $ {N}_{smv}^{\star } $ in (17), $ {N}_{2f}^{\star } $ in (18), $ {N}_{3f,g}^{\star } $ in (A.10), and $ {N}_{3f, ew}^{\star } $ in (A.19) following the estimation methodology in Appendix A.II. The boxplots are obtained by simulating $ \mathrm{10,000} $ times $ T=120 $$ t $-distributed returns. Using a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023, we set $ {\boldsymbol{\mu}}_M={\hat{\boldsymbol{\mu}}}_{96} $ and $ {\boldsymbol{\Sigma}}_M\in \left\{{\hat{\boldsymbol{\Sigma}}}_{96},{\hat{\boldsymbol{\Sigma}}}_{96}\left(\overline{\rho}\right)\right\} $, where $ {\hat{\boldsymbol{\Sigma}}}_{96}\left(\overline{\rho}\right) $ is an equicorrelation covariance matrix and $ \overline{\rho}=0.74 $ is the average of all correlations in $ {\hat{\boldsymbol{\Sigma}}}_{96} $. We consider $ \nu \in \left\{6,\infty \right\} $ degrees of freedom. Each boxplot corresponds to a different choice of $ \left(\nu, {\boldsymbol{\Sigma}}_M\right) $. We depict with crosses the oracle value of the optimal $ N $ that is known under Assumption 1 (i.e., when $ {\boldsymbol{\Sigma}}_M $ is of the form $ {\boldsymbol{\Sigma}}_M\left(\rho \right) $).

Figure 3

FIGURE 4 Impact of Correlation $ \rho $ on Optimal Portfolio Size in Simulated DataFigure 4 depicts the boxplots of the estimated optimal portfolio size $ N $ for the sample mean–variance portfolio (Graph A, $ {\hat{N}}_{smv}^{\star } $), the two-fund rule (Graph B, $ {\hat{N}}_{2f}^{\star } $), the GMV-three-fund rule (Graph C, $ {\hat{N}}_{3f,g}^{\star } $), and the EW-three-fund rule (Graph D, $ {\hat{N}}_{3f,ew}^{\star } $). These are the estimated counterparts of $ {N}_{smv}^{\star } $ in (17), $ {N}_{2f}^{\star } $ in (18), $ {N}_{3f,g}^{\star } $ in (A.10), and $ {N}_{3f, ew}^{\star } $ in (A.19) following the estimation methodology in Appendix A.II. The boxplots are obtained by simulating $ \mathrm{10,000} $ times $ T=120 $$ t $-distributed returns. Using a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023, we set $ {\boldsymbol{\mu}}_M={\hat{\boldsymbol{\mu}}}_{96} $ and $ {\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96}\left(\rho \right) $, where $ {\hat{\boldsymbol{\Sigma}}}_{96}\left(\overline{\rho}\right) $ is an equicorrelation covariance matrix and $ \rho $ varies between 0.1 and 0.9 with a step size of 0.1. We consider $ \nu =6 $ degrees of freedom. We depict with dotted lines and crosses the oracle value $ {N}^{\star } $ of the optimal $ N $ that is known under Assumption 1 (i.e., when $ {\boldsymbol{\Sigma}}_M $ is of the form $ {\boldsymbol{\Sigma}}_M\left(\rho \right) $).

Figure 4

FIGURE 5 Expected Out-of-Sample Utility in Simulated Data $ \left(\nu =6,{\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96}\right) $Figure 5 depicts the expected out-of-sample utility (EU) of the sample mean–variance portfolio (SMV; Graph A), the two-fund rule (2F; Graph B), the GMV-three-fund rule (3FGMV; Graph C), and the EW-three-fund rule (3FEW; Graph B) as a function of the portfolio size $ N $. We simulate $ \mathrm{10,000} $ samples of $ t $-distributed returns with $ \nu =6 $ degrees of freedom. For each $ N $, we conduct a rolling window exercise as described in Section IV.B, and in each rolling window, we randomly select the $ N $ assets. Using a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023, we set $ {\boldsymbol{\mu}}_M={\hat{\boldsymbol{\mu}}}_{96} $ and $ {\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96} $. In each graph, the solid blue line depicts the EU in (21), the shaded gray area depicts the one-sigma interval around the EU across simulations, and the dashed horizontal blue line depicts the EU obtained when using the estimated optimal $ N $ (i.e., $ {\hat{N}}_{smv}^{\star } $ for the SMV portfolio, $ {\hat{N}}_{2f}^{\star } $ for 2F, $ {\hat{N}}_{3f,g}^{\star } $ for 3FGMV, and $ {\hat{N}}_{3f, ew}^{\star } $ for 3FEW). The dash-dotted red line depicts the EU of the equal-weighted portfolio. The dotted horizontal gray line depicts the zero EU level. The risk-aversion coefficient is $ \gamma =1 $.

Figure 5

FIGURE 6 Expected Out-of-Sample Utility and Estimated Portfolio Size of Soft- and Hard-Thresholding of the Sample Mean–Variance Portfolio in Simulated Data $ \left(\nu =6,{\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96}\right) $Figure 6 depicts the expected out-of-sample utility (EU) and the estimated optimal portfolio size of the soft- and hard-thresholding versions of the sample mean–variance (SMV) portfolio, SMV-ST and SMV-HT, described in Section IV.C. We depict these as a function of the number of assets $ N $ on which SMV-ST and SMV-HT are estimated. We simulate $ \mathrm{10,000} $ samples of $ t $-distributed returns with $ \nu =6 $ degrees of freedom. For each $ N $, we conduct a rolling window exercise as described in Section IV.B, and in each rolling window, we randomly select the $ N $ assets. Using a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023, we set $ {\boldsymbol{\mu}}_M={\hat{\boldsymbol{\mu}}}_{96} $ and $ {\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96} $. In Graphs A and C, the solid blue lines depict the EU of SMV-ST or SMV-HT in (21), the shaded gray areas depict the one-sigma interval around the EU across simulations, the dash-dotted red lines depict the EU of the equal-weighted portfolio, and the dotted horizontal gray lines depict the zero EU level. The risk-aversion coefficient is $ \gamma =1 $. In Graphs B and D, the blue crosses depict the average estimated optimal portfolio size.

Figure 6

FIGURE 7 Expected Out-of-Sample Utility of Factor-Plus-Alpha Strategy in Simulated Data $ \left(\nu =6,{\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96}\right) $Figure 7 depicts the expected out-of-sample utility (EU) of the factor-plus-alpha (F+A) strategy, described in Section OA.4 of the Supplementary Material, as well as the two-fund rule, as a function of the portfolio size $ N $. We simulate $ \mathrm{10,000} $ samples of $ t $-distributed returns with $ \nu =6 $ degrees of freedom. For each $ N $, we conduct a rolling window exercise as described in Section IV.B, and in each rolling window, we randomly select the $ N $ assets out of the $ M $ available ones. Using a data set of $ M=96 $ portfolios sorted on size and book-to-market spanning July 1963 to August 2023, we set $ {\boldsymbol{\mu}}_M={\hat{\boldsymbol{\mu}}}_{96} $ and $ {\boldsymbol{\Sigma}}_M={\hat{\boldsymbol{\Sigma}}}_{96} $. In each graph, the solid blue and dashed green lines depict the EU in (21) of F+A and 2F, respectively, the shaded gray area depicts the one-sigma interval around the EU of F+A across all simulations, the dash-dotted red line depicts the EU of the equal-weighted portfolio, and the dotted horizontal gray line depicts the zero EU level. In Graph A, the sample size $ T=120 $ is fixed. In Graph B, the sample size increases with $ N $ as $ T=60+2N $. The risk-aversion coefficient is $ \gamma =1 $.

Figure 7

TABLE 1 List of Data Sets Considered in the Empirical Analysis

Figure 8

TABLE 2 List of Portfolio Strategies Considered in the Empirical Analysis

Figure 9

TABLE 3 List of Asset Selection Rules Considered in the Empirical Analysis

Figure 10

TABLE 4 Annualized Net Out-of-Sample Utility in Empirical Data

Figure 11

FIGURE 8 Difference in Net Out-of-Sample Utility Relative to Investing in All AssetsFigure 8 depicts, for three different portfolio strategies, the difference between the annualized net out-of-sample utility in percentage points obtained when implementing the portfolios on a subset of $ N $ assets, where the optimal $ N $ is estimated using our theory, using 10 asset selection rules relative to the case where the portfolios are implemented on all $ M $ assets. The three portfolio strategies are the two-fund rule (blue), the GMV-three-fund rule (green), and the EW-three-fund rule (red). Each graph considers one of the 10 asset selection rules described in Table 3. This figure is constructed following the methodology described in Section V.D. We consider seven data sets described in Table 1. We estimate the portfolios with the linear shrinkage covariance matrix of Ledoit and Wolf (2004). The net out-of-sample utility is computed using rolling windows, a sample size $ T=120 $ months, and proportional transaction costs of 10 basis points. The risk-aversion coefficient is $ \gamma =1 $.

Supplementary material: File

Lassance et al. supplementary material

Lassance et al. supplementary material
Download Lassance et al. supplementary material(File)
File 620.3 KB