  • Xavier D’Haultfoeuille and Arnaud Maurel


It is often believed that without instruments, endogenous sample selection models are identified only if a covariate with a large support is available (see, e.g., Chamberlain, 1986, Journal of Econometrics 32, 189–218; Lewbel, 2007, Journal of Econometrics 141, 777–806). We propose a new identification strategy mainly based on the condition that the selection variable becomes independent of the covariates for large values of the outcome. No large support on the covariates is required. Moreover, we prove that this condition is testable. We finally show that our strategy can be applied to the identification of generalized Roy models.


Corresponding author

*Address correspondence to Xavier D’Haultfoeuille, CREST, 15 boulevard Gabriel Péri, 92 240 Malakoff, France; e-mail:


Abbring, J. (2010) Identification of dynamic discrete choice models. Annual Review of Economics 2, 367–394.
Abbring, J. & van den Berg, G. (2003) The identifiability of the mixed proportional hazards competing risks model. Journal of the Royal Statistical Society, Series B 65, 701–710.
Andrews, D.K. & Schafgans, M. (1998) Semiparametric estimation of the intercept of a sample selection model. Review of Economic Studies 65, 497–517.
Bayer, P.J., Khan, S., & Timmins, C. (2011) Nonparametric identification and estimation in a Roy model with common nonpecuniary returns. Journal of Business & Economic Statistics 29, 201–215.
Borjas, G. (1987) Self-selection and the earnings of immigrants. American Economic Review 77, 531–553.
Carneiro, P., Hansen, K., & Heckman, J. (2003) Estimating distributions of treatment effects with an application to the returns to schooling and measurement of the effects of uncertainty on college choice. International Economic Review 44, 361–422.
Chamberlain, G. (1986) Asymptotic efficiency in semiparametric models with censoring. Journal of Econometrics 32, 189–218.
Dagsvik, J. & Strøm, S. (2006) Sectoral labour supply, choice restrictions and functional form. Journal of Applied Econometrics 21, 803–826.
D’Haultfœuille, X. & Maurel, A. (2011) Inference on an Extended Roy Model, with an Application to Schooling Decisions in France. Working paper, Duke University.
Elbers, C. & Ridder, G. (1982) True and spurious duration dependence: The identifiability of the proportional hazard model. Review of Economic Studies 49, 403–409.
Heckman, J.J. (1974) Shadow prices, market wages, and labor supply. Econometrica 42, 679–694.
Heckman, J.J. (1990) Varieties of selection bias. American Economic Review 80, 313–318.
Heckman, J.J. & Honore, B. (1989) The identifiability of competing risks models. Biometrika 76, 325–330.
Heckman, J. & Vytlacil, E. (2005) Structural equations, treatment effects, and econometric policy evaluation. Econometrica 73, 669–738.
Khan, S. & Tamer, E. (2010) Irregular identification, support conditions and inverse weight estimation. Econometrica 78, 2021–2042.
Lee, S. (2006) Identification of a competing risks model with unknown transformations of latent failure times. Biometrika 93, 996–1002.
Lee, S. & Lewbel, A. (2011) Nonparametric Identification of Accelerated Failure Time Competing Risks Models. Working paper, Boston College.
Lewbel, A. (2007) Endogenous selection or treatment model estimation. Journal of Econometrics 141, 777–806.
Roy, A.D. (1951) Some thoughts on the distribution of earnings. Oxford Economic Papers (New Series) 3, 135–146.
Schafgans, M. & Zinde-Walsh, V. (2002) On intercept estimation in the sample selection model. Econometric Theory 18, 40–50.
Vella, F. (1998) Estimating models with sample selection bias: A survey. Journal of Human Resources 33, 127–169.
Willis, R. & Rosen, S. (1979) Education and self-selection. Journal of Political Economy 87, S7–S36.

Econometric Theory
  • ISSN: 0266-4666
  • EISSN: 1469-4360
  • URL: /core/journals/econometric-theory