To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
This chapter introduces several basic tools in high-dimensional probability: decoupling, concentration for quadratic forms (the Hanson–Wright inequality), symmetrization, and contraction. These techniques are illustrated through estimates of the operator norm of a random matrix. This is applied to matrix completion, where the goal is to recover a low-rank matrix from a random subset of its entries. Exercises explore variants of the Hanson–Wright inequality, mean estimation, concentration of the norm for anisotropic random vectors, distances to subspaces, graph cutting, the concept of type in normed spaces, non-Euclidean versions of the approximate Caratheodory theorem, and covariance estimation.
This chapter introduces financial risk as a random variable representing uncertain future gains or losses. A risk measure quantifies this uncertainty by mapping random variables to real numbers. The prominent example of Value-at-Risk (V@R) is discussed.
The Average-Value-at-Risk (AV@R) has emerged as a superior, coherent risk measure that accounts for the magnitude of potential losses beyond a given quantile and consistently favors diversification.
This chapter begins the study of random vectors in high dimensions, starting by showing their norm concentrates. We give a probabilistic proof of the Grothendieck inequality and apply it to semidefinite optimization. We explore a semidefinite relaxation for the maximum cut, presenting the Goemans–Williamson randomized approximation algorithm. We also give an alternative proof of the Grothendieck inequality with nearly the best known constant using the kernel trick, a method widely used in machine learning. The exercises explore invariant ensembles of random matrix theory, various versions of the Grothendieck inequality, semidefinite relaxations, and the notion of entropy.
This chapter presents the main ideas behind measuring the risks of random vectors. The main point is that it may be possible to transfer assets between components of a vector, and so the risk measure becomes a convex set in Euclidean space.
Chapter 2 focuses on background information that is essential to understanding SEMs. This includes providing the general structural equation model that appears throughout the book along with definitions of the notation and the assumptions of the model. The chapter introduces path diagram symbols and their relation to the equation form of the model. It also describes differences between endogenous and exogenous variables and observed and latent variables for both continuous and categorical variables. In addition, the chapter introduces the problems of missing data, outliers and influential cases, and multiple significance testing, issues that are common in all types of models. Finally, basic rules of expected values, variances, and covariances are part of the chapter.
This chapter develops a non-asymptotic theory of random matrices. It starts with a quick refresher on linear algebra, including the perturbation theory for matrices and featuring a short proof of the Davis–Kahan inequality. Three key concepts are introduced – nets, covering numbers, and packing numbers – and linked to volume and error-correcting codes. Bounds on the operator norm and singular values of random matrices are established. Three applications are given: community detection in networks, covariance estimation, and spectral clustering. Exercises explore the power method to compute the top singular value, the Schur bound on the operator norm, Hermitian dilation,Walsh matrices, the Wedin theorem on matrix perturbations, a semidefinite relaxation of the cut norm, the volume of high-dimensional balls, and Gaussian mixture models.
Models with multiple equations rather than a single equation are the subject of Chapter 4. It covers model specification, implied moments, model identification, model estimation, and model interpretation, fit, and diagnostics in the context of such models. The consequences of measurement error and the treatment of mediation effects are part of the chapter. Finally, the chapter compares simultaneous equation models and Directed Acyclic Graphs (DAGs).