We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
In this chapter we study the first example of a correlated memoryless phenomenon: the famous “drunkard’s walk”, formally termed the random walk. We begin from a very simple case, in a homogeneous and isotropic space on a discrete hypercubic lattice. Then we add traps here and there. Eventually we make a foray into the continuous regime, with the Fokker–Planck diffusion equation (which, we see, is what physicists call a Schrödinger equation in imaginary time), and the stochastic differential Langevin equation.
This chapter quantitatively examines molecule numbers and reaction rates within a cell, along with thermal fluctuations and Brownian motion, from a mesoscopic perspective. Thermal fluctuations of molecules are pivotal in chemical reactions, protein folding, molecular motor systems, and so on. We introduce estimations of cell size and molecule numbers within cells, highlighting the possible significance of the minority of molecules. Describing their behaviors necessitates dealing with stochastic fluctuations, and the Gillespie algorithm, widely employed in Monte Carlo simulations for stochastic chemical reactions, is described. We elaborate on extrinsic and intrinsic noise in cells, and on why understanding how cells process fluctuations for sensing is crucial. To facilitate this comprehension, we revisit the fundamentals of statistics, including the law of large numbers and the central limit theorem. We derive the diffusion equation from random walk and confirm the dimensionality dependence of random walks, and elucidate Brownian motion as the continuous limit of random walk and explain the Einstein relation. As examples of the physiological significance of fluctuations in cell biology, we estimate the diffusion constant of proteins inside cells, diffusion-limited reactions, and introduce bacterial random walks and chemotaxis, and amoeboid movements of eukaryotic cells.
We develop explicit bounds for the tail of the distribution of the all-time supremum of a random walk with negative drift, where the increments have a truncated heavy-tailed distribution. As an application, we consider a ruin problem in the presence of reinsurance.
This chapter delves into the theory and application of reversible Markov Chain Monte Carlo (MCMC) algorithms, focusing on their role in Bayesian inference. It begins with the Metropolis–Hastings algorithm and explores variations such as component-wise updates, and the Metropolis-Adjusted Langevin Algorithm (MALA). The chapter also discusses Hamiltonian Monte Carlo (HMC) and the importance of scaling MCMC methods for high-dimensional models or large datasets. Key challenges in applying reversible MCMC to large-scale problems are addressed, with a focus on computational efficiency and algorithmic adjustments to improve scalability.
The payoff in the Chow–Robbins coin-tossing game is the proportion of heads when you stop. Stopping to maximize expectation was addressed by Chow and Robbins (1965), who proved there exist integers ${k_n}$ such that it is optimal to stop at n tosses when heads minus tails is ${k_n}$. Finding ${k_n}$ was unsolved except for finitely many cases by computer. We prove an $o(n^{-1/4})$ estimate of the stopping boundary of Dvoretsky (1967), which then proves ${k_n} = \left\lceil {\alpha \sqrt n \,\, - 1/2\,\, + \,\,\frac{{\left( { - 2\zeta (\! -1/2)} \right)\sqrt \alpha }}{{\sqrt \pi }}{n^{ - 1/4}}} \right\rceil $ except for n in a set of density asymptotic to 0, at a power law rate. Here, $\alpha$ is the Shepp–Walker constant from the Brownian motion analog, and $\zeta$ is Riemann’s zeta function. An $n^{ - 1/4}$ dependence was conjectured by Christensen and Fischer (2022). Our proof uses moments involving Catalan and Shapiro Catalan triangle numbers which appear in a tree resulting from backward induction, and a generalized backward induction principle. It was motivated by an idea of Häggström and Wästlund (2013) to use backward induction of upper and lower Value bounds from a horizon, which they used numerically to settle a few cases. Christensen and Fischer, with much better bounds, settled many more cases. We use Skorohod’s embedding to get simple upper and lower bounds from the Brownian analog; our upper bound is the one found by Christensen and Fischer in another way. We use them first for yet many more examples and a conjecture, then algebraically in the tree, with feedback to get much sharper Value bounds near the border, and analytic results. Also, we give a formula that gives the exact optimal stop rule for all n up to about a third of a billion; it uses the analytic result plus terms arrived at empirically.
Chapter 10 focuses on the time scale of art history, which spans centuries or even millennia. It begins by showing how art historical processes can be interpreted in terms of complex dynamic systems. It then discusses patterns of art historical change, from linear to stepwise to metastable change. The chapter addresses the question of continuity and discontinuity in art history from both quantitative and qualitative perspectives. Finally, it discusses how the concepts and methods of network theory can help us to understand the complex dynamics of artworlds.
The random walk is perhaps the simplest stochastic process one can think of. It is discrete time in the sense that it is defined at positive integer times only. We give the main features of this process, including its expectation and variance at any time, and establish the link between the probabilities of the values it can take at different times and the binomial distribution. Finally, we study a few transformations of the latter, including deterministic shifting or scaling. Taking the exponential of this linear transform yields the geometric random walk. Finally, we discuss how to change the time-step in order to obtain a process that is defined not only at integer times, but on every point of a given discrete-time grid. In all cases, we illustrate how the sample paths of the resulting process change. This stochastic process plays a central role in finance and is at the heart of the Cox–Ross–Rubinstein model.
We investigate the tail behavior of the first-passage time for Sinai’s random walk in a random environment. Our method relies on the connection between Sinai’s walk and branching processes with immigration in a random environment, and the analysis on some important quantities of these branching processes such as extinction time, maximum population, and total population.
In this chapter we move towards more subtle aspects of econometric analysis, where it is not immediately obvious from the numbers or the graphs that something is wrong. We see that so-called influential observations may not be visible from graphs but become apparent after creating a model. This is one of the key takeaways from this chapter – that we do not throw away data prior to econometric analysis. We should incorporate all observations in our models and, based on specific diagnostic measures, decide which observations are harmful.
In practice we do not always have clear guidance from economic theory about specifying an econometric model. At one extreme, it may be said that we should “let the data speak.” It is good to know that when they “speak” that what they say makes sense. We must be aware of a particularly important phenomenon in empirical econometrics: the spurious relationship. If you encounter a spurious relationship but do not recognize it as such, you may inadequately consider such a relationship for hypothesis testing or for the creation of forecasts. A spurious relationship appears when the model is not well specified. In this chapter, we see from a case study that people can draw strong but inappropriate conclusions if the econometric model is not well specified. We see that if you a priori hypothesize a structural break at a particular moment in time, and based on that very assumption analyze the data, then it is easy to draw inaccurate conclusions. As with influential observations, the lesson here is that one should first create an econometric model, and, given that model, investigate whether there could have been a structural break.
In this paper, we study random walks on groups that contain superlinear-divergent geodesics, in the line of thoughts of Goldsborough and Sisto. The existence of a superlinear-divergent geodesic is a quasi-isometry invariant which allows us to execute Gouëzel’s pivoting technique. We develop the theory of superlinear divergence and establish a central limit theorem for random walks on these groups.
Edited by
R. A. Bailey, University of St Andrews, Scotland,Peter J. Cameron, University of St Andrews, Scotland,Yaokun Wu, Shanghai Jiao Tong University, China
This is an introduction to representation theory and harmonic analysis on finite groups. This includes, in particular, Gelfand pairs (with applications to diffusion processes à la Diaconis) and induced representations (focusing on the little group method of Mackey and Wigner). We also discuss Laplace operators and spectral theory of finite regular graphs. In the last part, we present the representation theory of GL(2, Fq), the general linear group of invertible 2 × 2 matrices with coefficients in a finite field with q elements. More precisely, we revisit the classical Gelfand–Graev representation of GL(2, Fq) in terms of the so-called multiplicity-free triples and their associated Hecke algebras. The presentation is not fully self-contained: most of the basic and elementary facts are proved in detail, some others are left as exercises, while, for more advanced results with no proof, precise references are provided.
Research in recent years has highlighted the deep connections between the algebraic, geometric, and analytic structures of a discrete group. New methods and ideas have resulted in an exciting field, with many opportunities for new researchers. This book is an introduction to the area from a modern vantage point. It incorporates the main basics, such as Kesten's amenability criterion, Coulhon and Saloff-Coste inequality, random walk entropy and bounded harmonic functions, the Choquet–Deny Theorem, the Milnor–Wolf Theorem, and a complete proof of Gromov's Theorem on polynomial growth groups. The book is especially appropriate for young researchers, and those new to the field, accessible even to graduate students. An abundance of examples, exercises, and solutions encourage self-reflection and the internalization of the concepts introduced. The author also points to open problems and possibilities for further research.
We prove that the local time of random walks conditioned to stay positive converges to the corresponding local time of three-dimensional Bessel processes by proper scaling. Our proof is based on Tanaka’s pathwise construction for conditioned random walks and the derivation of asymptotics for mixed moments of the local time.
In water-rich smectite gels, bound or less mobile H2O layers exist near negatively-charged clay platelets. These bound H2O layers are obstacles to the diffusion of unbound H2O molecules in the porespace, and therefore reduce the H2O self-diffusion coefficient, D, in the gel system as a whole. In this study, the self-diffusion coefficients of H2O molecules in water-rich gels of Na-rich smectites (montmorillonite, stevensite and hectorite) were measured by pulsed-gradient spin-echo proton nuclear magnetic resonance (NMR) to evaluate the effects of obstruction on D. The NMR results were interpreted using random-walk computer simulations which show that unbound H2O diffuses in the gels while avoiding randomly-placed obstacles (clay platelets sandwiched in immobilized bound H2O layers). A ratio (volume of the clay platelets and immobilized H2O layers)/(volume of clay platelets) was estimated for each water-rich gel. The results showed that the ratio was 8.92, 16.9, 3.32, 3.73 and 3.92 for Wyoming montmorillonite (⩽ 5.74 wt.% clay), Tsukinuno montmorillonite (⩽ 3.73 wt.% clay), synthetic stevensite (⩽ 8.97 wt.% clay), and two synthetic hectorite samples (⩽ 11.0 wt.% clay), respectively. The ratios suggest that the thickness of the immobilized H2O layers in the gels is 4.0, 8.0, 1.2, 1.4 and 1.5 nm, respectively, assuming that each clay particle in the gels consists of a single 1 nm-thick platelet. The present study confirmed that the obstruction effects of immobilized H2O layers near the clay surfaces are important in restricting the self-diffusion of unbound H2O in water-rich smectite gels.
We investigate the translation lengths of group elements that arise in random walks on the isometry groups of Gromov hyperbolic spaces. In particular, without any moment condition, we prove that non-elementary random walks exhibit at least linear growth of translation lengths. As a corollary, almost every random walk on mapping class groups eventually becomes pseudo-Anosov, and almost every random walk on $\mathrm {Out}(F_n)$ eventually becomes fully irreducible. If the underlying measure further has finite first moment, then the growth rate of translation lengths is equal to the drift, the escape rate of the random walk.
We then apply our technique to investigate the random walks induced by the action of mapping class groups on Teichmüller spaces. In particular, we prove the spectral theorem under finite first moment condition, generalizing a result of Dahmani and Horbez.
We consider an SIR (susceptible $\to$ infective $\to$ recovered) epidemic in a closed population of size n, in which infection spreads via mixing events, comprising individuals chosen uniformly at random from the population, which occur at the points of a Poisson process. This contrasts sharply with most epidemic models, in which infection is spread purely by pairwise interaction. A sequence of epidemic processes, indexed by n, and an approximating branching process are constructed on a common probability space via embedded random walks. We show that under suitable conditions the process of infectives in the epidemic process converges almost surely to the branching process. This leads to a threshold theorem for the epidemic process, where a major outbreak is defined as one that infects at least $\log n$ individuals. We show further that there exists $\delta \gt 0$, depending on the model parameters, such that the probability that a major outbreak has size at least $\delta n$ tends to one as $n \to \infty$.
Chapter 4 introduces the molecular diffusion concept and Fick’s Law to explain the mixing phenomena at a small-scale CV in the distributed models rather than the large CV of the well-mixed model. For this purpose, it begins with describing diffusion phenomena, then formulating Fick’s law and developing the diffusion equation. Subsequently, examining the random velocity of Brownian particles and their pure random walk, we articulate the probabilistic nature of the molecular diffusion process and the reason why Fick’s Law is an ensemble mean law. Next, analytical solutions to the diffusion equation for various types of inputs are introduced. The advection-dispersion equation (ADE) formulation then follows, which couples the effect of fluid motion at fluid continuum scale and random motion of fluid molecules at the molecular scale to quantify solute migration. Likewise, we present analytical solutions to the ADE for several input forms and discuss snapshots and breakthroughs for different input forms.
Higher-order networks aim at improving the classical network representation of trajectories data as memory-less order $1$ Markov models. To do so, locations are associated with different representations or “memory nodes” representing indirect dependencies between visited places as direct relations. One promising area of investigation in this context is variable-order network models as it was suggested by Xu et al. that random walk-based mining tools can be directly applied on such networks. In this paper, we focus on clustering algorithms and show that doing so leads to biases due to the number of nodes representing each location. To address them, we introduce a representation aggregation algorithm that produces smaller yet still accurate network models of the input sequences. We empirically compare the clustering found with multiple network representations of real-world mobility datasets. As our model is limited to a maximum order of $2$, we discuss further generalizations of our method to higher orders.
In this article we introduce a simple tool to derive polynomial upper bounds for the probability of observing unusually large maximal components in some models of random graphs when considered at criticality. Specifically, we apply our method to a model of a random intersection graph, a random graph obtained through p-bond percolation on a general d-regular graph, and a model of an inhomogeneous random graph.