To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
In phylogenetic analysis it is useful to study the distribution of the parsimony length of a tree under the null model, by which the leaves are independently assigned letters according to prescribed probabilities. Except in one special case, this distribution is difficult to describe exactly. Here we analyze this distribution by providing a recursive and readily computable description, establishing large deviation bounds for the parsimony length of a fixed tree on a single site and for the minimum length (maximum parsimony) tree over several sites. We also show that, under very general conditions, the former distribution converges asymptotically to the normal, thereby settling a recent conjecture. Furthermore, we show how the mean and variance of this distribution can be efficiently calculated. The proof of normality requires a number of new and recent results, as the parsimony length is not directly expressible as a sum of independent random variables, and so normality does not follow immediately from a standard central limit theorem.
It is known that a generalized open Jackson queueing network after appropriate scaling (in both time and space) converges almost surely to a fluid network under the uniform topology. Under the same topology, we show that the distance between the scaled queue length process of the queueing network and the fluid level process of the corresponding fluid network converges to zero in probability at an exponential rate.
As a model for an ATM switch we consider the overflow frequency of a queue that is served at a constant rate and in which the arrival process is the superposition of N traffic streams. We consider an asymptotic as N → ∞ in which the service rate Nc and buffer size Nb also increase linearly in N. In this regime, the frequency of buffer overflow is approximately exp(–NI(c, b)), where I(c, b) is given by the solution to an optimization problem posed in terms of time-dependent logarithmic moment generating functions. Experimental results for Gaussian and Markov modulated fluid source models show that this asymptotic provides a better estimate of the frequency of buffer overflow than ones based on large buffer asymptotics.
We study a model of a stochastic transportation system introduced by Crane. By adapting constructions of multidimensional reflected Brownian motion (RBM) that have since been developed for feedforward queueing networks, we generalize Crane's original functional central limit theorem results to a full vector setting, giving an explicit development for the case in which all terminals in the model experience heavy traffic conditions. We investigate product form conditions for the stationary distribution of our resulting RBM limit, and contrast our results for transportation networks with those for traditional queueing network models.
We analyse the queue QL at a multiplexer with L sources which may display long-range dependence. This includes, for example, sources modelled by fractional Brownian motion (FBM). The workload processes W due to each source are assumed to have large deviation properties of the form P[Wt/a(t) > x] ≈ exp[– v(t)K(x)] for appropriate scaling functions a and v, and rate-function K. Under very general conditions limL→xL–1 log P[QL > Lb] = – I(b), provided the offered load is held constant, where the shape function I is expressed in terms of the cumulant generating functions of the input traffic. For power-law scalings v(t) = tv, a(t) = ta (such as occur in FBM) we analyse the asymptotics of the shape function limb→xb–u/a(I(b) – δbv/a) = vu for some exponent u and constant v depending on the sources. This demonstrates the economies of scale available though the multiplexing of a large number of such sources, by comparison with a simple approximation P[QL > Lb] ≈ exp[−δLbv/a] based on the asymptotic decay rate δ alone. We apply this formula to Gaussian processes, in particular FBM, both alone, and also perturbed by an Ornstein–Uhlenbeck process. This demonstrates a richer potential structure than occurs for sources with linear large deviation scalings.
The distributions of nearest neighbour random walks on hypercubes in continuous time t 0 can be expressed in terms of binomial distributions; their limit behaviour for t, N → ∞ is well-known. We study here these random walks in discrete time and derive explicit bounds for the deviation of their distribution from their counterparts in continuous time with respect to the total variation norm. Our results lead to a recent asymptotic result of Diaconis, Graham and Morrison for the deviation from uniformity for N →∞. Our proofs use Krawtchouk polynomials and a version of the Diaconis–Shahshahani upper bound lemma. We also apply our methods to certain birth-and-death random walks associated with Krawtchouk polynomials.
In this paper, transient characteristics related to excursions of the occupation process of M/M/∞ queues are studied, when the excursion level is large and close to the mean offered load. We show that the classical diffusion approximation by an Ornstein–Uhlenbeck (OU) process captures well the average values of the transient variables considered, while the asymptotic distributions of these variables depart from those corresponding to the OU process. They exhibit, however, equivalent tail behaviour at infinity and numerical evidence shows that they are amazingly close to each other over the whole half-line.
We explore a dynamic approach to the problems of call admission and resource allocation for communication networks with connections that are differentiated by their quality of service requirements. In a dynamic approach, the amount of spare resources is estimated on-line based on feedbacks from the network's quality of service monitoring mechanism. The schemes we propose remove the dependence on accurate traffic models and thus simplify the tasks of supplying traffic statistics required of network users. In this paper we present two dynamic algorithms. The objective of these algorithms is to find the minimum bandwidth necessary to satisfy a cell loss probability constraint at an asynchronous transfer mode (ATM) switch. We show that in both schemes the bandwidth chosen by the algorithm approaches the optimal value almost surely. Furthermore, in the second scheme, which determines the point closest to the optimal bandwidth from a finite number of choices, the expected learning time is finite.
We consider the piecewise linear interpolation of Gaussian processes with continuous sample paths and stationary increments. The interrelation between the smoothness of the incremental variance function, d(t – s) = E[(X(t) – X(s))2], and the interpolation errors in mean square and uniform metrics is studied. The method of investigation can also be applied to the analysis of different methods of interpolation. It is based on some limit results for large deviations of a sequence of Gaussian non-stationary processes and related point processes. Non-stationarity in our case means mainly the local stationary condition for the sequence of correlation functions rn(t,s), n = 1, 2, ···, which has to hold uniformly in n. Finally, we discuss some examples and an application to the calculation of the distribution function of the maximum of a continuous Gaussian process with a given precision.
Rates of convergence to normality of O(N-½) are obtained for a standardized sum of m random variables selected at random from a finite set of N random variables in two cases. In the first case, the sum is randomly normed and the variables are not restricted to being independent. The second case is an alternative proof of a result due to von Bahr, which deals with independent variables. Both results derive from a rate obtained by Höglund in the case of sampling from a finite population.
Point-process and other techniques are used to make a comprehensive investigation of the almost-sure behaviour of partial maxima (the rth largest among a sample of n i.i.d. random variables), partial record values and differences and quotients involving them. In particular, we obtain characterizations of such asymptotic properties as a.s. for some finite constant c, or a.s. for some constant c in [0,∞], which tell us, in various ways, how quickly the sequences increase. These characterizations take the form of integral conditions on the tail of F, which furthermore characterize such properties as stability and relative stability of the sequence of maxima. We also develop their relation to the large-sample behaviour of trimmed sums, and discuss some statistical applications.
A new construction of the Gaussian distribution is introduced and proven. The procedure consists of using fractal interpolating functions, with graphs having increasing fractal dimensions, to transform arbitrary continuous probability measures defined over a closed interval. Specifically, let X be any probability measure on the closed interval I with a continuous cumulative distribution. And let fΘ,D:I → R be a deterministic continuous fractal interpolating function, as introduced by Barnsley (1986), with parameters Θ and fractal dimension for its graph D. Then, the derived measure Y = fΘ,D(X) tends to a Gaussian for all parameters Θ such that D → 2, for all X. This result illustrates that plane-filling fractal interpolating functions are ‘intrinsically Gaussian'. It explains that close approximations to the Gaussian may be obtained transforming any continuous probability measure via a single nearly-plane filling fractal interpolator.
Let X1, X2, ·· ·be stationary normal random variables with ρn = cov(X0, Xn). The asymptotic joint distribution of and is derived under the condition ρn log n → γ [0,∞). It is seen that the two statistics are asymptotically independent only if γ = 0.
Motivated by a statistical application, we consider continuum percolation in two or more dimensions, restricted to a large finite box, when above the critical point. We derive surface order large deviation estimates for the volume of the largest cluster and for its intersection with the boundary of the box. We also give some natural extensions to known, analogous results on lattice percolation.
In this paper we consider improvements in the rate of approximation for the distribution of sums of independent Bernoulli random variables via convolutions of Poisson measures with signed measures of specific type. As a special case, the distribution of the number of records in an i.i.d. sequence of length n is investigated. For this particular example, it is shown that the usual rate of Poisson approximation of O(1/log n) can be lowered to O(1/n2). The general case is discussed in terms of operator semigroups.
We derive upper bounds for the total variation distance, d, between the distributions of two random sums of non-negative integer-valued random variables. The main results are then applied to some important random sums, including cluster binomial and cluster multinomial distributions, to obtain bounds on approximating them to suitable Poisson or compound Poisson distributions. These bounds are generally better than the known results on Poisson and compound Poisson approximations. We also obtain a lower bound for d and illustrate it with an example.
In the Bayesian estimation of higher-order Markov transition functions on finite state spaces, a prior distribution may assign positive probability to arbitrarily high orders. If there are n observations available, we show (for natural priors) that, with probability one, as n → ∞ the Bayesian posterior distribution ‘discriminates accurately' for orders up to β log n, if β is smaller than an explicitly determined β0. This means that the ‘large deviations' of the posterior are controlled by the relative entropies of the true transition function with respect to all others, much as the large deviations of the empirical distributions are governed by their relative entropies with respect to the true transition function. An example shows that the result can fail even for orders β log n if β is large.
We investigate the asymptotic behaviour of the distribution of the number of comparisons needed by a quicksort-style selection algorithm that finds the lth smallest in a set of n numbers. Letting n tend to infinity and considering the values l = 1, ···,n simultaneously we obtain a limiting stochastic process. This process admits various interpretations: it arises in connection with a representation of real numbers induced by nested random partitions and also in connection with expected path lengths of a random walk in a random environment on a binary tree.
Grenander et al. (1991) proposed a conditional cyclic Gaussian Markov random field model for the edges of a closed outline in the plane. In this paper the model is recast as an improper cyclic Gaussian Markov random field for the vertices. The limiting behaviour of this model when the vertices become closely spaced is also described and in particular its relationship with the theory of ‘snakes' (Kass et al. 1987) is established. Applications are given in Grenander et al. (1991), Mardia et al. (1991) and Kent et al. (1992).
Stein's method is used to obtain two theorems on multivariate normal approximation. Our main theorem, Theorem 1.2, provides a bound on the distance to normality for any non-negative random vector. Theorem 1.2 requires multivariate size bias coupling, which we discuss in studying the approximation of distributions of sums of dependent random vectors. In the univariate case, we briefly illustrate this approach for certain sums of nonlinear functions of multivariate normal variables. As a second illustration, we show that the multivariate distribution counting the number of vertices with given degrees in certain random graphs is asymptotically multivariate normal and obtain a bound on the rate of convergence. Both examples demonstrate that this approach may be suitable for situations involving non-local dependence. We also present Theorem 1.4 for sums of vectors having a local type of dependence. We apply this theorem to obtain a multivariate normal approximation for the distribution of the random p-vector, which counts the number of edges in a fixed graph both of whose vertices have the same given color when each vertex is colored by one of p colors independently. All normal approximation results presented here do not require an ordering of the summands related to the dependence structure. This is in contrast to hypotheses of classical central limit theorems and examples, which involve for example, martingale, Markov chain or various mixing assumptions.