William Parsons (1800–67), third Earl of Rosse, was responsible for building, in 1845, the largest telescope of his time, nicknamed the 'Leviathan'. It enabled the Earl to make unprecedented astronomical discoveries, including the discovery of the spiral nature of galaxies. Rosse (then Lord Oxmantown) began publishing scientific papers on telescopes in 1828, and for the rest of his life made regular contributions to scientific journals in Ireland, England and Scotland. He served as President of the British Association for the Advancement of Science in 1843, and of the Royal Society from 1848 to 1854, and his addresses to those societies are also included in this collection. Edited by his younger son, the engineer Sir Charles Parsons (1854–1931), and published in 1926, these papers show the wide range of the Earl's interests, from astronomy and telescopes to ancient bronze artefacts and the use of iron in shipbuilding.
Furnished with more than a hundred figures, maps and tables, this book was first published in 1878 by Simon Newcomb (1835–1909), a noted mathematician and professor at the United States Naval Observatory. A meticulous work, originally intended to be of use to the general reader as well as the student, it provides a view of astronomy as it stood on the eve of General Relativity, and inevitably includes some theories which have since been disproved. Newcomb outlines a brief history of astronomy, from ancient Greece (when the planets were thought to be fixed in crystal spheres), to the application of the new laws of thermodynamics and the latest observations of the solar system. Included are a rejection of the then prevalent theory that the sun has a cool interior and its own inhabitants, details of the anomaly of Mercury's orbit according to Newtonian theory, and thorough observational guides.
John Flamsteed (1646–1719) was the first Astronomer Royal, appointed to the newly founded Greenwich Observatory. Charged with improving navigation at sea, he used meticulous telescopic observations to compile a 'Catalogue of British Stars', radically updating Tycho Brahe's previous naked-eye calculations. However, he delayed publishing, leading to a vituperative quarrel with his contemporaries Newton and Halley, who published his results without his permission. Flamsteed managed to destroy most of that edition - his own was published posthumously - but his reputation was damaged. A century later, Francis Baily (1774–1844), a stockbroker who became President of the Royal Astronomical Society, rediscovered Flamsteed's papers, including autobiographical writings and extensive correspondence. Their publication in this volume, along with a revised version of the catalogue, rehabilitated Flamsteed's reputation and restated the importance of methodical observation in astronomy. Today, this book illuminates both the social context of Flamsteed's work and the intellectual climate of Baily's London.
A clear understanding of the Earth's past evolution can provide the key to its possible future development. The Earth: Its Birth and Growth explores the evolution of the Earth over 4.6 billion years using basic reasoning and simple illustrations to help explain the underlying physical and chemical principles and major processes involved. Fully updated and revised, this rigorous but accessible second edition includes three completely new chapters. It incorporates exciting developments in isotope geology, placing results within a wider framework of Earth evolution and plate tectonics. Some background in physics and chemistry is assumed, but basic theories and processes are explained concisely in self-contained sections. Key research papers and review articles are fully referenced. This book is ideal as supplementary reading for undergraduate and graduate students in isotope geochemistry, geodynamics, plate tectonics and planetary science. It also provides an enjoyable overview of Earth's evolution for professional scientists and general readers.
In Chapter 2, I recount a brief history of the discovery of the major maser species. This chapter is designed to be understood by a reader with little mathematical knowledge. For those more interested in the technical details relating to radio telescopes, I refer the reader forward to Chapter 4, and particularly to Section 4.1.
During the late 1950s, technological developments allowed radio telescopes to observe at frequencies above 1 GHz (the L-band in radar terminology). A key L-band transition is the ‘spin-flip’ line of atomic hydrogen at 1420 MHz, which allowed astronomers to map the neutral hydrogen content of the Galaxy. Clouds excited by the formation of massive stars are photoionized, rather than neutral, and ionized hydrogen emits a continuous spectrum of radiation through the thermal bremsstrahlung process (see Section 2.9.1). As this process is thermal, it has a spectrum which rises with frequency at radio wavelengths if optically thick (see Eq. (4.7)), making it stronger at L-band relative to non-thermal emission, when compared with earlier observations at lower frequencies. Non-thermal radiation processes typically have a spectrum which decays as a power-law with frequency in the radio region. A catalogue of sources containing ionized hydrogen, comprising mostly massive, star-forming, Galactic gas clouds, was constructed by Westerhout (1958) from continuum observations at 1390 MHz, and its sources were classified by a W-number (for example W49), a nomenclature which has continued to be used up to the present day, and in particular for the case study of W3(OH) in Section 2.9.
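To illustrate the point numerically, the following is a rough sketch; the 408 MHz comparison frequency and the spectral indices of +2 (optically thick thermal emission) and −0.7 (a typical non-thermal power law) are assumed values for illustration, not figures quoted in the text.

```python
# Illustrative comparison of how thermal and non-thermal radio sources
# change in brightness between an assumed lower frequency and L-band.
nu_low, nu_L = 0.408, 1.390            # GHz (assumed comparison frequencies)
alpha_thermal = 2.0                    # optically thick bremsstrahlung, S ~ nu^2
alpha_nonthermal = -0.7                # typical synchrotron power-law index

thermal_ratio = (nu_L / nu_low) ** alpha_thermal
nonthermal_ratio = (nu_L / nu_low) ** alpha_nonthermal

print(f"optically thick thermal source brightens by a factor {thermal_ratio:.1f}")
print(f"non-thermal source fades by a factor {1.0 / nonthermal_ratio:.1f}")
```

Under these assumed indices, a thermal source is roughly ten times brighter at 1390 MHz than at 408 MHz, while a non-thermal source is more than twice as faint, which is why the L-band surveys picked out the ionized, star-forming clouds.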
In the past it may have been true that Stephen Senn's (2003) analogy was right. Paraphrasing his words for the sake of moderate language: scientists regarded statistics as the one-night stand: the quick fix, avoiding long-term entanglement. This analogy is no longer apt. Statistical procedures now drive many if not most areas of current astrophysics and cosmology. In particular, the currently understood nature of our Universe is a product of statistical analysis of large and combined data sets. Here we briefly describe the scene in three areas that dominate the definition of the current model of the Universe and its history. The three areas inextricably tie together the shape and content of the Universe and the formation of structure and galaxies, leading to life as we know it. While these sketches are not reviews, we show by cross-referencing how frequently our preceding discussions play into current research in cosmology.
The galaxy universe
The story of galaxy formation since 1990 is based on two premises. Firstly, it was widely accepted that the matter content in the Universe is primarily cold and dark – CDM prevails. The recognition of dark matter was slow, despite Zwicky (1937) demonstrating its existence via the cosmic virial theorem. The measurements of rotation curves of spiral galaxies (e.g. Rubin et al., 1980) convinced us.
Peter Scheuer started this. In 1977 he walked into JVW's office in the Cavendish Lab and quietly asked for advice on what further material should be taught to the new intake of Radio Astronomy graduate students (that year including the hapless CRJ). JVW, wrestling with simple Chi-square testing at the time, blurted out ‘They know nothing about practical statistics …’. Peter left thoughtfully. A day later he returned. ‘Good news! The Management Board has decided that the students are going to have a course on practical statistics.’ ‘Can I sit in?’ JVW asked innocently. ‘Better news! The Management Board has decided that you're going to teach it …’.
So, for us, began the notion of practical statistics. A subject that began with gambling is not an arcane academic pursuit, but it is certainly subtle as well. It is fitting that Peter Scheuer was involved at the beginning of this (lengthy) project; his style of science exemplified both subtlety and pragmatism. We hope that we can convey something of both. If an echo of Peter's booming laugh is sometimes heard in these pages, it is because we both learned from him that a useful answer is often much easier – and certainly much more entertaining – than you at first think.
After the initial course, the material for this book grew out of various further courses, journal articles and the abundant personal experience that results from understanding just a little of any field of knowledge that counts Gauss and Laplace amongst its originators.
Teaching is highly educational for teachers. Teaching from the first edition revealed to us how much students enjoyed Monte Carlo methods, and the ability with such methods to test and to check every derivation, test, procedure or result in the book. Thus, a change in the second edition is to introduce Monte Carlo as early as possible (Chapter 2). Teaching also revealed to us areas in which we assumed too much (and too little). We have therefore aimed for some smoothing of learning gradients where slope changes have appeared to be too sudden. Chapters 6 and 7 substantially amplify our previous treatments of Bayesian hypothesis testing/modelling, and include much more on model choice and Markov chain Monte Carlo (MCMC) analysis. Our previous chapter on 2D (sky distribution) analysis has been significantly revised. We have added a final chapter sketching the application of statistics to some current areas of astrophysics and cosmology, including galaxy formation and large-scale structure, weak gravitational lensing, and the cosmic microwave background (CMB) radiation.
We received very helpful comments from anonymous referees whom CUP consulted about our proposals for the second edition. These reviewers requested that we keep the book (a) practical and (b) concise – small, or 'backpackable', as one of them put it. We have additional colleagues to thank, either for further discussions, for finding errata, or because we just plain missed them from our first-edition list: Matthew Colless, Jim Condon, Mike Disney, Alan Heavens, Martin Hendry, Jim Moran, Douglas Scott, Robert Smith and Malte Tewes.
Frustra fit per plura quod potest fieri per pauciora – it is futile to do with more things that which can be done with fewer.
(William of Ockham, c.1285–1349)
Nature laughs at the difficulties of integration.
(Pierre-Simon de Laplace, 1749–1827; Gordon & Sorkin, 1959)
One of the attractive features of the Bayesian method is that it offers a principled way of making choices between models. In classical statistics, we may fit data to a model, say by least squares, and then use the resulting χ2 statistic to decide if we should reject the model. We would do this if the deviations from the model are unlikely to have occurred by chance. However, it is not clear what to do if the deviations are likely to have occurred by chance, and it is even less clear what to do if several models are available. For example, if a model is in fact correct, the significance level derived from a χ2 test (or, indeed, any significance test) will be uniformly distributed between zero and one (Exercise 7.1).
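A small Monte Carlo check of this statement, in the spirit of Exercise 7.1, might look as follows; it is an illustrative sketch rather than the book's own code, and the Gaussian toy data and the fitted-mean model are assumptions.

```python
import numpy as np
from scipy import stats

# Toy setup: Gaussian data of known sigma, one fitted parameter (the mean).
# If the model is correct, the chi-square significance level should be
# uniformly distributed on (0, 1).
rng = np.random.default_rng(1)
n_points, n_trials, sigma, true_mean = 20, 5000, 1.0, 3.0

p_values = np.empty(n_trials)
for i in range(n_trials):
    data = rng.normal(true_mean, sigma, n_points)
    chi2 = np.sum(((data - data.mean()) / sigma) ** 2)   # fit = sample mean
    p_values[i] = stats.chi2.sf(chi2, df=n_points - 1)    # significance level

# A uniform (0, 1) distribution has mean 0.5 and standard deviation ~0.289.
print(p_values.mean(), p_values.std())
```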
The problem with model choice by χ2 (or any similar classical method) is that these methods do not answer the question we wish to ask. For a model H and data D, a significance level derived from a minimum χ2 tells us about the conditional probability, prob(D | H).
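To make the distinction concrete, here is a minimal numerical sketch of how prob(D | H) must be combined with prior probabilities over the competing models before it says anything about prob(H | D); the likelihood values and the equal priors are purely hypothetical.

```python
# Hypothetical likelihoods prob(D | H_i) for two competing models, and
# assumed equal prior probabilities prob(H_i); illustrative numbers only.
likelihoods = {'H1': 0.08, 'H2': 0.02}
priors = {'H1': 0.5, 'H2': 0.5}

evidence = sum(likelihoods[h] * priors[h] for h in likelihoods)     # prob(D)
posteriors = {h: likelihoods[h] * priors[h] / evidence for h in likelihoods}

print(posteriors)   # prob(H_i | D): {'H1': 0.8, 'H2': 0.2}
```

The χ2 significance level supplies, at best, one of the likelihood-like ingredients; the posterior probability of each model also depends on the priors and on which alternative models are admitted.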
The stock market is an excellent economic forecaster. It has predicted six of the last three recessions.
(Paul Samuelson)
The only function of economic forecasting is to make astrology look respectable.
(John Kenneth Galbraith)
In contrast to previous chapters, we now consider data transformation: how to transform data in order to produce improved outcomes in either extracting or enhancing the signal.
Many observations consist of sequential data: intensity as a function of position as a radio telescope is scanned across the sky, or as the signal varies across a row of a CCD detector; single-slit spectra; time measurements of intensity (or of any other property). What sort of issues might concern us?
(i) trend-finding; can we predict the future behaviour of data?
(ii) baseline detection and/or assessment, so that signal on this baseline can be analysed;
(iii) signal detection; identification, for example, of a spectral line or source in sequential data in which the noise may be comparable in magnitude to the signal;
(iv) filtering to improve signal-to-noise ratio;
(v) quantifying the noise;
(vi) period-finding; searching the data for periodicities (a short sketch follows this list);
(vii) correlation of time series to find correlated signal between antenna pairs or to find spectral lines;
(viii) modelling; many astronomical systems give us our data convolved with some more or less known instrumental function, and we need to take this into account to get back to the true data.
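As a minimal sketch of items (iv) and (vi), the following generates a sinusoid buried in noise, filters it with a running mean, and picks out its period from the discrete power spectrum; the synthetic data and all parameter choices are assumptions made for illustration, not examples from the text.

```python
import numpy as np

# Synthetic sequential data: a sinusoid buried in noise (illustrative values).
rng = np.random.default_rng(42)
n = 1024
t = np.arange(n)                                  # sample index
period_true = 64.0
data = np.sin(2 * np.pi * t / period_true) + rng.normal(scale=2.0, size=n)

# (iv) crude filtering: running mean over a window shorter than the period.
window = 11
smoothed = np.convolve(data, np.ones(window) / window, mode='same')
print(f"rms before/after smoothing: {data.std():.2f} / {smoothed.std():.2f}")

# (vi) period-finding: locate the peak of the discrete power spectrum.
power = np.abs(np.fft.rfft(data - data.mean())) ** 2
freqs = np.fft.rfftfreq(n, d=1.0)
period_est = 1.0 / freqs[1:][np.argmax(power[1:])]   # skip the zero frequency

print(f"true period = {period_true}, estimated period = {period_est:.1f}")
```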
Statistics, the most important science in the whole world: for upon it depends the practical application of every other science and of every art.
(Florence Nightingale)
If your experiment needs statistics, you ought to have done a better experiment.
(Ernest Rutherford)
Science is about decision. Building instruments, collecting data, reducing data, compiling catalogues, classifying, doing theory – all of these are tools, techniques or aspects which are necessary. But we are not doing science unless we are deciding something; only decision counts. Is this hypothesis or theory correct? If not, why not? Are these data self-consistent or consistent with other data? Adequate to answer the question posed? What further experiments do they suggest?
We decide by comparing. We compare by describing properties of an object or sample, because lists of numbers or images do not present us with immediate results enabling us to decide anything. Is the faint smudge on an image a star or a galaxy? We characterize its shape, crudely perhaps, by a property, say the full-width half-maximum, the FWHM, which we compare with the FWHM of the point-spread function. We have represented a data set, the image of the object, by a statistic, and in so doing we reach a decision.
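A toy version of this decision might read as follows; the Gaussian profiles, the assumed seeing width and the 20 per cent broadening threshold are all illustrative assumptions, not a procedure prescribed here.

```python
import numpy as np

def fwhm(x, profile):
    """Crude full-width half-maximum of a single-peaked, sampled profile."""
    half = profile.max() / 2.0
    above = x[profile >= half]
    return above.max() - above.min()

x = np.linspace(-10.0, 10.0, 2001)                  # pixels
psf = np.exp(-0.5 * (x / 1.0) ** 2)                 # assumed point-spread function
smudge = np.exp(-0.5 * (x / 1.8) ** 2)              # the faint smudge on the image

# Decision: an object much broader than the PSF is taken to be extended.
is_extended = fwhm(x, smudge) > 1.2 * fwhm(x, psf)
print('galaxy (extended)' if is_extended else 'star (point-like)')
```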
Statistics are there for decision and because we know a background against which to take a decision.
In embarking on statistics we are entering a vast area, enormously developed for the Gaussian distribution in particular. This is classical territory; historically, statistics were developed because the approach now called Bayesian had fallen out of favour. Hence, direct probabilistic inferences were superseded by the indirect and conceptually different route, going through statistics and intimately linked to hypothesis testing. The use of statistics is not particularly easy. The alternatives to Bayes' methods are subtle and not very obvious; they are also associated with some fairly formidable mathematical machinery. We will avoid this, presenting only results and showing the use of statistics, while trying to make clear the conceptual foundations.
Statistics
Statistics are designed to summarize, reduce or describe data. The formal definition of a statistic is that it is some function of the data alone. For a set of data X1, X2, …, some examples of statistics might be the average, the maximum value or the average of the cosines. Statistics are therefore combinations of finite amounts of data. In the following discussion, and indeed throughout, we try to distinguish particular fixed values of the data, and functions of the data alone, by upper case (except for Greek letters). Possible values, being variables, we will denote in the usual algebraic spirit by lower case.
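As a trivial illustration of statistics as functions of the data alone, the three examples above can be computed directly; the data values are invented.

```python
import numpy as np

X = np.array([2.3, 1.7, 3.1, 2.9, 2.0])   # a data set X1, X2, ... (invented values)

average = X.mean()                          # the average
maximum = X.max()                           # the maximum value
mean_cos = np.cos(X).mean()                 # the average of the cosines

print(average, maximum, mean_cos)
```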
(interchange between Peter Scheuer and his then student, CRJ)
(The) premise that statistical significance is the only reliable indication of causation is flawed.
(US Supreme Court, Matrixx Initiatives, Inc. vs. Siracusano, 22 March 2011)
It is often the case that we need to do sample comparison: we have someone else's data to compare with ours; or someone else's model to compare with our data; or even our data to compare with our model. We need to make the comparison and to decide something. We are doing hypothesis testing – are our data consistent with a model, with somebody else's data? In searching for correlations as we were in Chapter 4, we were hypothesis testing; in the model-fitting of Chapter 6 we are involved in data modelling and parameter estimation.
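As a small classical example of sample comparison, one might ask whether our data and somebody else's could share a parent distribution; the two synthetic data sets and the choice of the two-sample Kolmogorov-Smirnov test are illustrative assumptions, not the book's prescription.

```python
import numpy as np
from scipy import stats

# Two synthetic 'measurements' of the same quantity, with a small offset.
rng = np.random.default_rng(7)
ours = rng.normal(loc=0.0, scale=1.0, size=200)
theirs = rng.normal(loc=0.3, scale=1.0, size=150)

# Two-sample Kolmogorov-Smirnov test of the hypothesis that both samples
# are drawn from the same (unspecified) distribution.
result = stats.ks_2samp(ours, theirs)
print(result.statistic, result.pvalue)
```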
A frequentist point of view might be to consider the entire science of statistical inference as hypothesis testing followed by parameter estimation. However, if experiments were properly designed, the Bayesian approach would be right: it answers the sample-comparison questions we wished to pose in the first place, namely what is the probability, given the data, that a particular model is right? Or: what is the probability, given two sets of data, that they agree? The two-stage process should be unnecessary at best.