Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-12T10:07:04.144Z Has data issue: false hasContentIssue false

Discovering the Unexpected in Astronomical Survey Data

Published online by Cambridge University Press:  31 January 2017

Ray P. Norris*
Affiliation:
Western Sydney University, Locked Bag 1797, Penrith South, NSW 1797, Australia CSIRO Astronomy & Space Science, PO Box 76, Epping, NSW 1710, Australia
Rights & Permissions [Opens in a new window]

Abstract

Most major discoveries in astronomy are unplanned, and result from surveying the Universe in a new way, rather than by testing a hypothesis or conducting an investigation with planned outcomes. For example, of the ten greatest discoveries made by the Hubble Space Telescope, only one was listed in its key science goals. So a telescope that merely achieves its stated science goals is not achieving its potential scientific productivity.

Several next-generation astronomical survey telescopes are currently being designed and constructed that will significantly expand the volume of observational parameter space, and should in principle discover unexpected new phenomena and new types of object. However, the complexity of the telescopes and the large data volumes mean that these discoveries are unlikely to be found by chance. Therefore, it is necessary to plan explicitly for unexpected discoveries in the design and construction. Two types of discovery are recognised: unexpected objects and unexpected phenomena.

This paper argues that next-generation astronomical surveys require an explicit process for detecting the unexpected, and proposes an implementation of this process. This implementation addresses both types of discovery, and relies heavily on machine-learning techniques, and also on theory-based simulations that encapsulate our current understanding of the Universe.

Information

Type
Research Article
Copyright
Copyright © Astronomical Society of Australia 2017 
Figure 0

Figure 1. A plot of recent major astronomical discoveries, taken from Ekers (2009), of which seven were ‘known–unknowns’ (i.e. discoveries made by testing a prediction) and ten were ‘unknown–unknowns’ (i.e. a serendipitous result found by chance while performing an experiment with different goals). The data in this plot are taken from Wilkinson et al. (2004).

Figure 1

Table 1. Major discoveries made by the Hubble Space Telescope (HST). Of the HST ’s ‘top ten’ discoveries (as ranked by National Geographic magazine), only one was a key project used in the HST funding proposal (Lallo 2012). A further four projects were planned in advance by individual scientists but not listed as key projects in the HST proposal. Half the ‘top ten’ HST discoveries were unplanned, including two of the three most cited discoveries, and including the only HST discovery (Dark Energy) to win a Nobel prize. This Table was previously published by Norris et al. (2015).

Figure 2

Figure 2. Comparison of existing and planned deep 20-cm radio continuum surveys, adapted from a diagram in Norris et al. (2013) originally drawn by Isabella Prandoni. The horizontal axis shows the 5-σ sensitivity, and the vertical axis shows the sky coverage. The right-hand diagonal dashed line shows the approximate envelope of existing surveys, which is largely determined by the availability of telescope time. Surveys not at 20 cm are represented at the equivalent 20 cm flux density, assuming a spectral index of − 0.8. The squares in the top-left represent the new radio surveys discussed in this paper. The Square Kilometre Array (Dewdney et al. 2009) will hopefully conduct even larger surveys in the next decade, extending well to the left of EMU, but such plans are not yet concrete.

Figure 3

Figure 3. The flowchart for discovering unexpected objects in EMU.

Figure 4

Figure 4. The flowchart for discovering unexpected phenomena in the EMU WTF project.

Figure 5

Figure 5. The angular power spectrum for radio sources in the SPT field, taken from Rees et al. (in preparation). Points with error bars are the measured angular power spectrum of the data obtained by O’Brien et al. (2016), and the blue line shows the distribution predicted by the semi-empirical model described in the text. The dotted line shows the cosmological signal predicted by ΛCDM, and the dashed line show the effect of radio source size and double radio sources. The solid black line is the sum of these latter two predictions.