To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
The importance of simple descriptive data was recognised by William Farr, whom we mentioned briefly in Chapter 1 for his seminal work using the newly established vital statistics register of England in the nineteenth century. As we discussed in Chapter 1, this descriptive epidemiology, concerned as it is with ‘person, place and time’, attempts to answer the questions ‘Who?’, ‘What?’, ‘Where?’ and ‘When?’. This can include anything from a description of disease in a single person (a case report) or a special survey conducted to measure the prevalence of a particular health issue in a specific population, to reports from national surveys and data collection systems showing how rates of disease or other health-related factors vary in different geographical areas or over time (time trends). In this chapter we look in more detail at some of the most common types of descriptive data and where they come from. However, before embarking on a data hunt, we first need to decide exactly what it is we want to know, and this can pose a challenge. To make good use of the most relevant descriptive data, it is critical to formulate our question as precisely as possible.
While it is important to be able to read and interpret individual papers, the results of a single study are never going to provide the complete answer to a question. To move towards this, we need to review the literature more widely. There can be a number of reasons for doing this, some of which require a more comprehensive approach than others. If the aim is simply to increase our personal understanding of a new area, then a few papers might provide adequate background material. Traditional narrative reviews have value for exploring areas of uncertainty or novelty but give less emphasis to complete coverage of the literature and tend to be more qualitative, so it is harder to scrutinise them for flaws. Scoping reviews are more systematic but still exploratory. They are conducted to identify the breadth of evidence available on a particular topic, clarify key concepts and identify the knowledge gaps. In contrast, a major decision regarding policy or practice should be based on a systematic review and perhaps a meta-analysis of all the relevant literature, and it is this approach that we focus on here.
The global pandemic of COVID-19 that began in late 2019 highlighted the importance of rapid and thorough investigations of outbreaks. The response to COVID-19 was at a scale not previously seen, involving all sectors of society, including government and private industry. To control and minimise the impact of COVID-19, huge and costly efforts were required to effectively coordinate many different organisations, many of which were not primarily concerned with public health. This type of re-focusing of resources is common in outbreak and public health emergency settings, but is rarely seen at such scale.In this chapter we look at outbreak investigation in more detail and, in doing so, focus on infectious diseases, although not exclusively, because other agents such as toxins and chemicals can also result in ‘outbreaks’ of non-communicable intoxications, injuries and cancer.
When we speak of prevention in the context of public health, we usually think of what is sometimes called ‘primary prevention’, which aims to prevent disease from occurring in the first place; that is, to reduce the incidence of disease. Vaccination against childhood infectious diseases is a good example of primary prevention, as is the use of sunscreen to prevent the development of skin cancer. However, somewhat confusingly, the term ‘prevention’ is also used to describe other strategies to control disease. One of these is the use of screening to advance diagnosis to a point at which intervention is more effective, often described as ‘secondary prevention’. What is sometimes called ‘tertiary prevention’ is even more remote from the everyday concept of prevention, usually implying efforts to limit disease progression or the provision of better rehabilitation to enhance quality of life among those who have been diagnosed with a disease.
The intersection of physics and machine learning has given rise to the physics-enhanced machine learning (PEML) paradigm, aiming to improve the capabilities and reduce the individual shortcomings of data- or physics-only methods. In this paper, the spectrum of PEML methods, expressed across the defining axes of physics and data, is discussed by engaging in a comprehensive exploration of its characteristics, usage, and motivations. In doing so, we present a survey of recent applications and developments of PEML techniques, revealing the potency of PEML in addressing complex challenges. We further demonstrate the application of select such schemes on the simple working example of a single degree-of-freedom Duffing oscillator, which allows to highlight the individual characteristics and motivations of different “genres” of PEML approaches. To promote collaboration and transparency, and to provide practical examples for the reader, the code generating these working examples is provided alongside this paper. As a foundational contribution, this paper underscores the significance of PEML in pushing the boundaries of scientific and engineering research, underpinned by the synergy of physical insights and machine learning capabilities.
In the preceding chapters we have covered the core principles and methods of epidemiology and have shown you some of the main areas where epidemiological evidence is crucial for policy and planning. You will also have gained a sense of the breadth and depth of the subject from the examples throughout the book. To finish, we take a broader look at the role of epidemiological practice and logic in improving health. There is a growing desire for public health and medical research to be ‘translational’; that is, directly applicable to a population or patient. The process, whereby research evidence is used to change practice or policy, is known as ‘translation’, and the research outputs from epidemiology are critical at all stages (see Box 16.2); indeed, epidemiology has been described as ‘the epicenter of translational science’ (Hiatt, 2010).
The overarching goal of public health is to maximise the health of the population, and to achieve this we need evidence about what works and what does not work. Good studies are difficult to design and implement, and interpretation of their results and conclusions is not always as straightforward as we might hope. How, then, can we make the best use of this information? In the next three chapters we look at ways to identify, appraise, integrate and interpret the literature to generate the evidence we need to inform policy and practice. In this chapter we focus on interpreting the results from a single study, because if they are not valid they will be of limited value. The central question we have to answer when we read a study report is, ‘Are the results of the study valid?’
Confounding refers to a mixing or muddling of effects that can occur when the relationship we are interested in is confused by the effect of something else. It arises when the groups we are comparing are not completely exchangeable and so differ with respect to factors other than their exposure status. If one (or more) of these other factors is a cause of both the exposure and the outcome, then some or all of an observed association between the exposure and outcome may be due to that factor.
We study a signaling game between an employer and a potential employee, where the employee has private information regarding their production capacity. At the initial stage, the employee communicates a salary claim, after which the true production capacity is gradually revealed to the employer as the unknown drift of a Brownian motion representing the revenues generated by the employee. Subsequently, the employer has the possibility to choose a time to fire the employee in case the estimated production capacity falls short of the salary. In this setup, we use filtering and optimal stopping theory to derive an equilibrium in which the employee provides a randomized salary claim and the employer uses a threshold strategy in terms of the conditional probability for the high production capacity. The analysis is robust in the sense that various extensions of the basic model can be solved using the same methodology, including cases with positive firing costs, incomplete information about an individual’s own type, as well as an additional interview phase.
In the previous chapter we alluded to what is sometimes called ‘secondary’ prevention, where instead of trying to prevent disease from occurring, we try to detect it earlier, in the hope that this will enable more effective treatment and thus improved health outcomes. This is an aspect of public health that has great intuitive appeal, especially for serious conditions such as cancer, where the options for primary prevention can be very limited. However, screening programs are usually very costly exercises and they do not always deliver the expected benefits in terms of improved health outcomes. In this chapter we introduce you to the requirements for implementing a successful screening program and to some of the problems that we encounter when trying to determine whether such a program is actually beneficial in practice.
If the results of a study reveal an interesting association between an exposure and a health outcome, there is a natural tendency to assume that it is real. (Note: we are considering whether two things are associated. This does not imply that one causes the other to occur.) However, before we can even contemplate this possibility we have to try to rule out other possible explanations for the results. There are three main ‘alternative explanations’ that we have to consider whenever we analyse epidemiological data or read the reports of others, whatever the study design; namely, could the results be due to chance, bias or error, or confounding? We discuss the first of these, chance, in this chapter and cover bias and confounding in Chapters 7 and 8, respectively.