The effect of timers and precommitments on handwashing: a randomised controlled trial in a kitchen laboratory

Many foodborne illness outbreaks originate in food service establishments. We tested two behavioural interventions designed to improve the duration and quality of handwashing. We ran a three-armed parallel trial in a laboratory kitchen, from 7 March to 27 May 2022. Participants were n = 195 workers who handle food. We randomly allocated participants to three groups: Timer – tap-mounted timer that counted seconds while participants washed their hands; Precommitment – agreed to five statements on good hand hygiene before attending the kitchen; and Control. Participants completed a food preparation task under time pressure. Cameras focused on the sink captured handwashing. Outcome measures were number of times participants washed their hands; number of times they washed their hands using soap; number of times they washed using soap and washed the backs of their hands; and mean duration of handwashing attempts using soap. Participants in Timer washed their hands for 1.9 s longer on average than Control ( β = 2.20, 95% CI = 0.34-4.06, p = 0.021). Participants in Precommitment washed their hands for 2.5 s longer on average than Control ( β = 2.30, 95% CI = 0.33-4.27, p = 0.022). We found no statistically significant differences on any other outcome measure.


Background and aims
In 2009 in the UK, there were over 500,000 estimated cases of foodborne disease due to known pathogens (Food Standards Agency, 2014).Foodborne illnesses are caused by eating food contaminated with bacteria, viruses, other parasites, or chemical contaminants like heavy metals.Recent estimates put the number of yearly cases of foodborne illness at 2.4 million in the UK, imposing an estimated total burden of £9 billion (Daniel et al., 2018).Many foodborne illness outbreaks originate in food service establishments, with food workers' poor personal hygiene as a significant contributor to outbreaks (Olsen et al., 2000).For example, eating out accounts for an estimated 37% of all foodborne norovirus cases, and takeaways account for 26% (Food Standards Agency, 2022).Workers' hand hygiene is estimated to account for 89% of variance in outbreaks caused by food contaminated with pathogens in food service establishments (Guzewich and Ross, 1999).A more recent analysis of outbreaks associated with restaurants from 1998 to 2013 found that the most commonly reported contributing factors were those related to food handling and preparation practices in the restaurant (Angelo et al., 2017).
The present study aimed to test the effects of behavioural interventions aimed at improving handwashing behaviour in employees of Food Business Operators (FBOs).In the first stage of the study, we carried out a rapid review of the existing literature on hand hygiene interventions in FBO workers to identify the most promising interventions to test.We then carried out a randomised controlled trial in a kitchen laboratory to provide evidence of their effectiveness.

Review of literature on hand hygiene interventions in food handlers
Poor hand hygiene can be caused by washing for too short a time, not washing frequently enough, or by poor techniquein particular, failing to apply soap to all parts of the hand.Most hand hygiene studies investigate frequency (see, e.g., studies found in (Huis et al., 2012)).However, arguably, frequency of handwashing is insufficient because if hands are not washed for long enough or parts of the hand are missed, then pathogens are still likely to remain (Öncü and Vayısoğlu, 2021).Preliminary research on hand hygiene of UK food handlers, conducted before the onset of the COVID-19 pandemic, identified knowledge gaps around effective handwashing practice in food handlers, including around the length of time required for washing and drying hands (Ipsos Mori, 2017).This pre-pandemic research also found behavioural barriers to handwashing: participants said they were likely to forget to wash when busy or under pressure, that there was little expectation of handwashing from seniors, and necessary materials were often not available.
Post-pandemic, it seems less likely that hand hygiene failures are due to knowledge gaps.One of the UK Government's messages during the COVID-19 pandemic was 'Hands.Face.Space' and well-publicised official guidance was to wash for 20 s (NHS, 2022).In the US, the Centre for Disease Control promoted 'Five Steps to Wash Your Hands the Right Way', one of which was scrubbing hands for 20 s (CDC, 2022).However, it is well-known that knowledge does not on its own lead to action.For instance, in the case of handwashing, a meta-analysis of food safety training in commercial settings found that training improved knowledge in most studies, but it did not lead to any change in handwashing behaviour in five of the six behaviours examined (McFarland et al., 2019).Indeed, reviews of hand hygiene behaviours in various domains have found that the most common aim of interventions is to increase knowledge, with changes to people's motivations and physical environments either underexplored or potentially more effective (Huis et al., 2012;Staniford and Schmidtke, 2020).Behavioural interventions may be able to bridge this 'knowledge-action gap'.
Recent research has shown the potential benefits of interventions on the physical environment to improve duration, with two studies finding that installing timers at sinks were effective at increasing the average duration of handwashing attempts by food handlers.Her et al. (2019) installed tap-mounted timers that counted down 30 s when the water was turned on, alongside informational posters about their use and proper handwashing technique.The authors reported a statistically significant increase in mean handwashing duration from a pre-treatment baseline (11.6-15.7 s).Similarly, Yu et al. (2018) found statistically significant increases in the frequency (71.2-102.7 times/day), mean duration (12.8-19.2s) and the quality of handwashing attemptsrelative to a pre-treatment baselinewhen they introduced a 'motivational' soap dispenser that played 18 s of music whenever food handlers applied soap.
In terms of intervening on motivations, few studies have attempted to improve hand hygiene by intervening on intentions (Huis et al., 2012); however, some researchers have recommended that people use 'implementation intentions' to increase handwashing (Michie et al., 2020).In other domains, the act of making a precommitment or forming a plan to translate intention into action has been shown to support the regulation of existing behaviour and the uptake of a new behaviour.Forming specific implementation intentions of the form 'Whenever situation x arises, I will initiate the goal-directed response y!' (Gollwitzer, 1999;Brandstätter et al., 2001) has been shown to be an effective strategy when tackling self-regulatory problems (e.g., remembering to act, seizing opportunities and overcoming initial reluctance) (Gollwitzer and Sheeran, 2006).Additionally, previous research has demonstrated the effectiveness of committing to action in advance on the uptake of a wide range of behaviours, including weight loss (Coupe et al., 2019), smoking cessation (Black et al., 2020) and safe water consumption (Inauen and Mosler, 2016).In principle, if someone has the goal of washing their hands appropriately, then making precommitments or forming implementation intentions could help them achieve their goal.However, we are not aware of any studies exploring the effects of these strategies on handwashing.
Two issues that have beset previous research in this area are experimental design and outcome measures, with most studies using observational pre-post designs or self-reported measures (McFarland et al., 2019;Insfran-Rivarola et al., 2020).It is difficult to run a gold-standard randomised controlled trial in a field setting because there are usually only a small number of kitchens (or bathrooms or hospital wards) available to study, so there are not enough units for a well-powered trial.This is compounded by problems of outcome measurement, with the most reliable outcome measures also being the most labour-intensive (Haas and Larson, 2007).
Outcome measures pose particularly intractable problems, requiring trade-offs to be made.Self-reported measures may be a cheap way of gathering a lot of data (Ainsworth et al., 2021), but they tend to be limited to measures of handwashing frequency.In addition, there is a general problem of using self-reports to measure a socially desirable behaviour and some more specific problems around people's answers being sensitive to the way that the survey question is written (Hansen et al., 2021).One way that researchers have tried to avoid self-reports is by measuring soap use as a proxy for handwashing (e.g.(Judah et al., 2009;Porzig-Drummond et al., 2009)).This tends to be done in studies in bathrooms; researchers measure soap use over a period of time whilst also counting the number of people entering into the bathroom, in order to get an average measure of soap use per head.Of course, this method is unable to distinguish a few people using a lot of extra soap from an effect that is more evenly spread across the population.It also does not provide information about duration or quality of handwashing (Yu et al., 2018).
Another way to avoid self-reports is to use direct observation.This allows for the most accurate determination of duration and quality.However, as well as being time-and resource-intensive, there is the possibility that being aware of being observed can itself influence behaviour: the 'Hawthorne Effect' (Haas and Larson, 2007;Gould et al., 2017aGould et al., , 2017b)).Therefore, direct observation seems to be rather rare in the literature, but even when there is direct observation, it may be operationalised as a frequency checklist (e.g.(Husain et al., 2016;Štefančič and Jevšnik, 2020)).Adjustments that may decrease the likelihood of the Hawthorne Effect include the use of confederates, who pretend not to be watching (Hansen et al., 2021).However, this raises ethical issues around deception and may result in the confederate not being close enough to make detailed observations and therefore only providing a measure of handwashing frequency.Cameras provide a less labour-intensive and slightly more indirect measure of observation, which balance the ability to get accurate measures with a slightly lower feeling of being watched (Chapman et al., 2010;Birnbach et al., 2013;do Prado et al., 2015;Yu et al., 2018;Her et al., 2019;Barrett and Feng, 2020).
These observations are congruent with the outcome of a recent Cochrane review of interventions to increase hand hygiene compliance in hospitals, which found that it is difficult to draw many conclusions from the literature because of twin problems of a lack of methodological rigour and lack of concealment of the intervention (Gould et al., 2017a(Gould et al., , 2017b;;Gould et al., 2018).(Though we also note that hospitals are not cognate settings to food businesses, so apart from having similar methodological problems, that literature is not as pertinent here.We return to this in the discussion.)

Approach and hypotheses
We designed a randomised controlled trial in a kitchen laboratory to test the effect of two behavioural interventions aimed at improving handwashing behaviour in employees of FBOs: timers and precommitments.The use of a laboratory setting allowed for individual randomisation, with the outcome measures of frequency, duration and technique being recorded on video cameras.Participants were given a burger-making task under time pressure, to mimic the experience of being busy or feeling under pressure, which are cited as a barrier to handwashing in restaurant kitchens.There were decoy cameras on the work surfaces, as well as ones taking recordings by the sink, to prevent participants guessing what the purpose of the laboratory trial was and to minimise the experimenter demand effect.
Hypothesis 1: Installing a timer at the sink and a poster prompting proper handwashing technique will increase the duration and quality of handwashing attempts relative to control.Both the timer and poster are concerned with how food handlers wash their hands, not how often they do so, so we hypothesised no effect on handwashing frequency.Frequency was nonetheless of interest, as a reduction in handwashing frequency was a plausible 'backfire' effect if food handlers find handwashing for the full 20 s unduly burdensome.
Hypothesis 2: Asking participants to make a series of pre-trial commitments to comply with specific elements of the FSA's guidance on duration, frequency and quality of handwashing will cause an increase in measures of all three, relative to control.

Trial design
The trial used a three-arm between-subjects design.We carried out testing in a laboratory kitchen: a cookery school with three separate workstations, screened off from each other.Up to three participants were tested at a time, and each was assigned their own workstation and sink for use (see Figure 1).(The cookery school was in Wandsworth, London, UK. https://theavenuecookeryschool.com/contact/) We asked all participants to complete a food preparation task following a set recipe.The recipe required that participants handle raw meat and included multiple steps at which handwashing would be expected (eight handwashing occasions expected in total, if the task was completed).The task instructions emphasised the need to work as quickly as possible because food handlers say they are more likely to forget to wash their hands during busy periods (Ipsos Mori, 2017).Before attending the laboratory session, all participants were required to complete a short online training session and post-training quiz, in their own time.
At recruitment, we randomly assigned participants with equal probability to one of the three treatment groups: In the Timer group, two interventions were placed at the sink: (i) a tap-mounted timer that counts seconds while participants are washing their hands and (ii) a poster encouraging participants to use it to time 20 s rubbing their hands with soap, and showing the six steps of effective handwashing technique from the FSA's Safer food, better business guidance (Food Standards Agency, 2019).(Figure . 2 shows the poster, which includes an image of the timer device.) In the Precommitment group, the pre-session training included a prompt to commit to a series of statements about handwashingadapted from FSA guidelinesby electronically signing their name (typing it into a textbox).The statements were: • If I touch raw meat, then I will wash my hands afterwards • If I am going to touch or handle any food, then I will wash my hands first • When I wash my hands, I will wet my hands under warm running water and use soap • When I wash my hands, I will wash the back of each hand by rubbing against my palms • When I wash my hands, I will wash my fingertips by rubbing against my other hand's palm Posters reminding Precommitment group participants of these handwash statements were displayed to participants in view of the sink and the workstation during the food preparation task (see Figure .3).Participants in the Control group received no handwashing intervention other than the baseline training completed by all participants.Otherwise, all environmental influences were held constant.
Each laboratory session consisted of 3 slots and each slot was quasi-randomly assigned to a different treatment group (for each time and date, the three slots were sorted into a random order using Python, with the function random.shuffle()).The recruitment agency then recruited into a session's slots in order.For example, if a session's treatments are shuffled into [2, 3, 1], then the first participant recruited for that session was assigned to Group 2, the second recruited to the control, and the third to treatment 1.To minimise the risk of bias based on time of session, participants attending the same session were assigned to different treatment groups.Likewise, to minimise the risk of differences between the workstations confounding treatment effects, the treatment groups assigned to each workstation were rotated between sessions.Opaque barriers were installed to physically separate participants being tested in the same session and minimise potential spillover effects (i.e., participants washing their hands differently because they see other participants doing so or can see interventions from other treatment groups).
We did not inform participants about the focus of this study, or of the FSA's involvement, until the end of the experiment.When initially consenting to take  part, participants were given a very general description of the study's purpose (understanding how food handlers complete a guided food preparation task, under time constraints).A post-participation survey included a question asking participants what they thought the purpose of the study was, to check whether they had discerned the study objective.At the end of their session, all participants were fully debriefed and explicitly asked if they would like to withdraw from the study.

Procedure
The target population for this trial was UK food handlers employed at FBOs.To access our target population, a recruitment agency was used to recruit and allocate participants to testing slots, which had been pre-assigned to treatment groups.Participants were paid £60-70 for taking part.(Early morning slots offered a higher incentive as sign-up for these sessions was lower.) Before attending the lab session, participants were emailed a link to an online pretrial training session and post-training quiz.The training materials covered key aspects of the FSA's guidance on handwashing behaviour (see Supplementary Appendix A1 for details).The post-training quiz gave feedback on any incorrect answers and participants had to input the correct answer to proceed to the next question and complete the training.We recorded their first answers for use in the analysis.All participants were required to complete their training before their lab session date, and any who did not were excluded from the study.The recruitment agency sent regular reminders to participants to complete their training.
Upon arrival at the lab, participants were directed to their assigned workstation, where they were asked to read through a study information sheet and sign a consent form.
The food preparation task was described to participants as 'the burger task'.Participants were told that they would play the part of a worker in a hamburger restaurant preparing orders from customers.The task required participants to input a fictitious order on a tablet device we provided and then prepare the burger patty and assemble the garnish.Participants were instructed to complete the burger task twice within a fixed 25 min time limit, and that they should try and prepare the burger faster the second time.A written copy of the recipe was provided to all participants as a guide (see Supplementary Appendix A2).The task was designed with a total of 8 distinct handwash points, according to FSA guidance (also shown in Supplementary Appendix A2).Cameras focused on the sinks captured footage of participants' handwashing behaviour.An additional camera was also present in each workstation as a decoy, focused on the food preparation area, to mitigate experimenter demand effects.
After completing the food preparation task and cleaning their workstation, participants completed a post-participation survey on their tablet device.This survey included questions about the participants' understanding of the experiment and some additional background information (see Supplementary Appendix A3).After the postparticipation survey, participants were fully debriefed as to the purpose of the study.

Outcome measures
The primary outcome measures of the study are the frequency, duration and quality of handwash attempts made by each participant during the food preparation task.
• Duration: The mean duration of handwashing attempts where soap was used, from first application of soap to starting to rinse.(This definition necessarily means duration measures only covered handwashing instances in which the participants used soap.This outcome was therefore missing for any participants who never used soap.)• Frequency: A count of handwashing attempts made by each participant.
• Quality: ○ A count of handwashing attempts in which soap was used.○ A count of handwashing attempts in which the participant washed the backs of both of their hands (back-hands count).
Cameras placed above the sinks in each workstation recorded participants' handwashing behaviour in the session.The outcome measures were then derived from the coded video recordings by one primary coder and a dummy coder (see Supplementary Appendix A5).
Coders were not informed of the study's hypotheses or treatments and were blind to treatment as far as was possible.The sole exception to this was the tap-mounted timer, which was necessarily visible in footage of the sink, although only from the rear so it was not clear what the device was.All coders completed training against pilot footage (which was not used in the analysis) before coding any footage from the main study.As a quality check, we randomly selected 10% of participants to be double coded and analysed inter-coder reliability on this subset.
In some instances, it was unclear to the coders whether a sequence of footage contained one or multiple distinct handwashing attempts.This occurred when participants applied soap and rinsed multiple times.We amended the code frame to include a flag identifying these instances as 'ambiguous' and ran a sensitivity analysis excluding them for each primary outcome model.
The post-participation survey probed how participants engaged with the interventions.We asked participants how long they tried to spend washing their hands each time they did so, and how they kept track of time.We also tested whether participants in the Precommitment group could recall the statements they had committed to.The real precommitment statements were presented in lists alongside dummy statements and participants were asked to say which they had committed to.We presented the statements across two questionsone for statements prompted at workstations and one for statements prompted at the sink (see Fig. 3).
The post-participation survey also collected background information to inform our analysis, including what participants thought the purpose of the experiment was, information about their job and workplace, and demographic information.For a full list of questions asked, see Supplementary Appendix A3.
Power/sample size Pre-trial power analysis identified a target sample size of 207 participants (n = 69 in each group).This would allow us sufficient power (0.8) to detect a medium-sized effect (α = 0.05), accounting for the need to control the family-wise Type 1 error rate using Bonferroni adjustments when comparing all arms against one another.

Statistical methods
We preregistered our analysis plan on the Open Science Framework before data collection began (https://osf.io/zpvnx).
We analysed each of the primary outcome measures listed above separately.For each, we carried out a generalised linear model (GLM) with the primary outcome as the dependent variable.For the three 'count' outcomes, we used a Poisson log-link GLM.For average duration, no transformation was necessary, so we used an identity link function.
For each outcome measure, the model includes Experimental arm as the primary independent variable of interest.In the models, experimental arm is dummy coded into two binary variables to indicate the presence/absence of the two intervention sets.The primary comparisons of interest were the effect of each of the two interventions against the control arm, as set out in the hypotheses.We thus used a Bonferroni-adjusted α of 0.025 to account for the family-wise Type-1 error rate for these comparisons.The results section reports unadjusted p-values.
The two quality measures (count of handwashing occasions in which soap was used and count of handwashing occasions in which participants washed the backs of both of their hands) also include the count of handwashing attempts made as an essential covariate, to account for variation in handwashing frequency.
In addition, the initial model for each outcome measure features the following non-essential independent variables.(In this list, superscript a indicates covariates that were not listed in the initial analysis plan as counterbalancing between the treatment groups should have accounted for any systematic effects.We decided to include these additional covariates early in fieldwork in response to a higher drop-out rate than anticipated, which could have undermined our counterbalancing.As a robustness check, any best-fit models including these covariates were also checked without them.): • Which of the three workstations the participant was assigned to; • FBO employer type (collected in the post-trial questionnaire); • Job type (collected in the post-trial questionnaire); • Age (collected in the post-trial questionnaire); • Gender (collected in the post-trial questionnaire); • The participant's score on knowledge measures taken from the pre-trial survey; • Whether the participants correctly guessed the purpose of the trial was to observe handwashing behaviour; • Whether the participant finished the task; • Which day of the week testing took place on;ª • Which time slot testing took place in.ª To arrive at a parsimonious model and avoid over-fitting, we iteratively dropped nonessential independent variables from the models as long as doing so improved goodness of fit according to Akaike's Information Criterion.

Participants and balancing checks
The experiment ran between 7 March and 27 May 2022.In total, 214 participants took part over 21 sessions.We excluded 11 participants who had not completed their training and three who could be personally identified from their video footage (a requirement of the study's ethical approval).Technical issues led to lost footage for a further five participants.This left us a total of n = 195 (Timer: n = 69; Precommitment: n = 59; Control: n = 67) participants, which was below our target sample size.
Participants were aged between 17 and 64 years old (M = 35.2,SD = 11.9;excluding two participants who refused to answer this question).Eighty-three identified as male (43%) and 107 as female (55%).Four selected 'I identify in a different way' and one refused to answer the question.Baseline demographic characteristics for each treatment group can be found in Supplementary Appendix B1.
Eight participants (4.1%) ran out of time and were unable to finish the food preparation task fully.At the end of each session, but before debriefing, we asked participants what they thought the purpose of the experiment was.In total, 41 (21%) correctly guessed that the purpose of the experiment was to examine handwashing behaviour.The proportion of participants who correctly guessed the experiment purpose did not differ across treatment arms (X 2 (2,194) = 3.32, p = 0.190).

Inter-coder reliability
To determine agreement between the two coders on the subsample selected for double coding, we ran Kendall's W for each of the four primary outcomes.The results show statistically significant agreement between the two coders on every outcome measure (handwashing attempt count: W = 0.976, p = 0.007; mean duration: W = 0.926, p = 0.012; count of attempts in which soap was used: W = 0.998, p = 0.005; count of attempts in which soap was used and the backs of both hands were washed: W = 0.960, p = 0.008).

Survey measures on engagement with the interventions
The post-trial survey probed how well participants engaged with the interventions (see Tables B2.4 to B2.9 in Supplementary Appendix B2).
Thirty-two participants in the Timer group (46%) said they used the countdown timer attached to the sink to keep track of how long they spent washing their hands.Of those, 28 said the timer was the main thing they used to keep track.The rest of the group either said they did not use the timer (n = 31) or could not say how long they tried to wash their hands for (n = 6).The most common reason for not using the timer was 'I didn't notice it' (n = 15).
Eight participants in the Precommitment group (14%) correctly recalled all the statements they had committed to as part of the precommitment intervention.Forty-four (75%) correctly recalled both statements pertaining to handwashing frequency ('If I touch raw meat, then I will wash my hands afterwards' and 'If I am going to touch or handle any food, then I will wash my hands first') but only nine (15%) correctly recalled all three statements pertaining to handwashing quality ('Using soap and wetting my hands under warm running water', 'Washing the backs of my hands by rubbing against the other hand's palm' and 'Washing my fingertips by rubbing against my other hand's palm').

Descriptive statistics
Participants in the Control group washed their hands for 8.65 s on average (SD = 4.76) compared with 10.51 s (SD = 5.94) in the Timer group and 11.10 s (SD = 6.41) in the Precommitment group (Table 1).The mean duration was therefore lower than the 20 s advised by the FSA in all groups.

Confirmatory inferential analysis
Our best-fit linear regression model found statistically significant increases in duration for both interventions when compared against the Control group (after making a Bonferroni correction to account for the number of pairwise comparisons).The model's coefficients show the size of the increase in duration relative to the control, after accounting for other covariates (Table 2).The timer intervention was associated with a 2.20 s increase in handwashing duration (p = 0.021) and the precommitment intervention was associated with a 2.30 s increase in handwashing duration (p = 0.022).

Sensitivity analysis
Sensitivity analysis (see Supplementary Appendix B3) shows the effect of the timer intervention on duration is not robust to the exclusion of 'ambiguous' handwash attempts.(The coders were instructed to label a handwashing attempt as 'ambiguous' if it was unclear from the footage as to whether one or multiple handwashing attempts had been made.These ambiguous attempts have the potential to distort the study's primary outcomes if coded imperfectly, so re-running the primary analysis without them serves as a check of our findings' robustness.) Post hoc exploratory analysis If we exclude Timer group participants who said they did not use the timer from our best-fit model, the duration increase associated with the timer is larger (2.77 s, p = 0.016) and robust to exclusion of 'ambiguous' handwashing attempts.
Our results are broadly in line with our hypotheses: both interventions appeared to increase handwashing duration in our preregistered analysis.However, post hoc analyses suggest the effect of the timer intervention may be driven by a sub-group of Timer group participants who say they actually used the timer.

Descriptive statistics
Participants in the Control group washed their hands 6.13 times on average (SD = 2.65) compared with 5.48 times (SD = 2.13) in the Timer group and 6.37 times (SD = 2.62) in the Precommitment group (Table 1).The food preparation task included an expected eight handwashing occasions; only 17 participants (9%) washed their hands at least this often (Timer group: 6%, Precommitment group: 12%, Control group: 9%).

Confirmatory inferential analysis
Our best-fit Poisson log-link regression model found no effect of either intervention on the number of handwashing attempts made (Table 3).This is consistent with Hypothesis 1 (no effect of the timer intervention), but not Hypothesis 2 (more handwashing with the precommitment intervention).

Post hoc exploratory analysis
Our best-fit model did find a statistically significant effect of gender: women washed their hands more often than men on average, holding relevant covariates constant.This is consistent with previous findings (Judah et al., 2009), so we ran a post hoc model with an interaction term between gender and treatment group.The interaction model was a worse fit than the main effects model reported in Table 3 and the interaction term was not statistically significant.

Descriptive statistics
Participants in the Control group washed their hands with soap 5.24 times on average (SD = 2.44) compared with 4.68 times (SD = 2.23) in the Timer group and a %Soap-use is the proportion of handwash attempts where soap was used (soap-use count/handwash attempt count).b %Back-hands is the proportion of handwash attempts where backs of both hands were washed (back-hands count/ handwash attempt count).

Confirmatory inferential analysis
Our best-fit Poisson log-link regression model found no effect of either intervention on the number of handwashing attempts using soap, controlling for the total number of handwashing attempts (Table 4).This was contrary to our hypotheses, which posited that both interventions would increase soap use.

Sensitivity analysis
The result did not change when we ran an alternative model specification as a robustness check.The soap-use model suffered from under-dispersion, so the preregistered Poisson distribution may have been a poor fit.The alternative logistic regression model treated soap use as a binary dependent variable (1 = the participant used soap every time they washed their hands, 0 = the participants washed their hands without soap at least once).We used the same approach to covariate selection as the main analysis.For no combination of covariates trialled did the model explain our data significantly better than the null (intercept-only) model unless we dropped treatment group as a factor.

Descriptive statistics
Participants in the Control group washed the backs of both hands 3.86 times on average (SD = 2.50) compared with 3.78 times (SD = 2.37) in the Timer group and 4.34 times (SD = 2.35) in the Precommitment group (Table 1).

Confirmatory inferential analysis
Our best-fit Poisson log-link regression model found no effect of either intervention on the number of handwashing attempts in which participants washed the backs of both hands, controlling for the total number of handwashing attempts (Table 5).This was contrary to our hypotheses, which posited that both interventions would improve compliance with proper handwashing technique.
Taking together the results for soap use and handwashing technique, we find no evidence of an effect on handwashing quality for either intervention treatment.This result is contrary to both Hypothesis 1 and Hypothesis 2.

Post hoc exploratory analysis
Nonetheless, over the whole sample, participants who washed their hands for longer on average were more likely to wash the backs of both hands (ρ = 0.42, p < 0.001).When we split the sample by treatment group, this finding holds for the Timer (ρ = 0.55, p = <0.001) and Control groups (ρ = 0.49, p = <0.001),but not in the Precommitment group (ρ = 0.23, p = 0.111).

Discussion
We found evidence that both a timer and a precommitment intervention led to an increase in the amount of time participants spent rubbing their hands with soap before rinsing.Participants who had access to a tap-mounted timer washed their hands for 1.9 s longer on average than Control group participants.Likewise, participants who committed in advance to five statements of good hand hygiene washed their hands for 2.5 s longer on average than those in the Control group.
We did not find any evidence of a concomitant effect on either the frequency or quality of handwashing.Participants washed their hands on average 5.97 times across the food preparation task, washed their hands with soap 5.02 times, and washed the backs of both hands with soap 3.97 times.
These results provide mixed evidence for our initial hypotheses.Hypothesis 1 predicted the observed effect of the timer on duration and the lack of effect on frequency.However, it also stated there would be an improvement in the quality of handwashing, which was not observed.Hypothesis 2 correctly predicted the increase in handwashing duration in the Precommitment group, but we also expected to see increases in handwashing frequency and quality, neither of which were observed.

Effectiveness of the timer intervention
The tap-mounted timer had the expected effect on handwashing duration, but there were two surprising findings: Firstly, the magnitude of the effect was smaller than has been previously reported in other studies (Yu et al., 2018;Her et al., 2019).Yu and colleagues reported that attaching a musical timer to soap dispensers led to a 6.4 s increase in handwashing duration relative to pre-test baseline.Her and colleagues found a 4.1 s increase using the same timer device as used in this study, again relative to a pre-test baseline.Our observed mean difference (1.9 s) is smaller.Only eight participants in the Timer group washed their hands for the advised length of time or longer, so a ceiling effect is unlikely to explain why the timer appeared to have a smaller effect in this study than in previous studies.
There are methodological differences between this study and those previously published, which may account for some of the discrepancy in observed effect sizes.We applied a stricter definition of handwashing duration than those studiesbeginning with first application of soap and ending when rinsing starts.Yu and colleagues defined handwashing as ending when participants wiped their hands and Her and colleagues defined the endpoint as when the tap was turned off.These definitional differences seem likely to underpin the higher baseline durations reported in those studies.
Another reason we might expect a smaller effect than previously reported is if our participants did not engage as well with the intervention.This would not be surprising as previously reported studies were field trials run over multiple weeks, so participants encountered the interventions repeatedly.By contrast, our participants generally washed their hands 5-6 times in a 25 min task, so they had limited opportunities to familiarise themselves with the timer.Indeed, the most common reason given for not using the timer was 'I didn't notice it'.Nearly half (n = 31) of Timer group participants said they did not use the timer to keep track of how long they washed their hands for.If we drop these participants from our main analysis, the effect of the timer becomes larger and more robust.This finding makes intuitive sensethe timer works most well for those users consciously engaging with itbut it is the result of a post hoc split so we cannot infer causality.
The second surprising finding was that, while the timer did appear to have the expected effect on duration, we did not see a concomitant increase in participants washing the backs of their hands.We hypothesised that the timer would increase quality for two reasons: Firstly, the poster installed by the sink in the Timer group showed the FSA's guidance on handwashing techniques, including explicitly prompting participants to wash the backs of their hands.Secondly, washing one's hands for longer makes it easier to follow the guidance on technique.We did observe an overall relationship between mean duration and handwashing technique, but this did not manifest as a detectable difference across treatment group.In other words, participants spent longer washing their hands, but it is not clear that they achieved better hand hygiene by doing so.
This does raise the question of whether directly targeting handwashing duration is worthwhile.Previous work using the same timer device found an effect on duration with no concomitant improvement in quality measures (Her et al., 2019).Similarly, training methods focusing on duration in children led to some parts of the hands (e.g., fingertips, palms) being missed (Öncü and Vayısoğlu, 2021).
We did not observe an effect on soap use either, suggesting that the poster did not prompt participants who were not planning to use soap to do so.Soap use was generally high, being used for at least 80% of all handwashing attempts, across all groups.It should be noted that duration was only measured for handwashing attempts in which soap was used, so we did not explore the relationship between the soap use and duration.
Finally, we did not find evidence of a 'backfire' effect on handwashing frequency (Osman et al., 2020).In other words, the timer did not detectably discourage handwashing by making it more onerous.

Effectiveness of the precommitment intervention
The precommitment intervention did not have the expected effects on the frequency of handwashing, the frequency of soap-use, or whether participants applied proper handwashing technique by washing the backs of their hands.It was, however, associated with an increase in mean handwashing duration.We designed the precommitment intervention to directly target frequency and quality, with any effect on duration being a secondary consequence.Our results are therefore difficult to interpret cleanly.
Firstly, we consider why the precommitment intervention did not have the expected proximate effect on handwashing frequency or quality.In general, the evidence on the effectiveness of pledges and especially written pledges is mixed, with some studies finding positive effects (Katzev and Pardini, 1987;Lokhorst et al., 2013), but others finding they are not effective (e.g., Shu et al., 2012).
Perhaps participants made the commitment dishonestly and did not intend to follow through.The study offered money for taking part and committing to the prompt statements was mandatory for participants in the Precommitment group, so they were financially incentivised to do so.Even if the commitment were made honestly, we might not have induced a strong enough commitment for it to be effective.For instance, it has been hypothesised that people keep commitments to maintain a positive self-image (Cialdini, 2008) or reduce cognitive dissonance (Harmon-Jones and Harmon-Jones, 2012).It therefore follows that if the commitment is not strong enough to trigger self-image concerns or cause significant cognitive dissonance, it will not be effective (Sheeran et al., 2005).Since signing the prompt statements was mandatory to complete the online training, participants may have clicked through without feeling strongly enough committed to trigger these mechanisms.
Even if participants did intend to follow through on their prior commitments, there may have been an 'intention-action gap' at play (Sheeran and Webb, 2016).For example, they may have underestimated how difficult adhering to their commitment was when completing the food preparation task under time pressure.Alternatively, they may have failed to remember what they had committed to.The former seems unlikely: only three participants in the Precommitment group said they found the task 'Somewhat difficult' and none answered 'Very difficult'.Recall failure is more plausible: while a majority of the participants in the Precommitment group correctly recalled each prompt statement in the post-trial quiz, only eight (14%) correctly recalled them all.
Alternatively, it has been hypothesised that people keep commitments because the commitment is worded as an 'implementation intention', which lays down an automatic process where an action is triggered in response to a situational cue (Cialdini, 2008;Lokhorst et al., 2013).However, if the precommitment is not worded specifically enough (either the cue to action is not worded specifically enough or the connection between the situation and action is not specific enough) then it might not be effective (Gollwitzer et al., 2010).Implementation intentions have been shown to be successful at inducing rare or one-off actions, such as voting in an election or having a flu vaccination.Our precommitments were general enough to cover all food preparation tasks.It may be that they would have worked if they were more specifically tied to our burger-making task, which raises the possibility that implementation intentions are not as effective for such general activities.
As with the timer intervention, the absence of an effect on soap-use may be at least partially explained by a ceiling effect.Soap was used for at least 80% of all handwashing attempts, across all groups.
The finding of increased duration in the absence of any detectable changes in how frequently participants washed the backs of their hands is puzzling.It is possible that another aspect of handwashing technique did improve and that this accounted for the change in duration.The precommitment statements and posters prompted participants to wash their fingertips, but this could not be reliably measured from test footage and so was dropped from our list of outcomes ahead of preregistration.However, this is an entirely speculative account, with no supporting evidence.
Taken together, we found no evidence suggesting that the precommitment intervention had the intended immediate effect on handwashing frequency and quality.
There was an increase in duration relative to control, but with no obvious causal mechanism we recommend treating this finding with caution.

Gender differences
Women washed their hands approximately 12% more often than men during the task, independent of treatment group.In general, this is consistent with the finding that gender influences handwashing rates (Kinnison et al., 2004).However, we find no indication that the effectiveness of our interventions was moderated by gender as in previous hand hygiene intervention studies (Judah et al., 2009;Porzig-Drummond et al., 2009).No gender differences were observed for quality or duration of handwashing attempts.However, we note that we were not powered to detect interaction effects, which would have required a much larger sample size and may be an avenue for further research.

Relationship to other handwashing literatures
There is substantial literature on interventions to improve hand hygiene in Lower Middle Income Countries (LMICs) and in hospitals, but neither of these areas are very relevant to food handlers.Barriers and facilitators to hand hygiene in LMICs are very culturally specific, for example inadequate education and training, cultural beliefs, lack of resources and substandard government regulations (Delva et al., 2022); and in hospitals infrastructural deficits (Ataiyero et al., 2019) and overcrowding and lack of alcohol-based handrub (Loftus et al., 2019).In hospitals, there are increasing efforts to supplement or replace the use of soap and water with alcoholbased hand rub, as per the WHO multimodal hand hygiene improvement strategy (Lotfinejad et al., 2021).

Strengths and limitations
To our knowledge, this is the first randomised controlled laboratory trial monitoring handwashing behaviour in food handlers.This approach allowed us to control many potentially confounding sources of variation in hand hygiene to draw robust conclusions about the effectiveness of our interventions.For example, much of the existing literature compares hand hygiene before and after an intervention is introduced (Yu et al., 2018;Her et al., 2019).These 'pre-post' studies are often easier to implement than between-subjects randomisation, but risk confounding treatment effects with unrelated changes during the fieldwork period (e.g., prevalence of COVID-19).The laboratory setting of this study afforded us considerable control over the environment in which participants prepared food, as well as their baseline level of training.We could therefore isolate treatment effects and, taken with our secondary measures, make inferences about the mechanisms by which our interventions were operating.Finally, the emphasis on behavioural observation means our results will not suffer from the 'intention-action' gap often associated with measures of intention to act (Sheeran and Webb, 2016).By using video cameras to record handwashing, we were able to measure duration and quality of handwashing, as well as frequency.A set of decoy cameras distracted participants' attention from the purpose of the trialfairly successfully, given that only 21% correctly guessed the purpose of the experiment in the post-experiment survey.
While laboratory studies allow for more complete control of how interventions are applied, they do come with a number of critical limitations.Firstly, the study could not take place in a real professional kitchen, so the food preparation task and time pressure under which it was completed were necessarily artificial.Most obvious was the safety constraint that we could not ask participants to actually cook any food, just to prepare it.It also meant that participants were preparing food in an unfamiliar setting and in the absence of the usual social environment of their workplace.This means our findings may lack ecological validity.
The study cannot support inferences about the longevity of any observed effects.The food preparation task lasted up to 25 min, so participants in the Timer group did not have long to notice the timer and discern from the poster how to use it, while preparing food under a time limit.We cannot know from our results whether the effect on duration would grow stronger over time as use becomes habitual or if it would attenuate as food handlers learn to ignore it.Similarly, the training and precommitment intervention took place at most two weeks before the testing session.To determine longevity, one would need to conduct a field trial with a longer data collection period.
The nature of the population of interest means we cannot claim to have a representative sample.We opted to use a recruiter to access our sample.This had the advantage of speed and cost-effectiveness but did mean less control over the sampling process than if we had carried out all recruitment ourselves.Because our design allocated treatment at random, we would not expect sampling biases to confound our results, but it is possible that our participants were more or less sensitive to our interventions than food handlers as a population would be.The only practical alternative in the timeframe available was to recruit through an employer, which poses its own practical (will participants feel pressured to act in a given manner?) or ethical (how can we guarantee participants do not feel obliged to take part?) problems.
Finally, laboratory trials are expensive for the number of observations they yield, so (despite having a considerably larger number of units of randomisation than cluster randomised field trials) our sample size is relatively low.This means that we would not have the power to detect interactions or smaller effects of interest, or to interrogate our findings through subsample analysis.This issue was compounded by practical difficulties in recruitment.Late drop-out rates were far higher than the recruiter had anticipated, and this led us to miss our target sample size even after extending fieldwork to accommodate more testing sessions.
One major limitation of studies with small sample sizes is that they may be prone to false positive results, especially when considering the impact of interventions across multiple outcomes.A conservative solution to this problem is to adjust the threshold for statistical significance according to the number of outcomes tested, as we did when accounting for multiple comparisons between arms.In practice, however, this would reduce power so much it could preclude most laboratory trials from ever detecting a genuine effect, and even then, such adjustments may not achieve anything if the outcomes do not co-vary.Alternatively, one might drop some outcomes from the inferential analysis, but the risk of doing so is that important consequences of a prospective intervention are overlooked by policymakers, including critical 'backfire' effects.
Taking together the limitations set out above, there is reason for a dose of healthy scepticism when appraising the results of any one trial.We nonetheless believe our findings have considerable value when placed in context.Firstly, we must consider the public policy context for the work.The study's primary aim was to identify interventions to improve hand hygiene in FBO workers to reduce the spread of foodborne disease.We consider the success or failure of an intervention across outcomes through that lens, and accordingly have posed the question of how useful it is to increase handwashing duration without a concomitant increase in quality.
We must also consider our results in their research context.This paper begins with a review of the available evidence, the most relevant of which is again discussed in this section.Our findings for the tap-mounted timer intervention, in particular, replicate those reported from previous studies and one might, therefore, lend them greater credence than the findings for the precommitment intervention.
Finally, we should appraise the results of public policy research on how it may inform future policy development.In the case of a laboratory trial such as this, the logical next step for a promising intervention is to carry out a field trial to gauge effectiveness in the real world.Alternatively, insights into the mechanisms by which the intervention is purported to act may lead to additional intervention development or exploratory research to identify relevant barriers and facilitators.In our view, the findings reported in this paper constitute a strong basis for further work.

Conclusion
We found evidence that the tap-mounted timer had the expected direct effect of encouraging users to spend longer washing their hands but did not have any effect on quality.This result replicates similar findings from a prior pre-post field trial using the same timer device.We also found effects on duration but not quality for the precommitment intervention, but we have not got a good explanation for this finding, which would require further research to establish whether it is a real effect andif sowhat is its cause.There were no backfire effects on frequency, increased duration did not lead to participants washing their hands less often.We are not aware of any work exploring the long-term effectiveness of the timer devices on duration; this should be addressed in future research.More fundamentally, it is unclear whether the achieved increase in duration results in any meaningful improvement in hand hygiene because of the lack of improvement in handwashing quality.Supplementary material.To view supplementary material for this article, please visit https://doi.org/10.1017/bpp.2023.33.

Figure 1 .
Figure 1.Image of workstation in the experimental laboratory kitchen.

Figure 2 .
Figure 2. Poster placed by the sink for the Timer group.The top-right shows the tap-mounted 'SaniTimer' device, which begins a 30 s countdown whenever the tap is turned on.The poster encourages participants to use the timer to make sure they wash their hands for 20 s and use proper handwashing technique.

Figure 3 .
Figure3.Posters for the Precommitment group.The upper poster was placed in view of participants' workstations, so they could see it while preparing food.The lower poster was placed in view of the sink so it could be seen while participants washed their hands.

Table 2 .
GLM model with mean duration as the outcome variable Note: n = 188 (seven cases dropped due to missing data for one or more covariate(s)).

Table 3 .
GLM Poisson model with handwash frequency as the outcome variableNote: n = 189 (six cases dropped due to missing data for one or more covariate(s)).Exp(β) is the exponentiated beta coefficient.Here, >1 indicates a positive rate of change on the outcome variable and <1 indicates a negative rate of change.

Table 4 .
GLM Poisson model with soap count as the outcome variableNote: n = 191 (four cases dropped due to missing data for one or more covariate(s)).Exp(β) is the exponentiated beta coefficient.Here, >1 indicates a positive rate of change on the outcome variable and <1 indicates a negative rate of change.

Table 5 .
GLM Poisson model with back-hands count as the outcome variableNote: n = 184 (nine cases dropped due to missing data for one or more covariate(s)).Exp(β) is the exponentiated beta coefficient.Here, >1 indicates a positive rate of change on the outcome variable and <1 indicates a negative rate of change.