Recent years have seen a dramatic increase in the number of people experiencing common mental health conditions, especially among young women (Chen et al., Reference Chen, Huang, Zhang, Song, Zhao, Sun, Wang, Wang, Sun, Zhou, Zhu, Wang, Xu, Bai and Chang2025). Alongside this greater mental health burden, common physical health conditions such as obesity and diabetes are also rising in young people (Taylor, Reference Taylor2024; Xie et al., Reference Xie, Wang, Long, Ning, Li, Cao, Liao, Liu, Wang and Pan2022). As such, there is a growing need for high-quality research to better understand who is likely to develop a common mental health condition, how this relates to the development of common physical health conditions, and how best to intervene. Data from longitudinal population studies provide a rich source of information about the development of mental health conditions across time. However, there are additional, routinely collected sources of information that may shed light on health outcomes.
In the UK, the National Health Service (NHS) provides taxpayer-funded healthcare for those living in all four UK countries. Recent figures from the Association of British Insurers indicate that only 4.7 million people across the UK are covered by private medical insurance (Association of British Insurers, 2024), meaning that the majority of the UK population utilitizes the NHS for healthcare. As such, NHS data provide valuable insights into population health across the full range of geographical and socioeconomic circumstances. The adoption of electronic health records in healthcare settings has increased the possibility of using routinely collected health data for secondary research (Casey et al., Reference Casey, Schwartz, Stewart and Adler2016). Furthermore, a major government focus in recent years has been to integrate the use of electronic health records into all NHS trusts to improve care and support the development of research (Department of Health and Social Care, 2022), as well as reduce health inequalities (Chadd et al., Reference Chadd, Caute, Pettican and Enderby2025). This article outlines the process and outcome of linking the Twins Early Development Study, a UK population-based cohort study, to routine health records collected by the UK NHS.
The Twins Early Development Study (TEDS)
TEDS is a longitudinal cohort of twins born in England and Wales between 1994 and 1996 (Lockhart et al., Reference Lockhart, Bright, Ahmadzadeh, Breen, Bristow, Boyd, Downs, Hotopf, Palaiologou, Rimfeld, Maxwell, Malanchini, McAdams, McMillan, Plomin and Eley2023). Data have been collected at regular intervals between ages 18 months and 30 years, allowing researchers to explore the genetic and environmental contributions to individual differences in the development of cognition, behavior, mental and physical health. TEDS worked with the Office for National Statistics (ONS) to identify families with twins born between January 1994 and December 1996 in England and Wales using electronic birth records. Identified families were invited to join the study, with 13,769 consenting to take part at the first data collection when their twins were 18 months (Trouton et al., Reference Trouton, Spinath and Plomin2002). Subsequently, data have been collected when twins were aged 2, 3, 4, 7, 8, 9, 10, 12, 14, 16, 18, 21, 26, and 30 years old. Information has been reported by parents, twins and teachers, with specific informants varying between waves, providing rich insight into the twins’ behavior across contexts. All previous data collections have been reported in more detail elsewhere (Haworth et al., Reference Haworth, Davis and Plomin2013; Lockhart et al., Reference Lockhart, Bright, Ahmadzadeh, Breen, Bristow, Boyd, Downs, Hotopf, Palaiologou, Rimfeld, Maxwell, Malanchini, McAdams, McMillan, Plomin and Eley2023; Oliver & Plomin, Reference Oliver and Plomin2007; Rimfeld et al., Reference Rimfeld, Malanchini, Spargo, Spickernell, Selzam, McMillan, Dale, Eley and Plomin2019; Trouton et al., Reference Trouton, Spinath and Plomin2002) and on the TEDS Data Dictionary (TEDS Data Dictionary, n.d.). With participants now approximately 30 years old, TEDS is well placed to examine how early life experiences impact health and social outcomes in early adulthood.
The Benefits of Linkage
Linking to medical records increases the granularity and accuracy of health-related data by providing a wealth of new information about participants’ physical and mental health conditions, access to treatment, medication dosages, and interaction with services. Used in combination with the longitudinal data collected in TEDS, these records provide new opportunities to explore the mechanisms underpinning the development of health difficulties. The linked data will also provide insight into public health, based on service use, trajectories of healthcare contact and diagnoses, alongside information about lifestyle and subclinical symptoms from longitudinal research data.
In addition to increasing data, the inclusion of medical records in research increases the fairness of cohort-based research. Not all TEDS participants contribute to every wave of data collection, but rather, many re-engage after a period of nonparticipation, leading to variability in attrition rates over time. For example, 91% of the twins who completed the assessment at age 12 responded to at least one of the subsequent waves of assessment. The varying participation between ages 12, 16, 21 and 26 of TEDS twins who have not withdrawn from the study is shown in the changes from the green to yellow sections across time in Figure 1. This inconsistency in participation can result in missing data for some participants between ages. Linking to routinely collected data is one approach to provide alternative sources of data and to assess and inform statistical approaches to address bias and wider data quality issues. As seen in Figure 1, linkage work accesses data on many participants who have missed some assessment waves or not taken part at all since age 12. In addition to those who have been included in linkage (dark green) and those who have opted out (red), the linkage column in Figure 1 also includes participants who have not been included in medical record linkage so far. This group includes participants who we could not trace through the NHS Patient Demographic Service (PDS; more detail below) or who were traced but the address provided was not usable. This group also includes individuals who are part of the current Environmental Risk (E-Risk) study (Moffitt & E-Risk Study Team, Reference Moffitt2002) data collection but have not yet been interviewed. This group are being invited to provide consent for medical record linkage as part of that data collection.
Returns of twin-reported data at ages 12, 16, 21 and 26.
Note: Green represents participation at each wave for participants who took part at age 12. Those not yet included in linkage (dark yellow in the linkage column) were either not traceable via the Patient Demographic Service, were traced but with an unusable address, or are part of the Environmental Risk (E-Risk) study and therefore were not sent fair processing at this time.

Participation in longitudinal studies is easier for some groups than others. For example, in Western countries, highly educated, healthy and white individuals — reflecting a ‘healthy volunteer’ bias — tend to be overrepresented in research samples (Henrich et al., Reference Henrich, Heine and Norenzayan2010). Furthermore, individuals facing worse health outcomes, having higher rates of behaviors associated with ill-health and complex social adversities, are more likely to be lost from follow-up waves of data collection (Cornish et al., Reference Cornish, Macleod, Boyd and Tilling2021; Delfabbro et al., Reference Delfabbro, Winefield, Winefield, Malvaso and Plueckhahn2017; Watson & Wooden, Reference Watson, Wooden and Lynn2009; Young et al., Reference Young, Powers and Bell2006). There are several reasons why research is often worse at reaching or re-engaging individuals from marginalized groups, including mistrust, perceptions that there is no benefit to their community in taking part, stigma, frequent changes in contact details or home address (Bonevski et al., Reference Bonevski, Randell, Paul, Chapman, Twyman, Bryant, Brozek and Hughes2014). As such, there are groups of individuals who are excluded from medical and public health research studies and by extension are less likely to benefit from published evidence as they may not translate across population groups. By providing data on individuals who may not have been able to take part over the years, medical record linkage increases the fairness of research. In some cases, it may also encourage re-engagement in the study for participants with whom we have lost touch (see more below about the linkage process for this group). In the long term, linkage to routinely collected records also increases participant convenience. Providing detailed information about health and treatments can be time consuming for participants. Linked records allow for such details to be accessed without lengthy questionnaires, and provide information that may be difficult to accurately recall, such as date-sequenced information on different medication regimes.
Medical Record Linkage in TEDS
Accessing medical record data from the NHS is an ongoing multistage process that has included consulting TEDS participants, securing the relevant approvals, informing study members of the research plans, and finally undertaking accurate linkage. A full timeline of the linkage program thus far is outlined in Figure 2.
Timeline of medical record linkage approvals in TEDS.
Note: PPIE, Patient and public involvement and engagement; CAG, Confidentiality Advisory Group; REC, Research Ethics Committee; DARS, Data Access Request Service; UK LLC, UK Longitudinal Linkage Collaboration.

Patient and Public Involvement and Engagement (PPIE) Consultations
The TEDS twins were consulted at every stage throughout the development of the medical record linkage program to ensure the research was acceptable to study participants. This was done through a mixture of in-person and online Patient and Public Involvement and Engagement (PPIE) consultations. These sessions assessed three main themes: (1) the proposed study approach and methodology, (2) the clarity and readability of the study materials, and (3) the types of research questions that would be appropriate and beneficial to investigate with linked medical record data.
To ensure that PPIE results represented as broad a group of participants as possible, we carried out consultations specifically with individuals who had been inactive in the study. Participants included in this group had been recently traced by the TEDS team and had not taken part since at least age 18, some not since age 4.
The feedback received from both active and inactive twins during the consultations was overwhelmingly positive in terms of study participants agreeing that the opt-out method was acceptable and that linkage would expand the benefit of TEDS research. Feedback from participants also improved the clarity of the study materials. For example, based on feedback that some people may struggle to process the lengthy fair processing document, the TEDS team created an animated version of the document. When asked about their views on the use of medical records in research, all participants involved in the PPIE felt it was either very important or important to explore questions relating to mental health.
Ethics and Legal Basis
Within the UK, record linkage requires a legal basis that includes either seeking explicit participant consent or using specific legislation that enables public benefit research to set aside consent requirements (subject to making all reasonable efforts to inform participants and provide a means to object). Given that a key objective of medical record linkage work is to improve research equity through capturing data on marginalized and harder-to-research participants lost to attrition, and that the same factors that drive attrition would also impact efforts to obtain consent, seeking consent was not practical in TEDS. Proceeding without consent seeks an appropriate balance between a participants’ individual right to control the use of their data with the broader study aims for societal benefit.
To address this ethical issue, the NHS in England have developed a ‘national data opt-out’ mechanism, where NHS patients can choose to object to their NHS data being used for planning and research. Wales, Scotland and Northern Ireland do not currently have comparable national opt-out policies. However, other more localized options exist depending on the country. For example, in Wales, patients are able to opt out of their GP records being shared for planning and research purposes directly through their GP. TEDS respects all objections given directly to the study or through the national opt-out scheme. Participants can object at any point, not just when information is initially sent, and, as such, the number of participants with linked records may vary slightly over time.
TEDS was approved by the Health Research Authority Research Ethics Committee (REC; reference number: 22/SC/0313) and Confidentiality Advisory Group (CAG; reference number: 22/CAG/0038) to use an ‘opt-out’ method for accessing medical records, rather than a consent-based model. This opt-out objection mechanism requires studies to meet strict criteria, only being granted in circumstances where the NHS deems research to be in the public interest and obtaining consent is not practical.
Legal and Regulatory Basis for Opt-Out Method
TEDS complies with UK General Data Protection Regulation (GDPR) for holding and processing participant study data. Specifically for TEDS this is GDPR and Data Protection Act 2018 Article 6(1)(e) ‘performance of a task carried out in the public interest’, and where sensitive information is involved, it is Article 9(2)(j) in accordance with Article 89(1) ‘scientific or historical research purposes or statistical purposes’.
However, accessing and processing personal data for linkage purposes in the UK requires Duty of Confidentiality in UK Common Law. TEDS meets this by using an opt-out methodology covered by Section 251 of the National Health Service Act 2006. The Section 251 legal basis covers the use of confidential patient information for research in the public benefit where it is not practical to obtain consent. The process of gaining Section 251 approval includes an evaluation of the feasibility of fully anonymizing the data. In the context of TEDS, the full anonymization of medical record data would render it impossible to link to existing TEDS research data and conduct ongoing participant follow-up. However, to ensure confidentiality is maintained, records are deidentified and stored within a highly controlled trusted research environment. Because Section 251 approval allows for the disclosure of confidential patient information without consent, studies must meet rigorous standards to obtain this approval.
Linkage Process
As part of the opt-out approval, it is the responsibility of the study team to provide detailed information to participants on the research plans, a process henceforth referred to as fair processing. Fair processing provides the opportunity for individuals to make an informed decision about including their data in linkage and outlines the process for opting out. To use this method, research teams must make every reasonable effort to provide fair processing information to their study participants. With this in mind, TEDS planned fair processing to be completed in two waves. Wave 1 consisted of ∼11,500 participants for whom we were confident we held an up-to-date home address. These participants had completed at least one of the two most recent waves of whole cohort data collection, in which participants were asked to provide their current address. Fair processing was always sent by postal mail and also via email where email addresses were available. Additionally, any participants who had recently updated the study team with a change of address, or who had been successfully traced using publicly available records, were included. Subsequently, Wave 2 included participants without up-to-date contact details, many of whom had not participated in recent waves of data collection. For this group, the TEDS team gained approval from the NHS Data Access Request Service (DARS) to access these participants’ current contact details held by participants’ General Practice (GP) surgeries through the Patient Demographics Service (PDS). This additional step was taken to ensure that all TEDS participants had the opportunity to opt out of the medical record linkage program. As part of the fair processing campaign, every TEDS participant received the fair processing leaflet and a reminder letter approximately 2 weeks later by post. To opt out of medical record linkage, participants were asked to complete an opt-out form and return it using a prepaid postal envelope or via email. At future face-to-face data collections, either in person or online, TEDS participants will be asked for explicit consent for linkages. Consent collections are already underway for the subset of TEDS twins (n = 2232) included in the E-Risk Study.
To date, medical record linkage has had a positive uptake. Of those twins included in Wave 1, only 9.8% opted out of linkage activities after receiving fair processing materials. To date, the Wave 2 opt-out rate is similar at 7.7%. Of the subgroup who have been asked for consent, 85% have provided consent.
Once the fair processing process is complete (e.g., both the leaflet and reminder letter have been sent), the TEDS team sends participant identifiers and data use permission flags (‘File 1’) to the UK Longitudinal Linkage Collaboration (UK LLC), the national Trusted Research Environment (TRE) for the UK’s longitudinal research community led by the University of Bristol and University of Edinburgh. The UK LLC is designed to support the UK’s unparalleled collection of Longitudinal Population Studies (LPS) by providing record linkage and secure analysis and data curation services. The identifiers are sent directly from TEDS to the UK LLC’s linkage broker, the NHS Digital Health Care Wales (DHCW), who facilitate the linkage to electronic health records through wider NHS’ trusted partners across the UK’s four nations. Subsequently, relevant de-identified NHS data linked with the unique TEDS identifier is transferred back into the UK LLC via their TRE infrastructure host (Secure eResearch Platform, run by Swansea University). For an extra layer of security, the associated TEDS identifier is encrypted before the data are deposited in the TRE, so that the identifier is unrecognisable. In a separate transfer, TEDS deposits relevant de-identified study data from the TEDS research database into UK LLC (‘File 2’). Study data are sent alongside the same unique ID, encrypted in the same way, to ensure correct linkage with health records and future linkages. This is a two-step dynamic process with participants being added or removed as they receive fair processing or opt out. The NHS datasets are also regularly refreshed by the UK LLC data team to include updated medical records. The flow of participant identifiers and de-identified research and medical record data is outlined in Figure 3.
TEDS and UK LLC flow of participant identifiers and de-identified data.
Note: TRE, Trusted Research Environment. Figure adapted from Boyd et al., (Reference Boyd, Evans, Turner, Flaig, Oakley, Campbell, Thomas, McLachlan, Crane, Whitehorn, Calkin, Hill, Berman, Ford, Tobin, Porteous, Gomes, Garcia, Wong and Chaturvedi2025).

TEDS refreshes the identifiers and permission flags of participants in the UK LLC on a regular basis, enabling UK LLC to manage study withdrawals or objections, or to seek to establish a linkage for those who have updated their identifiers.
The protocol for TEDS linkage followed precedents set by other cohort studies, including TwinsUK (Moayyeri et al., Reference Moayyeri, Hammond, Hart and Spector2013) and the Avon Longitudinal Study of Parents and Children (ALSPAC; Boyd et al., Reference Boyd, Golding, Macleod, Lawlor, Fraser, Henderson, Molloy, Ness, Ring and Davey Smith2013).
Access to TEDS Data Using the UK LLC
The TEDS linkage program has been developed with the support of the UK Longitudinal Linkage Collaboration (Boyd et al., Reference Boyd, Evans, Turner, Flaig, Oakley, Campbell, Thomas, McLachlan, Crane, Whitehorn, Calkin, Hill, Berman, Ford, Tobin, Porteous, Gomes, Garcia, Wong and Chaturvedi2025). TEDS is now one of 22 longitudinal population studies hosting data in the UK LLC TRE. The UK LLC is a discoverable and accessible resource for public good research, that is, research that is likely to benefit a large number of people in the general population. The data available within the TRE is easy to explore through the UK LLC explore page (https://explore.ukllc.ac.uk/) and guidebook (https://guidebook.ukllc.ac.uk/docs/welcome.html).
UK-based bona fide researchers can apply to access data within the UK LLC to carry out research for public benefit through a streamlined application process that balances the needs of research studies and members of the public. Applicants must hold the Office of National Statistics (ONS) Accredited Researcher status prior to accessing any data, be based in the UK, and employed at an organization with sufficient capability to support the research. The application process includes completing an initial expression of interest form, which is reviewed by the UK LLC team to assess project feasibility. If approved, applicants are invited to submit a full application form that is reviewed by the UK LLC team, UK LLC Public Advisory Group, as well as relevant study teams (i.e., TEDS review applications to use TEDS data). All approved research projects are publicly catalogued in the UK LLC Data Use Register (https://ukllc.ac.uk/data-use-register).
Linkage Sample in TEDS
Record linkage to the NHS England centralized database is conducted by NHS England using a series of deterministic linkage algorithms (All linked data copyright ©2026, NHS England. Re-used with the permission of NHS England and UK LLC. All rights reserved). The first algorithm identifies exact, unique matches between all study identifiers (first name, family name, date of birth, address) and the equivalent identifiers in the NHS patient register. Where no exact match is found, a series of subsequent algorithms are applied which relax part of the criteria (e.g., only the first letter of the first name and exact matches of all other identifiers). Post linkage, the national data opt-out is checked and participants who have objected are removed. The current overall linkage rate between TEDS and the patient register is 78%. Once linked to the patient register, internal and consistent NHS identifiers are used to identify and extract the records of participants from a wide range of electronic health records databases.
Linkage failure may occur for a number of reasons. These include participants who have requested the national opt-out or who do not reside or seek care in England, errors in recorded information, changes in personal identifiers over time (e.g., name, gender, or registered address) or an insufficient number of available identifiers (Doidge et al., Reference Doidge, Christen and Harron2021). We note that linkage is especially challenging for twins as twin pairs often share a wide range of identifiers (e.g., social naming patterns in the UK mean they often share the first letter of their first name, as well as family name, date of birth and address). TEDS does not hold the NHS numbers of the study participants, which may partially explain the 22% of participant
s who were not successfully linked. NHS England does not provide a breakdown of reasons for linkage failure, as information on national opt-out is considered sensitive and not sharable where the opt-out is applied. Furthermore, no metrics are provided on the linkage quality (degree of matching identifiers) of the linked or nonlinked samples. Notably, although the UK LLC has approval in place for records held by both NHS Scotland and Wales, these data have not yet been deposited in the TRE. The availability of these records will likely increase the TEDS linkage rate, particularly as ∼5% of the TEDS sample reside in Wales.
In terms of socio-demographic characteristics collected at first contact, the Wave 1 linkage sample has higher levels of female and monozygotic twins than the TEDS sample at first contact (Table 1). Twins included in the Wave 1 linkage sample also have parents with slightly higher levels of qualifications and employment relative to the full first contact sample. However, the Wave 1 linkage sample only includes twins who have completed a recent wave of data collection. As such, the higher rates of female and monozygotic twins and a skew towards higher family SES likely reflect participation biases seen in volunteer samples. In comparison, participants in Wave 2 have lower levels of monozygotic twin pairs, female sex, and family-level SES indicators than the full first contact sample. This indicates that the inclusion of Wave 2 participants in the linked sample helps to reduce biases arising from sample attrition. Of those who we hoped to include in Wave 2, 73.3% were successfully traced through the NHS Patient Demographic Service. Of those successfully traced, 9.5% had unusable addresses. Overall, ∼7200 participants with whom we had lost touch were sent fair processing in Wave 2.
Comparison of socio-demographic characteristics recorded at first contact for the full TEDS and linkage subsamples

Note: All twin and family socio-demographic characteristics reported in this table are based on parent-reported questionnaire data at first contact. The standard analytical exclusions have not been applied as these may not be relevant for analyses using linked data. The Wave 1 demographics have been calculated based on participants included in Wave 1 following fair processing and opt-outs. Wave 2 demographics have been calculated based on participants included in fair processing Wave 2 who have not opted out. The Ns and demographics of the final linked sample in the UK LLC will differ slightly depending on the success rate of linkage to medical records.
Available NHS Datasets Linked in the UK LLC
So far, the routine health datasets linked to TEDS participants include data held by NHS England. The full list of available datasets, the number of TEDS participants included, and temporal coverage is presented in Table 2.
NHS datasets available in the UK Longitudinal Linkage Collaboration

Note: Some NHS datasets (MHSDS, Talking Therapies for Anxiety and Depression, CSDS) contain multiple data frames, some of which contain a smaller number of participants. More specific information is available in the UK LLC Guidebook (Welcome to UK LLC Guidebook — UK LLC Dataset Documentation, n.d.). Data Copyright © (2025), NHS England. Data reused with the permission of NHS England. All rights reserved.
Primary Care Datasets
Records are available from two primary care datasets. The Primary Care Medicines dataset includes information on medicines dispensed in the community resulting from prescriptions issued by GPs, clinics, dentists, and community nursing services. The General Practice Extraction Service Data for Pandemic Planning and Research (GDPPR) dataset includes a more extensive list of clinical codes relating to health conditions relevant to pandemic planning and research. These datasets have coverage across England. Projects using the GDPPR dataset are legally restricted to informing COVID-19 relevant research and must not be used for any General Practice performance management.
Hospital Datasets
TEDS participants are linked to all seven of the NHS England hospital datasets included in the UK LLC. The three main hospital datasets are Outpatient care, Admitted Patient care, and Emergency care. Outpatient covers all hospital appointments that do not require an overnight hospital stay, including referrals, waiting times, and diagnoses. Admitted Patient care covers the same for inpatient care, as well as datasets relating to Augmented Care Periods, Maternity Care, and Critical Care. There are two emergency care datasets; the Accident and Emergency dataset was retired in 2020 and replaced by the Emergency Care Data Set. Both emergency datasets include information on presenting symptoms, outcomes, waiting times, diagnosis and treatments. Together these hospital datasets include records from 1998 onwards, covering the majority of the TEDS twins’ lives but excluding birth and early-year data.
Mental Health Datasets
There are data available from two NHS England mental health datasets: Mental Health Services Data Set (MHSDS) and the NHS Talking Therapies for Anxiety and Depression. MHSDS includes data on individuals who have received treatment for mental health, wellbeing, learning disability, autism or other neurodevelopmental conditions in an NHS-funded secondary care service.
The NHS Talking Therapies dataset (previously known as Improving Access to Psychological Therapies) includes information about adults who have received NHS-funded psychological treatment for anxiety, depression or related disorders.
Community Datasets
The Community Services Data Set (CSDS) and Maternity Services Data Set (MSDS) are the two community datasets available for use. The CSDS includes data on a wide range of personal, demographic, social, and medical circumstances recorded in any community setting (e.g., health centers, schools, community centers). The MSDS dataset contains pregnancy and birth information for individuals who have accessed maternity care through the NHS. This will exclude data on the TEDS participants’ own birth but will provide information on their pregnancies and the birth of their children, some of whom participate in the Children of TEDS Study (Ahmadzadeh et al., Reference Ahmadzadeh, Eley, Plomin, Dale, Lester, Oliver, McMillan and McAdams2019).
COVID-19 Datasets
The UK LLC hosts electronic records from six NHS England COVID-19 datasets relating to testing, vaccinations, and COVID-related hospitalizations. These datasets all began in 2020; however, some are no longer collecting new information. As outlined by the NHS terms of use, projects must be related to COVID-19 in order to access these data.
Registration Datasets
TEDS participants have linked data for the Cancer, Civil Registrations of Death, and Demographics registers. The Cancer Registration includes information on cancer diagnosis and treatment. The Civil Registrations of Death contains information collected by the Office for National Statistics on deaths reported to registry offices. The Demographics dataset outlines information such as name, date of birth, and sex.
Cross-Cohort Analysis
UK LLC hosts the TEDS collected data and linked records in an integrated relational database. The TEDS data sits alongside data from over 20 other major longitudinal studies and a pooled sample of more than 400,000 participants. This provides users of TEDS data with a straightforward mechanism to either run replication analysis in other cohorts or to combine studies for meta- or pooled-analysis. While these latter models are complicated by the need to harmonize study collected data, they are facilitated by the linked data being identical in structure and methodology across all studies. This potential is further enhanced by TEDS (born 1994−1996) sitting alongside: the ‘Next Steps’ cohort (born 1990−1991); the Avon Longitudinal Study of Parents and Children (born 1991−1992); and (UK) Millennium Cohort Study (born 2001−2002) which currently collectively have a combined sample size of >40,000 in the LLC and which can be used to understand outcomes across diverse UK Millennial and Generation Z populations.
Conclusion
Linkages to electronic health records pose a number of challenges relating to protocol development, access and ethical approvals, data quality, analysis platforms, and generalizability of findings (Honeyford et al., Reference Honeyford, Expert, Mendelsohn, Post, Faisal, Glampson, Mayer and Costelloe2022). Indeed, the road to linkage in the TEDS cohort has been long and winding. Perhaps the greatest challenge lay in developing a protocol that was acceptable to TEDS participants, enabled both expression of dissent and wider fairness of outcomes, and maintained long-term participant trust. Our extensive participant consultation was specifically designed to inform this decision-making process and provided confidence that an opt-out approach could be used without negatively impacting participant relationships.
Integrating medical records and longitudinal population study data has the potential to offer powerful insights into public health. The NHS is facing both longstanding and emerging challenges (Khan, Reference Khan2023), and the wealth of data collected by longitudinal population studies such as TEDS will allow researchers to address many critical and timely health related research questions. For example, prediction models of physical and mental health treatment outcomes can be built using TEDS data, incorporating a broader range of longitudinal information than is available in medical records alone. The twin structure and genomic data available in TEDS could also answer novel questions, such as about genetic influences on service use. Cross-cohort analyses facilitated by the UK LLC’s collaborative research environment will enable the investigation of rare and uncommon disorders. The collaboration between longitudinal population studies and integrated platforms such as the UK LLC make medical record linkage more feasible. The integration of rich, detailed longitudinal population study data with real-world data from health services used by the majority of the UK population provides a unique opportunity for TEDS to contribute to highly impactful research that will improve our understanding of health and wellbeing for many years to come.
Data availability
Access to TEDS linked data is governed by application directly to the UK LLC (https://ukllc.ac.uk/apply).
Acknowledgments
The authors gratefully acknowledge the ongoing contribution of the participants in the Twins Early Development Study (TEDS) and their families. For the purposes of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Accepted Author Manuscript version arising from this submission. The views expressed are those of the authors and not necessarily those of the ESRC, MRC or King’s College London.
This work uses data accessed within UK LLC’s Trusted Research Environment (TRE). We thank the Secure eResearch Platform (SeRP UK) team at Swansea University and NHS Digital Health and Care Wales for providing the TRE’s infrastructure and support. This work uses data provided by participants of the contributing LPS, which have been collected through their LPS or as part of their care and support and/or interactions with UK government services. We wish to recognise and thank the participants and each contributing to the LPS team. We thank the following LPS for contributing data that made this research possible: TEDS - Twins Early Development Study. This work uses data provided by patients and collected by the NHS as part of their care and support. NHS England collates patient data and gives permission for publicly beneficial uses via its Data Access Service.
Financial support
TEDS is supported by the UK Medical Research Council (MRC) (MR/V012878/1 and previously MR/M021475/1). The E-Risk substudy is also supported by the MRC (MR/X010791/1). H.L.F. receives salary support from the Economic and Social Research Council (ESRC) Centre for Society and Mental Health at King’s College London [ES/S012567/1]; [UKRI861]. AB is funded by UK Research & Innovation (Medical Research Council MR/X021556/1, Economic and Social Research Council ES/X000567/1). TE is part-funded by the National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre (BRC).
Competing interests
None to declare.
Ethical statement
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. This project has been approved by UK LLC and its contributing data owners. Information about this project and its outputs can be accessed via UK LLC’s Data Use Register. UK LLC has ethical approval from the Health Research Authority Research Ethics Committee (Haydock Committee; ref: 25/NW/0314).




