Association between genetic and socioenvironmental risk for schizophrenia during upbringing in a UK longitudinal cohort

Background Associations of socioenvironmental features like urbanicity and neighborhood deprivation with psychosis are well-established. An enduring question, however, is whether these associations are causal. Genetic confounding could occur due to downward mobility of individuals at high genetic risk for psychiatric problems into disadvantaged environments. Methods We examined correlations of five indices of genetic risk [polygenic risk scores (PRS) for schizophrenia and depression, maternal psychotic symptoms, family psychiatric history, and zygosity-based latent genetic risk] with multiple area-, neighborhood-, and family-level risks during upbringing. Data were from the Environmental Risk (E-Risk) Longitudinal Twin Study, a nationally-representative cohort of 2232 British twins born in 1994–1995 and followed to age 18 (93% retention). Socioenvironmental risks included urbanicity, air pollution, neighborhood deprivation, neighborhood crime, neighborhood disorder, social cohesion, residential mobility, family poverty, and a cumulative environmental risk scale. At age 18, participants were privately interviewed about psychotic experiences. Results Higher genetic risk on all indices was associated with riskier environments during upbringing. For example, participants with higher schizophrenia PRS (OR = 1.19, 95% CI = 1.06–1.33), depression PRS (OR = 1.20, 95% CI = 1.08–1.34), family history (OR = 1.25, 95% CI = 1.11–1.40), and latent genetic risk (OR = 1.21, 95% CI = 1.07–1.38) had accumulated more socioenvironmental risks for schizophrenia by age 18. However, associations between socioenvironmental risks and psychotic experiences mostly remained significant after covariate adjustment for genetic risk. Conclusion Genetic risk is correlated with socioenvironmental risk for schizophrenia during upbringing, but the associations between socioenvironmental risk and adolescent psychotic experiences appear, at present, to exist above and beyond this gene-environment correlation.

Murray, 2014; Lederbogen et al., 2011;van Winkel, Stefanis, & Myin-Germeys, 2008). An enduring question, however, is whether associations of socioenvironmental risk factors with psychosis are truly causal. The process by which people 'choose' where to live is not random: it is influenced by our individual characteristics and resources. These selection processes likely result in compositional differences between individuals living in advantaged v. disadvantaged environments. For example, functional impairment due to high genetic risk could lead individuals to drift downward in social mobility into more crowded and impoverished settings. This selection process could occur over an individual's lifetime (intra-generational), as well as across generations (intergenerational) (Solmi, Lewis, Zammit, & Kirkbride, 2019), and thereby confound the observed associations between the environment and psychosis.
Findings from Scandinavian registry data (Mortensen et al., 1999;Pedersen, 2001) have long shown that associations between urbanicity and psychosis are not confounded by family psychiatric history, the workhorse measure of genetic risk. However, family history captures only the diagnosed fraction of genetic risk for psychopathology (Yang, Visscher, & Wray, 2010). Polygenic risk scores (PRS)which aggregate the effects of thousands of genetic variants to index an individual's genetic risk for a phenotype (Dudbridge, 2013;Plomin, Haworth, & Davis, 2011)offer new opportunities to examine the role of genetics in the link between socioenvironmental risk and psychosis. A handful of recent studies have explored this question. Higher schizophrenia PRS has been associated with urbanicity and deprivation in adulthood (Colodro-Conde et al., 2018;Sariaslan et al., 2016), and with deprivation and air pollution at birth (Horsdal et al., 2019;Solmi et al., 2019). In contrast, other studies have reported null associations between PRS and urbanicity at birth (Paksarian et al., 2018;Solmi et al., 2019). Furthermore, we recently identified consistent associations between PRS for education and ecological risk in childhood; but associations were inconsistent for PRS for schizophrenia (Belsky et al., 2019). Findings remain equivocal. Therefore, the present study addressed this topic by examining the association between genetic risk and socioenvironmental risk for schizophrenia during upbringing.
Like family history, PRS are imperfect measures of genetic risk. Complex traits are polygenic, and PRS currently capture only a small portion of this genetic variance (Ripke et al., 2014;Wray et al., 2018); while at the same time confirming the presence of substantial pleiotropy between different disorders like psychosis and depression (Legge et al., 2019;Pain et al., 2018). Therefore, this study incorporated five genetic risk indices, including schizophrenia PRS, depression PRS, maternal psychotic symptoms, family psychiatric history, and latent genetic risk scores derived using the twin design. This allowed us to contrast findings between PRS, which are partial through direct measures of genetic risk that are uninfluenced by the environment, with family-and twinbased measures, which capture risk more comprehensively but reflect both genetic and environmental influences. Furthermore, as well as area-level features like urbanicity, our study included detailed, high-resolution measures of neighborhood-and familylevel risks that are typically enriched in cities. Based on our previous findings, measures included urbanicity, air pollution, neighborhood deprivation, neighborhood crime, neighborhood disorder, neighborhood social cohesion, residential mobility, and family poverty. Finally, rather than focusing on one time-point (e.g. birth or adulthood), our study spanned birth to age 18. Our rationale was that socioenvironmental exposures early in life, rather than in adulthood, often associate with psychosis most strongly (Pedersen & Mortensen, 2001), meaning that interventions during childhood and adolescence hold the best promise for improving mental health outcomes (Davidson, Grigorenko, Boivin, Rapa, & Stein, 2015). Additionally, children v. young adults have relatively little choice about where they live, meaning that intra-generational drift may increase across development. A focus from birth to adolescence will therefore highlight whether correlations between genetic and environmental risk indicate (a) reverse causation in which genetic risk leads adults into riskier environments, meaning that addressing risks would have little impact on psychosis outcome; or (b) that genetic risk influences exposure to causal risk factors, in which case addressing risks would reduce young people's risk for psychosis.

Study cohort
Participants were members of the Environmental Risk (E-Risk) Longitudinal Twin Study, which tracks the development of a nationally representative birth cohort of 2232 twin children born in 1994-1995 across England and Wales and initially assessed at age 5. Follow-up home-visits were conducted when participants were aged 7, 10, 12 and 18 (participation rates were 98%, 96%, 96% and 93%, respectively). At age 18, the E-Risk sample included 2066 participants, comprising 56% monozygotic (MZ) twin pairs and 49% males. There were no differences between those who did and did not take part at age 18 in terms of age-5 socioeconomic status (SES) (χ 2 = 0.86; p = 0.65), age-5 IQ scores (t = 0.98; p = 0.33), or age-5 internalizing or externalizing behavioral problems (t = 0.40; p = 0.69 and t = 0.41; p = 0.68, respectively). E-Risk families are representative of UK households across the spectrum of neighborhood socioeconomic conditions according to consumer-classification systems (CACI Information Services, 2006): at follow-up, 27% of E-Risk participants lived in 'wealthy achiever' neighborhoods compared to 25% of households nationwide; 7% v. 12% lived in 'urban prosperity' neighborhoods; 28% v. 27% lived in 'comfortably off' neighborhoods; 13% v. 14% lived in 'moderate means' neighborhoods; and 26% v. 21% lived in 'hard-pressed' neighborhoods (Odgers et al., 2012b). Likewise, E-Risk families are representative of UK households according to the Index of Multiple Deprivation (online Supplementary Figure 1). The Joint South London and Maudsley and the Institute of Psychiatry Research Ethics Committee approved each phase of the study. Parents gave written informed consent and participants gave written assent at ages 5-12 and written informed consent at age 18. Further details about the sample are reported elsewhere (Moffitt, 2002) and in the Supplementary Materials. Table 1 displays characteristics for the analysis sample, together with information on missing data.

Genotyping participants
We used Illumina HumanOmni Express 24 BeadChip arrays (Version 1.1; Illumina, Hayward, CA) to assay common singlenucleotide polymorphism (SNP) variation in the genomes of cohort members. The resulting database was restricted to SNPs called successfully in >98% of the cohort and in Hardy-Weinberg equilibrium (p > 0.001). We imputed additional SNPs using the IMPUTE2 software (Version 2.3.1; https://mathgen. The E-Risk cohort contains MZ twins, who are genetically identical; we therefore empirically measured genotypes of one randomly selected twin per pair and assigned these data to their MZ co-twin. We directly measured genotypes of both members of dizygotic (DZ) twin pairs. Prephasing and imputation were conducted using a 50-million-base-pair sliding window. The resulting genotype databases included genotyped SNPs and SNPs imputed with 90% probability of a specific genotype among the European descent members of the E-Risk cohort (N = 1999 participants in 1011 families). Note that the analysis sample for the present study included only ethnically White participants because PRS currently has uncertain validity in non-European descent populations (Martin et al., 2017;Vassos et al., 2017).
Polygenic risk scores PRS were created for European descent E-Risk participants following the method described by Dudbridge (2013) (Pardiñas et al., 2018) were matched with SNPs in the E-Risk cohort, irrespective of nominal significance with schizophrenia. We then performed clumping by retaining the SNP with the smallest p value from each LD block (excluding SNPs with r 2 > 0.1 in 500-kb windows), then weighted retained SNPs by effect estimate. To control for possible population stratification, we conducted a principal component analysis of our genome-wide SNP database using PLINK (Version 1.9; Chang et al., 2015). One twin was selected at random from each family for principal component analysis. SNP loadings for principal components were applied to co-twin genetic data to compute principal component values for the full sample. The 10 principal components explained 2.8% of the variance in the schizophrenia PRS. We residualized polygenic scores for the first 10 principal components estimated from the genome-wide SNP data. The residualized scores were normally distributed. We standardized residuals for analysis (M = 0, S.D. = 1, range = −3.75-3.53). For depression PRS, SNPs reported in the latest GWAS for major depression (Wray et al., 2018) were matched with SNPs in the E-Risk cohort, irrespective of nominal significance with depression. Clumping and principal components analysis were conducted as above. Residualized scores were normally distributed, and these were once again standardized for analysis (M = 0, S.D. = 1, range = −3.31-3.39).

Maternal psychotic symptoms and family psychiatric history
Maternal psychotic symptoms and family history (Table 1) were assessed in private interviews with the mothers when children were aged 12. For maternal psychotic symptoms, mothers were interviewed using the Diagnostic Interview Schedule (DIS; Robins, Cottler, Bucholz, and Compton, 1995) for DSM-IV (American Psychiatric Association, 1994) which provides a symptom count for characteristic symptoms of schizophrenia (e.g. hallucinations, delusions, anhedonia). For family psychiatric history, mothers reported on their own mental health history and the mental health history of their biological mothers, fathers, sisters, brothers, and the twins' biological father (Weissman et al., 2000). This was converted into the proportion of family members with a history of psychiatric disorder (Milne et al., 2008).

Latent genetic risk scores
Latent genetic risk scores were derived based on each participant's zygosity and whether their co-twin had psychotic experiences. As described by Arseneault et al. (2011), each twin is represented in the data twice, first as the target and then as the co-twin. A continuum of genetic risk is generated, on the basis that MZ twins are ∼100% genetically identical, whereas DZ twins share ∼50% of the same genes. Thus, we assigned a score of 0 to participants if their MZ co-twin did not have psychotic experiences; a score of 1 to participants if their DZ co-twin did not have psychotic experiences; a score of 2 to participants if their DZ twin did have psychotic experiences; and a score of 3 to participants if their MZ twin did have psychotic experiences.

Adolescent psychotic experiences
At age 18, each E-Risk participant was privately interviewed by a research worker about 13 psychotic experiences occurring since age 12. Seven items referred to delusions and hallucinations (Polanczyk et al., 2010), such as 'have you ever thought you were being watched, followed or spied on?' and 'do you hear voices that others cannot?'. Six items referred to unusual experiences which drew on item pools since formalized in prodromal psychosis instruments including the PRIME-screen and SIPS (Loewy, Pearson, Vinogradov, Bearden, & Cannon, 2011), such as 'I believe I have special abilities or powers beyond my natural talents'. Further information on this measure is provided in Supplementary Materials. Research workers coded each item 0, 1, or 2 indicating respectively: 'not present', 'probably present' and 'definitely present'. All 13 items were summed (M = 1.19, S.D. = 2.58, range = 0-18), and scores were placed into an ordinal scale, corresponding to the number of different psychotic experiences rather than frequency of occurrence. Just over 30% of participants had at least one psychotic experience between ages 12 and 18: 69.8% reported no psychotic experiences (coded 0; n = 1440), 15.5% reported one or two psychotic experiences (coded 1; n = 319), 8.1% reported three-five psychotic experiences (coded 2; n = 166), and 6.7% reported six or more psychotic experiences (coded 3; n = 138). This is similar to the prevalence of psychotic experiences self-reported in other general population non-twin samples (Spauwen, Krabbendam, Lieb, Wittchen, & van Os, 2004;Yoshizumi, Murase, Honjo, Kaneko, & Murakami, 2004;Yung et al., 2009).

Socioenvironmental risks
Socioenvironmental risk variables are described below, in Fig. 1, and in the Supplementary Materials. Some variables were on ordinal scales and some were continuous. All continuous variables were placed into ordinal scales to allow consistent analyses and comparison between variables. Categorizations were based on official classifications and/or precedent. Urbanicity, neighborhood deprivation, and air pollution were available at several timepoints. Therefore, we included data from multiple time-points across childhood and adolescence to explore temporal changes in associations. Urbanicity was derived from classifications from 2011 census data (Office for National Statistics, 2013) and linked to participants' home postcodes at ages 5, 12, and 18. Classifications incorporate data on residential density, settlement size and population sparsity. Resolution is at the Lower Layer Super Output Area (LSOA) level, which contain an average of 1500 residents. A three-level urbanicity variable was used representing rural, intermediate and urban settings (Table 1). E-Risk participants are nationally-representative in terms of ONS urbanicity classifications; for instance, the nationwide distribution of rural, intermediate, and urban residents in the UK is 18.9%, 45.0%, and 36.1%, respectively.
Air pollution exposure was measured using a coupled regional chemical transport model and street-scale dispersion model (Beevers, Kitwiroon, Williams, & Carslaw, 2012;Carslaw, 2011). Based on our previous findings in this cohort (Newbury et al., 2019), the present study focusses on nitrogen dioxide (NO 2 ) exposure. Annualized estimates of ambient NO 2 were linked to participants' home postcodes at ages 10 and 18. The model achieves 20 × 20 meter resolution. For this study, annualized NO 2 exposure was categorized into quartiles (1 = least polluted; 4 = most polluted).
Neighborhood deprivation was constructed using A Classification of Residential Neighborhoods (ACORN), a geodemographic discriminator developed by CACI Information Services (CACI Information Services; http://www.caci.co.uk/). Detailed information about ACORN's classification of neighborhood-level socioeconomic-status (SES) has been provided previously (Caspi, Taylor, Moffitt, & Plomin, 2000;Odgers et al., 2009Odgers et al., , 2012b. Briefly, CACI utilized over 400 census variables for Great Britain (e.g. educational qualifications, unemployment, housing tenure) and CACI's consumer lifestyle database. Classifications ranged from 'Wealthy Achiever' (coded 1) to 'Hard Pressed' (coded 5) neighborhoods (Table 1). Each family in our sample was matched to the ACORN code for its neighborhood via postcodes at ages 5, 12, and 18 .
Neighborhood crime was measured using police data for England and Wales for 2011 and linked to participants' home postcodes at age 18. The total number of crimes occurring each month within a one-mile radius of each participant's home was tallied (Table 1). Scores were placed into quartiles for analysis (1 = lowest crime; 4 = highest crime).
Neighborhood disorder and social cohesion were measured via resident reports sent to >5000 immediate neighbors of participants in 2008 (Odgers et al., 2009;Odgers, Caspi, Bates, Sampson, & Moffitt, 2012a). Items enquired about physical and social signs of threat in the neighborhood (neighborhood disorder) and the quality and quantity of social interactions between neighbors (social cohesion). Items were averaged across each neighborhood characteristic and across respondents from the same neighborhood, and linked to home postcodes at age 12 (Table 1). Scores were grouped into tertiles for analysis at the 33 rd and 66 th centile (1 = lowest neighborhood disorder/highest social cohesion; 3 = highest neighborhood disorder/lowest social cohesion).
Family poverty (SES) was measured as a composite of parental income, education, and occupation via interviews with mothers when children were aged 5. The latent variable was categorized into tertiles (i.e. low-, medium-, and high-SES; Trzesniewski, Moffitt, Caspi, Taylor, and Maughan, 2006; Table 1). The present study uses a reverse-scored measure from 1 (high-SES) to 3 (low-SES).

Cumulative environmental risk scale
To compare participants with the most v. least disadvantaged upbringings we created a cumulative environmental risk scale by summing the top scores of each of the socioenvironmental variables. For analyses with the genetic risk indices, the variable included data up to age 18. For analyses with psychotic experiences, the variable included data only up to age 12 to ensure temporal ordering of exposures and outcome. For risk factors measured at multiple phases (urbanicity, deprivation, NO 2 ) we averaged scores across phases prior to constructing the cumulative scale. Percentages of participants exposed to 0, 1, 2, 3, 4 or 5 or more risks up to age 12 were: 26.5%, 20.8%, 16.7%, 15.2%, 10.9% and 10.0%; and up to age 18 were; 22.5%, 21.0%, 15.8%, 13.7%, 11.4%, and 15.6%, respectively (scores of 5-8 risks were collapsed due to small numbers within these categories).

Statistical analyses
We conducted the analyses using Stata (v16.0) following three steps. First, we used ordinal logistic regression to examine associations of the genetic risk indices with adolescent psychotic experiences. Second, we used ordinal logistic regression to examine associations of the genetic risk indices with the socioenvironmental risk variables, with the genetic risk indices treated as the independent variables. For step 2, one twin per each MZ twin pair was randomly dropped, because MZ twins correlate ∼100% for their genetic sequence and typically share the same neighborhood environments during childhood. Alongside steps 1 and 2, we plotted genetic risk Z-scores across psychotic experience scores and the cumulative environmental risk scale. Third, we used ordinal logistic regression to examine the associations of socioenvironmental risks from birth to age 12 with adolescent psychotic experiences and added the genetic risk indices as covariates to examine attenuation of effects. All genetic risk indices were standardized for analysis (i.e. M = 0, S.D. = 1) to allow comparison between measures. Steps 1-3 accounted for the nonindependence of twin observations using the cluster command in Stata. This procedure is derived from the Huber-White variance estimator and provides robust standard errors adjusted for within-cluster correlated data (Rogers, 1994). Analyses were conducted following multiple imputation using chained equations to handle missing covariate data (described further in the Supplementary Materials). Multiple imputations were only conducted for the 1999 White, genotyped participants because schizophrenia PRS currently has uncertain validity in non-European descent populations (Martin et al., 2017;Vassos et al., 2017). Complete case analyses are reported in the Supplementary Materials.

Results
Is genetic risk associated with adolescent psychotic experiences?
Is genetic risk associated with socioenvironmental risk for schizophrenia during upbringing? Figure 2b plots genetic risk Z-scores across the cumulative environmental risk scale and shows that participants with higher genetic risk tended to experience more socioenvironmental risks for schizophrenia during upbringing. Table 2 displays odds ratios for these associations. Schizophrenia PRS was associated at p < 0.05 with the cumulative environmental risk scale, age-18 urbanicity, neighborhood crime, neighborhood disorder, residential mobility, and family poverty (all OR's >1.12). Depression PRS was associated at p < 0.05 with the cumulative environmental risk scale, age-12 and age-18 neighborhood deprivation, neighborhood crime, and neighborhood social cohesion (all ORs >1.11). Maternal psychotic symptoms were associated at p < 0.05 only with residential mobility and family poverty (all OR's >1.18). Family psychiatric history was associated at p < 0.05 with the cumulative environmental risk scale, age-12 and age-18 neighborhood deprivation, neighborhood disorder, residential mobility, and family poverty (all OR's >1.15). Finally, latent genetic risk was associated at p < 0.05 with the cumulative environmental risk scale, age-5 and age-18 neighborhood deprivation, neighborhood crime, residential mobility, and family poverty (all ORs >1.12).
Are associations between socioenvironmental risks and adolescent psychotic experiences attenuated by covariate adjustment for genetic risk?
Associations between socioenvironmental risks and adolescent psychotic experiences are reported in Table 3 and online Supplementary Table 1. Children exposed to the highest levels of each risk factor had greater odds for psychotic experiences compared to children with the lowest exposure to these risks (all unadjusted ORs = 1.43-2.98; all ps < 0.05). Effect sizes were slightly reduced following covariate adjustment for schizophrenia PRS (2-9% reduction), depression PRS (2-12% reduction) and maternal psychotic symptoms (0-12% reduction), and were often reduced more substantially following covariate adjustment for family history (0-45% reduction) and latent genetic risk (8-29% reduction). Effect sizes were reduced by 15% (urbanicity) to 75% (residential mobility) after simultaneous adjustment for all genetic risk indices together. Nevertheless, associations mostly remained significant.

Complete case analyses
Complete case analyses revealed similar point estimates (online Supplementary Tables 2 and 3).

Discussion
In a cohort of British youth followed from birth to age 18, participants at higher genetic risk were exposed to more socioenvironmental risk factors for schizophrenia during upbringing. Associations in terms of elevated odds were apparent for almost all risk factors examined. Beyond urbanicity, our study incorporated detailed neighborhood-and family-level risks measured from multiple sources. Our sample also captures the full range of socioeconomic and neighborhood conditions found in the UK, and we were thus able to contrast high-and low-risk groups. In doing so, we found that youth with higher genetic risk tended to grow up in more deprived neighborhoods with higher crime and disorder, and within poorer families who moved house more frequently. In fact, gene-environment correlations were more consistent for neighborhood-and family-level risks compared to the area-level risks. This could suggest that gene-environment correlations at the area-level comprise more pronounced geneenvironment correlations at the neighborhood-and family-level. It could also be that larger sample sizes than ours are required to powerfully detect gene-environment correlations at the area-level. Therefore, future research into geosocial patterning of genetic risk requires both precise measures of the environment and representative samples that include high-risk groups.
There was tentative evidence that gene-environment correlations increased across development. For instance, schizophrenia PRS was associated with urbanicity at age 18, but not at ages 5 or 12; and depression PRS and family history were associated with deprivation at ages 12 and 18, but not at age 5. Our findings are consistent with those from a previous report in which schizophrenia PRS was associated with urbanicity at age 15 but not urbanicity at birth (Paksarian et al., 2018). These findings provide support for a process of intra-generational drift, in which gene-environment correlations increase over time due to selection into different geographical and social conditions. A process of intra-generational drift is also supported by our novel finding that genetic risk is associated with more residential mobility during upbringing. Continued follow-up of the E-Risk cohort will allow us to test for downward drift further into adulthood. However, effect size differences across time-points were small and should be interpreted with caution. Furthermore, given that several risk factors were measured early in development, our findings cannot entirely be explained by intra-generational drift. Instead, our findings also support an inter-generational process in which families at higher genetic risk drift downward in social mobility across generations into more disadvantaged environments. That is, participants who inherited higher genetic risk also tended to be born and raised in riskier environments.
Following covariate adjustment for the genetic risk indices, effect sizes were often attenuated more substantially by family history and latent genetic risk v. PRS. This likely reflects, in part, that family-and twin-based measures capture genetic risk more comprehensively at present compared to PRS. This could also be because family history captures unmeasured environmental risks shared between family members which contributed to the expression of mental illness in the family. Nevertheless, the association between socioenvironmental risk factors and psychotic experiences existed above and beyond five genetic risk indices in our study. Though residual confounding is inevitable, our findings lend support for an important role of socioenvironmental conditions during upbringing in early expressions of psychosis.

Limitations
We acknowledge several limitations. First, PRS for schizophrenia currently capture ∼30% of the SNP-based heritability for schizophrenia (Ripke et al., 2014), despite behavioral genetics studies demonstrating that >70% of schizophrenia variance is due to genetic factors (Lichtenstein et al., 2009). Indeed, schizophrenia PRS was only weakly associated with adolescent psychotic experiences in our study. The ability of PRS to capture genetic liability and predict phenotypic variance is expected to increase as the size of GWAS samples increases (Visscher et al., 2017). Second, family psychiatric history (and by design, maternal psychotic symptoms) was biased towards maternal family history (though mothers also reported on the twins' father's psychiatric history). In large cohort studies with finite resources it is common to focus on maternal reports, partly because mothers typically retain main custody in instances of parental separation (Braver, Ellman, Votruba, & Fabricius, 2011). Third, it was necessary to restrict all analyses (MICE and complete case) to European descent participants because PRS are currently suitable only within ancestries matched to the original GWAS (Martin et al., 2017;Vassos et al., 2017). Fourth, adolescent psychotic experiences were self-reported and may have included false positives. In addition, the measure covered only positive psychotic symptoms. Negative and disorganized symptoms are potentially more clinically-pertinent (Ho, Nopoulos, Flaum, Arndt, & Andreasen, 1998), but are also, unfortunately, harder to assess (Stahl & Buckley, 2007). Finally, the E-Risk sample comprises twins, who could differ from singletons in terms of both measured and unmeasured factors. However, E-Risk families are representative of the UK population in terms of urbanicity (Office for National Statistics, 2013) and  Government, 2015). The prevalence of psychotic experiences in our sample was also comparable to those from general population studies of non-twins (Spauwen et al., 2004;Yoshizumi et al., 2004;Yung et al., 2009).

Conclusion
Urbanicity, deprivation, neighborhood crime and disorder, residential mobility and poverty have been consistently associated with both clinical and subclinical expressions of psychosis. Our findings have implications for interpreting the causality of these associations. It is very unlikely that associations are entirely confounded by genetics. Indeed, associations between socioenvironmental risk factors and adolescent psychotic experiences existed above and beyond five indices of genetic risk in our study. However, our findings add to growing evidence that a degree of genetic confounding exists. Future investigations into the environment and psychosis should continue to consider gene-environment correlation as a non-causal contributor to associations. Understanding the processes leading to intra-and intergenerational drift could also open up valuable new avenues for improving psychiatric interventions and reducing socioenvironmental inequalities.
Supplementary material. The supplementary material for this article can be found at https://doi.org/10.1017/S0033291720003347.