Association of etiological factors across the extreme end and continuous variation in disordered eating in female Swedish twins

Background Accumulating evidence suggests that many psychiatric disorders etiologically represent the extreme end of dimensionally distributed features rather than distinct entities. The extent to which this applies to eating disorders (EDs) is unknown. Methods We investigated if there is similar etiology in (a) the continuous distribution of the Eating Disorder Inventory-2 (EDI-2), (b) the extremes of EDI-2 score, and (c) registered ED diagnoses, in 1481 female twin pairs at age 18 years (born 1992–1999). EDI-2 scores were self-reported at age 18. ED diagnoses were identified through the Swedish National Patient Register, parent-reported treatment and/or self-reported purging behavior of a frequency and duration consistent with DSM-IV criteria. We differentiated between anorexia nervosa (AN) and other EDs. Results The heritability of the EDI-2 score was 0.65 (95% CI 0.61–0.68). The group heritabilities in DeFries–Fulker extremes analyses were consistent over different percentile-based extreme groups [0.59 (95% CI 0.37–0.81) to 0.65 (95% CI 0.55–0.75)]. Similarly, the heritabilities in liability threshold models were consistent over different levels of severity. In joint categorical-continuous models, the twin-based genetic correlation was 0.52 (95% CI 0.39–0.65) between EDI-2 score and diagnoses of other EDs, and 0.26 (95% CI 0.08–0.42) between EDI-2 score and diagnoses of AN. The non-shared environmental correlations were 0.52 (95% CI 0.32–0.70) and 0.60 (95% CI 0.38–0.79), respectively. Conclusions Our findings suggest that some EDs can partly be conceptualized as the extreme manifestation of continuously distributed ED features. AN, however, might be more distinctly genetically demarcated from ED features in the general population than other EDs.

Traditionally, psychiatric disorders have been conceptualized as discrete entities with a clear demarcation between affected and unaffected individuals. However, accumulating evidence suggests that many psychiatric disorders represent the extreme end of dimensionally distributed traits with no clear phenotypic or etiological distinction between affected and unaffected individuals (Martin, Taylor, & Lichtenstein, 2018). This has been demonstrated for a range of disorders, including autism spectrum disorder (Colvert et al., 2015; Lundstrom et al., 2012; Robinson et al., 2016), attention-deficit/hyperactivity disorder (ADHD; Demontis et al., 2019; Greven et al., 2016), depression (Direk et al., 2017; Eley, 1997), anxiety disorders (Taylor et al., 2019), and schizophrenia (Zavos et al., 2014). The dimensional susceptibility is believed to be due to an abundance of common genetic variants with small cumulative effects (Cross-Disorder Group of the Psychiatric Genomics Consortium, 2013; Plomin, Haworth, & Davis, 2009; Sullivan, Daly, & O'Donovan, 2012).
Whether EDs are best conceptualized as discrete entities or as the extreme end of variation in continuously distributed cognitions and behaviors characteristic of disordered eating (e.g. body dissatisfaction, dieting, binge-eating, compensatory behaviors; hereafter called ED features) has so far been investigated only on a phenotypic level. Taxometric approaches have yielded conflicting results, supporting both categorical and dimensional conceptualizations of EDs (for a review see Keel, Brown, Holland, & Bodell, 2012). Studies using the newer mixture modeling approach, which directly compares the fit of categorical, dimensional, and hybrid models, have also provided inconsistent results: while Keel, Crosby, Hildebrandt, Haedt-Matt, and Gravener (2013) suggested a hybrid model structure for bulimia nervosa (BN), Luo, Donnellan, Burt, and Klump (2016) found evidence for a dimensional model for EDs.
An additional approach is to investigate on an etiological level whether EDs can be conceptualized as a continuum. Both ED diagnoses and ED features are moderately to highly heritable (Baker et al., 2009; Wade et al., 1999; Yilmaz, Hardaway, & Bulik, 2015). Similar heritability estimates for ED diagnoses and ED features do not, however, necessarily imply that the same genetic variants associated with ED diagnoses are also associated with dimensionally distributed ED features (which would be expected under a dimensional model), since different genetic variants could be mainly responsible for the respective heritability. Furthermore, genome-wide association studies suggest a polygenic architecture of AN (Watson et al., 2019), similar, in principle, to most other psychiatric disorders (Sullivan et al., 2012), and potentially supporting a dimensional hypothesis for EDs.
The aim of the present study was to investigate, in a large-scale population-based twin sample, if EDs can be viewed etiologically as the extreme end of continuous variation in ED features, rather than as distinct entities. First, we investigated whether the etiology in the entire distribution of ED features is similar to the etiology in different extreme groups of ED features. Second, we estimated the genetic correlation between ED features and ED diagnoses. Additionally, we estimated this genetic correlation separately for diagnoses of AN and EDs other than AN, as AN has been proposed to differ from other EDs with respect to prevalence (Keel & Klump, 2003;Keski-Rahkonen & Mustelin, 2016), heritability (Bulik et al., 2010;Dellava, Thornton, Lichtenstein, Pedersen, & Bulik, 2011), as well as comorbidity (Hudson et al., 2007).

Participants
Participants were part of the Child and Adolescent Twin Study in Sweden (CATSS), an ongoing longitudinal twin study (Anckarsäter et al., 2011). Parents of twins born in Sweden from 1992 onwards are invited to participate in connection with their twins' 9th birthday (CATSS-9, earlier cohorts included 12-year-olds; answering frequency = 76%). When the twins are aged 18, families are contacted again, irrespective of participation at age 9, and asked to fill out a web-based questionnaire (CATSS-18, answering frequency = 59%). The current study included the birth cohorts 1992-1999. The mean age at CATSS-18 was 18.4 years (S.D. = 0.3, range = 17.9-19.5). Zygosity was ascertained using either a panel of 48 single nucleotide polymorphisms or an algorithm of five questions regarding twin similarity. CATSS has ethical approval from the Regional Ethical Review Board in Stockholm.
We excluded individuals with congenital or early brain damage syndromes, chromosomal syndromes, unknown zygosity, or more than 25% missing data on any of the subscales of the ED feature measurement, as well as pairs in which only one twin had responded (Fig. 1). We also excluded all males, due to the low number of males identified with a clinical ED (n = 20). The final sample consisted of 1481 female pairs [768 monozygotic (MZ) and 713 same-sex dizygotic (DZ) pairs].

ED features (continuous measurement)
In CATSS-18, ED features were self-reported on three subscales of the Eating Disorder Inventory-2 (EDI-2; Garner, 1991): Drive for Thinness (seven items), Bulimia (seven items) and Body Dissatisfaction (nine items). Drive for Thinness measures excessive concern with dieting, preoccupation with weight, and fear of weight gain; Bulimia measures the tendency to engage in binge eating and to think about purging; and Body Dissatisfaction measures dissatisfaction with one's overall body shape and the size of specific parts of the body. The response options are 'never' (1), 'rarely' (2), 'sometimes' (3), 'often' (4), 'usually' (5), and 'always' (6). One item on the Body Dissatisfaction scale ('I like the shape of my buttocks') was not included in the EDI-2 during the first 6 years of CATSS-18 and therefore only available for 26.2% of the sample; however, the reversed item 'I think my buttocks are too large' was included. A total EDI-2 score (hereafter EDI-2 score) was computed as the mean of all items on the three subscales when participants had responded to at least 75% of items on each subscale (i.e. at least six out of seven items on Drive for Thinness and Bulimia, and at least seven out of nine items on Body Dissatisfaction). The EDI-2 has been validated in adult Scandinavian females and in non-Swedish adolescents (McCarthy, Simmons, Smith, Tomlinson, & Hill, 2002;Salbach-Andrae et al., 2010). Cronbach's alpha for internal consistency was 0.93 in our sample.
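The scoring rule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `edi2_score`, the subscale keys, and the use of `None` for a missing answer are hypothetical, and items are assumed to be reverse-scored already where the questionnaire requires it. It is not the authors' actual scoring code.

```python
from statistics import mean

# Item counts per subscale, as given in the text.
SUBSCALE_SIZES = {
    "drive_for_thinness": 7,
    "bulimia": 7,
    "body_dissatisfaction": 9,
}
MIN_RESPONSE_RATE = 0.75  # at least 75% of items answered on each subscale


def edi2_score(responses):
    """Mean of all answered items, or None if any subscale is too incomplete.

    `responses` maps each subscale name (hypothetical keys) to a list of
    answers coded 1-6, with None marking a missing answer.
    """
    answered = []
    for name, n_items in SUBSCALE_SIZES.items():
        items = responses[name]
        if len(items) != n_items:
            raise ValueError(f"{name}: expected {n_items} items, got {len(items)}")
        present = [x for x in items if x is not None]
        # 6 of 7 and 7 of 9 answered items pass; 5 of 7 and 6 of 9 do not.
        if len(present) / n_items < MIN_RESPONSE_RATE:
            return None
        answered.extend(present)
    return mean(answered)
```

Returning `None` rather than a partial score mirrors the exclusion of participants with more than 25% missing data on any subscale.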

Psychological Medicine

ED diagnoses (categorical measurement)
ED diagnoses were identified from three sources. First, we used lifetime diagnoses from the Swedish National Patient Register (NPR), which includes diagnoses from psychiatric inpatient care from 1973 onwards and from specialized outpatient care from 2001 onwards. NPR diagnoses are coded according to ICD-9 and ICD-10. The CATSS data used in this study were linked to the NPR up until 31 December 2016. As a consequence, age at the end of NPR follow-up was 17 years for the youngest birth cohort (born 1999) and 24 years for the oldest birth cohort (born 1992). EDs in the NPR have not yet been subjected to formal reliability/validity testing; however, high validity of the NPR has been reported for a range of other psychiatric disorders (Idring et al., 2012; Ludvigsson et al., 2011; Rück et al., 2015; Sellgren, Landen, Lichtenstein, Hultman, & Langstrom, 2011). The following ICD-10 diagnoses of interest were retrieved for all participants: F50.0 (AN), F50.1 (atypical AN), F50.2 (BN), F50.3 (atypical BN), and F50.9 [eating disorder not otherwise specified (EDNOS)]. The Swedish ICD-10 does not provide data on AN subtypes (restricting v. binge eating/purging type). It is important to note that the diagnosis of F50.1 (atypical AN) in the ICD system was typically given when the amenorrhea criterion in F50.0 was not present. This is different from the current DSM-5 classification of atypical AN (meets all criteria for AN except low weight). Individuals with DSM-5 atypical AN are not captured by AN/atypical AN in this analysis. We excluded individuals with the following codes: overeating associated with other psychological disturbances (F50.4), vomiting associated with other psychological disturbances (F50.5), other eating disorders (F50.8), feeding disorder of infancy and childhood (F98.2), and Pica of infancy and childhood (F98.3). In total, 93 out of 2962 individuals (3.1%) had an ED diagnosis of interest in the NPR (Table 1).
Second, we used parent-reports of treatment for ED to identify individuals with ED. In CATSS-18, the parents of twins were asked: 'Has the twin been treated for AN?' and 'Has the twin been treated for BN?'. If parents responded 'Yes, earlier' or 'Yes, now' to one or both of these questions, the twin was identified as having/having had an ED. Parent-reported treatment data were available for 72.5% of the sample. Forty-eight individuals were identified with ED; of those, 46 were treated for AN and eight were treated for BN (six individuals were treated for both AN and BN; Table 1). Twenty-six of the 46 individuals with parent-reported treatment for AN also had a diagnosis of AN in the NPR (56.5%). Only one of the eight individuals with parent-reported treatment for BN also had a diagnosis of BN in the NPR (12.5%).
Third, we used self-reports of purging behavior to identify individuals with ED (available for 99.9% of the sample). In CATSS-18, the twins were asked: 'Did you ever use vomiting, laxatives, diuretics, or enemas to lose weight or to control your weight?'. If they responded 'Yes, repeatedly over at least three months' or 'Yes, repeatedly over the last three months' they were identified as having/having had an ED (n = 96, 3.2%; Table 1). This procedure was deemed to be valid, since the answer options used for identification correspond closely to the DSM-IV criteria for AN binge-eating/purging type ('regularly engaged in binge-eating or purging behavior') and BN ('binge eating and inappropriate compensatory behaviors both occur, on average, at least twice a week for 3 months'), and individuals with repeated purging over at least 3 months would likely receive an ED diagnosis if assessed in a clinical context. When matched to the NPR, 33% of those with regular purging behavior also had an ED diagnosis in the NPR.
The total number of individuals identified with any ED by registered diagnoses, parent-reported treatment and self-reported purging was n = 169 (5.7%, Table 1). More than half (55.0%) of all individuals with ED were identified through the NPR. One third (33.7%) of individuals with ED were identified through more than one method. In a final step, we differentiated EDs into AN (including NPR diagnoses of AN and atypical AN, and parent-reported treatment for AN, n = 70) and other EDs (OEDs, including NPR diagnoses of BN, atypical BN, and EDNOS, as well as parent-reported treatment for BN and self-reported purging behavior, n = 99; Table 1). Individuals diagnosed with both AN and OEDs were included in the AN group. In order to investigate attrition, we compared NPR diagnoses of EDs between our sample and those females who only participated at baseline (age 9), but not at follow-up (age 18, n = 2251, ca. 43% of the baseline sample). The prevalence of EDs was very similar in responders and non-responders (3.1% and 3.3%, χ² = 0.15, p = 0.70), suggesting that attrition did not bias the representativeness of our sample.

The twin design
The twin design is based on comparing the phenotypic resemblance of MZ twins with that of DZ twins. While MZ twins share all of their segregating alleles, DZ twins share on average 50%. The twin model decomposes variance in a trait into additive genetic effects (A), indicated by higher correlations in MZ pairs than in DZ pairs; non-additive genetic effects (D), indicated when MZ correlations are more than twice as high as DZ correlations; shared environment (C), indicated by DZ correlations more than half as high as the MZ correlations; and non-shared environment (E), evidenced by differences within MZ pairs (and including measurement error). The principles of the twin design are described extensively elsewhere (Plomin, DeFries, Knopik, & Neiderhiser, 2013; Posthuma et al., 2003).
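The logic of this decomposition can be illustrated with the classical Falconer approximations, which derive rough A, C, and E proportions directly from the MZ and DZ twin correlations. This is a back-of-the-envelope sketch only; the analyses in this study rely on maximum-likelihood model fitting, and the function name and the example correlations below are hypothetical, not values from the study.

```python
def falconer_estimates(r_mz, r_dz):
    """Rough ACE proportions from twin correlations (Falconer approximations).

    A = 2 * (rMZ - rDZ)   additive genetic effects
    C = 2 * rDZ - rMZ     shared environment
    E = 1 - rMZ           non-shared environment (incl. measurement error)
    """
    a2 = 2 * (r_mz - r_dz)
    c2 = 2 * r_dz - r_mz
    e2 = 1 - r_mz
    return {"A": a2, "C": c2, "E": e2}


# Illustrative (made-up) correlations, not taken from the study.
# rDZ is slightly above half of rMZ, so a small positive C is implied.
print(falconer_estimates(r_mz=0.60, r_dz=0.35))
```

A negative C from these formulas (rDZ below half of rMZ) is the pattern that would instead point towards non-additive genetic effects (D).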

Full sample heritability of EDI-2 score
We initially fitted a fully saturated model to the observed data, including means, variances, and covariances, to act as a baseline comparison model when comparing model fits. This model was then used to test the assumptions of the twin model. Assumption testing revealed that means and variances could be equated across twin order and zygosity (online Supplementary Table S1). Twin correlations were estimated by zygosity from a constrained saturated model, in which the means and variances were equated within twin pairs and across zygosity. We then fitted a univariate model to estimate the degree of genetic and environmental influences on the EDI-2 score in the entire sample. Because the DZ twin correlation was slightly larger than half of the MZ twin correlation, we fitted an ACE model. We tested the significance of individual parameters by constraining them to equal zero. The best-fitting models were chosen based on the likelihood ratio test (if model fit did not deteriorate significantly, the reduced model was favored). In addition, we used the Bayesian information criterion (BIC; lower BIC values indicate better fit; Derks, Dolan, & Boomsma, 2004; Markon & Krueger, 2004).
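The two model-comparison criteria named above can be sketched as follows. This is a generic illustration, assuming that each fitted model reports a -2 log-likelihood value and that the reduced model drops exactly one parameter; the function names are hypothetical, the BIC shown is the textbook definition, and structural equation modeling software may report a differently scaled variant.

```python
import math


def lrt_one_parameter(minus2ll_full, minus2ll_reduced):
    """Likelihood ratio test for a reduced model that drops one parameter.

    The difference in -2 log-likelihood is referred to a chi-square
    distribution with 1 degree of freedom, whose upper tail probability
    equals erfc(sqrt(x / 2)).
    """
    diff = max(minus2ll_reduced - minus2ll_full, 0.0)
    p_value = math.erfc(math.sqrt(diff / 2))
    return diff, p_value


def bic(minus2ll, n_params, n_obs):
    """Textbook BIC: -2 log-likelihood plus a penalty of ln(n) per parameter."""
    return minus2ll + n_params * math.log(n_obs)
```

A non-significant p-value means that fit did not deteriorate significantly, in which case the reduced model is favored.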

Extremes analyses of EDI-2 score
We used two different analytic techniques to investigate whether the etiology of the EDI-2 score was consistent across the entire sample and among those showing extreme EDI-2 scores: DeFries-Fulker extremes analysis and liability threshold modeling. Extreme scorers (probands) were defined using percentile-based cut-offs on the EDI-2, that is, scoring within the 1st, 3rd, 5th, and 10th percentile of the EDI-2 (see the number of probands per percentile in Table 2), in order to maximize power while also capturing individuals with high symptom severity and significant impairment (Robinson et al., 2011). DeFries-Fulker extremes analysis assesses consistency in the etiology of a given trait across different severity levels (here the 1st, 3rd, 5th, and 10th percentile of the EDI-2 score) by modeling an individual's expected score as a function of their co-twin's proband status (DeFries & Fulker, 1985; Purcell & Sham, 2003). The twins' EDI-2 scores were transformed so that the population mean was zero and the proband mean was 1. The transformed co-twin means are interpreted in a similar manner to twin correlations. DeFries-Fulker analysis seeks to estimate group heritability (h²g), which estimates the degree to which the genetic influences on extreme scores also influence continuous variation in the same trait. In the classical procedure, a regression equation is fitted to estimate h²g; the equation predicts co-twin scores from the continuous scores of probands. Zygosity is also included as a predictor in the equation; the regression of zygosity on co-twin scores is an estimate of h²g. Here we used a model fitting implementation of the procedure (Purcell & Sham, 2003).
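The transformation and the classical group heritability estimate can be sketched as below. This is a simplified illustration of the classical DeFries-Fulker logic, not the model-fitting implementation used in the study: function names are hypothetical, and it assumes that the transformed co-twin means for MZ and DZ pairs have already been computed.

```python
def df_transform(score, population_mean, proband_mean):
    """Rescale a score so the population mean maps to 0 and the proband mean to 1."""
    return (score - population_mean) / (proband_mean - population_mean)


def group_heritability(cotwin_mean_mz, cotwin_mean_dz):
    """Classical estimate of h²g: twice the MZ-DZ difference in transformed co-twin means."""
    return 2 * (cotwin_mean_mz - cotwin_mean_dz)
```

On this scale, a co-twin mean close to 1 means that co-twins of probands score almost as extremely as the probands themselves, whereas a mean close to 0 means that co-twins regress fully to the population mean.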
In a second step, liability threshold models were used to estimate the etiology of categorically defined extreme scores on the EDI-2. The liability threshold model is based on dichotomous data, but assumes an underlying continuous distribution of liability to the categorical construct. Probandwise concordance rates were calculated as 2 × (number of concordant pairs)/[2 × (number of concordant pairs) + number of discordant pairs], indicating the probability that a co-twin of a proband is also a proband (Table 2). Using liability threshold models, we estimated the proportion of variation in the liability to extreme ED features that was genetic and environmental at the 1st, 3rd, 5th, and 10th percentile of the EDI-2 score.
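The probandwise concordance formula given above, together with the threshold implied by a percentile cut-off under the assumed normal liability distribution, can be written as a short sketch. The function names are hypothetical, and the threshold calculation only illustrates the standard normal assumption of the liability threshold model rather than the full variance decomposition.

```python
from statistics import NormalDist


def probandwise_concordance(n_concordant, n_discordant):
    """2C / (2C + D): probability that the co-twin of a proband is also a proband."""
    return 2 * n_concordant / (2 * n_concordant + n_discordant)


def liability_threshold(prevalence):
    """Point on a standard normal liability scale above which `prevalence` of people fall."""
    return NormalDist().inv_cdf(1 - prevalence)
```

For example, a cut-off that classifies 5% of the sample as probands corresponds to a threshold of roughly 1.64 standard deviations above the mean liability.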

Joint categorical-continuous models of EDI-2 score and ED diagnoses
We used joint categorical-continuous models to estimate the degree to which genetic influences on the EDI-2 score overlapped with genetic influences on AN and OED diagnoses. The joint categorical-continuous model is a hybridization of a liability threshold model (here for AN/OED diagnoses) with a model for a continuous variable (here for EDI-2 score). Initially, the correlations between EDI-2 score in one twin and ED diagnosis in their co-twin were estimated. If these cross-twin cross-trait correlations are greater in MZ than in DZ twins, genetic influence on the covariation of the EDI-2 score with AN/OED diagnoses is suggested. We then fitted a joint categorical-continuous model; this model allowed us to estimate the genetic, shared environmental, and non-shared environmental correlations between EDI-2 score and AN/OED diagnoses. The genetic correlation (rg) estimates the degree to which genetic influences on one phenotype are shared with those on another phenotype. A genetic correlation of 1.0 (0.0) indicates that all (none) of the additive genetic influence on two phenotypes is shared between them. The genetic correlation, and the heritability of each trait, can be used to calculate bivariate heritability; this refers to the degree to which the genetic factors that overlap across two traits explain the correlation between them. Like the liability threshold model, the categorical component of this model assumes a normal distribution of continuous liability underlying ED diagnosis. In line with the univariate models, we fitted an ACE and its nested models.

Descriptive statistics
The mean EDI-2 score was 2.66 (S.D. = 0.90, range = 1-6). The distribution of EDI-2 scores was only slightly skewed (skew = 0.57); nevertheless, we transformed the EDI-2 score with the natural logarithm in order to improve the accuracy of the results, as the interpretability of raw scores was not important for the purpose of this study. The prevalence of EDs by and across zygosity is presented in Table 1.

Full sample heritability of EDI-2 score
The twin correlation in the full sample was higher for MZ than for DZ pairs, suggesting genetic influences on the EDI-2 score (Table 3). According to the likelihood ratio test, the ACE model did not fit significantly worse than the saturated model.
[Table note: The number of probands applies to both extremes analyses (DeFries-Fulker extremes analyses and liability threshold models). (a) In DeFries-Fulker extremes analysis, the scores are transformed so that the population mean is zero and the proband mean is 1. Transformed co-twin means are interpreted in a similar manner to twin correlations.]

Extremes analyses of EDI-2 score
AE models showed the best fit in all DeFries-Fulker extremes analyses (Table 4). The number of probands per percentile and the transformed co-twin means are shown in Table 2. The group heritability estimates in the DeFries-Fulker extremes analysis (0.59-0.65) were significant, consistent over different levels of severity (1st, 3rd, 5th, and 10th percentile of the EDI-2 score), and similar to the heritability estimates in the full sample, therefore indicating genetic continuity between the continuous distribution and the extremes (Fig. 2). For the liability threshold models, ACE models did not fit significantly worse than saturated models according to the likelihood ratio tests, and they also had lower BIC values. AE models showed the best fit (Table 4). The heritability estimates were consistent over different levels of severity (0.64-0.70; Fig. 2), indicating consistent etiology for different levels of ED feature severity. We did not conduct the liability threshold analysis for the 1st percentile due to low power.

Joint categorical-continuous models of EDI-2 score and ED diagnoses
The cross-twin cross-trait correlations between EDI-2 score and AN/OED diagnoses were more than twice as large in MZ twins compared to DZ twins (Table 3, see the number of concordant and discordant twin pairs in online Supplementary Table S2). This suggests not only additive genetic influence (A), but possibly also non-additive genetic influence (D) on the association between EDI-2 score and AN/OEDs. However, in the univariate model of the EDI-2 score we found no influence of D and therefore did not consider it plausible that D would influence the covariance of EDI-2 score and AN/OEDs. ACE models did not fit significantly worse than saturated models according to the likelihood ratio tests and also had lower BIC values. AE models showed the best fit (Table 4). Heritability was similar for EDI-2 score (h² = 0.65), AN (h² = 0.63), and OEDs (h² = 0.67). The phenotypic correlation was lower for EDI-2 score and AN (rPH = 0.39) than for EDI-2 score and OEDs (rPH = 0.52) (Fig. 3). Similarly, the genetic correlation was lower for EDI-2 score and AN (rA = 0.26) than for EDI-2 score and OEDs (rA = 0.52) (online Supplementary Table S3; Fig. 3). Genetic factors accounted for 43% of the correlation between EDI-2 score and AN, and for 66% of the correlation between EDI-2 score and OEDs.
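The proportion of the phenotypic correlation attributable to genetic factors can be approximately recovered from the rounded estimates reported above, using the standard bivariate relation in which the genetic contribution to the correlation is the product of the square roots of the two heritabilities and the genetic correlation. The sketch below is a plausibility check on published, rounded values, not the authors' model output, and the function name is hypothetical.

```python
import math


def genetic_share_of_correlation(h2_x, h2_y, r_a, r_ph):
    """Share of the phenotypic correlation explained by overlapping genetic factors.

    Genetic contribution = sqrt(h2_x) * r_a * sqrt(h2_y); dividing by the
    phenotypic correlation r_ph gives the proportion explained.
    """
    genetic_part = math.sqrt(h2_x) * r_a * math.sqrt(h2_y)
    return genetic_part / r_ph


# Rounded estimates as reported in the text.
an = genetic_share_of_correlation(h2_x=0.65, h2_y=0.63, r_a=0.26, r_ph=0.39)
oed = genetic_share_of_correlation(h2_x=0.65, h2_y=0.67, r_a=0.52, r_ph=0.52)
print(f"AN: {an:.0%}, OED: {oed:.0%}")  # prints "AN: 43%, OED: 66%"
```

Because the inputs are rounded to two decimals, agreement with the reported 43% and 66% should be read as approximate.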

Discussion
We tested whether EDs can be viewed etiologically as the extreme end of a continuous distribution of ED features rather than as distinct disorders. The etiology of ED features, as measured with the EDI-2, appeared to be consistent across differing severity levels. To a moderate degree, the genetic influences on OED diagnoses also influenced continuous ED features. However, this did not hold true for AN, where the genetic influences on the diagnosis influenced continuous ED features to a lesser extent.
The genetic correlation between ED features and OED diagnoses was moderate. This result implies that OEDs can be conceptualized as the extreme end of dimensionally distributed ED features, rather than as discrete entities. Our findings extend previous evidence from phenotypic studies that EDs are on a continuum (Holm-Denoma, Richey, & Joiner, 2010;Luo et al., 2016;Olatunji et al., 2012;Tylka & Subich, 2003) and they add to the growing body of literature confirming psychiatric disorders as etiologically congruent with dimensional measures of psychopathology (Martin et al., 2018).
For AN, however, the evidence was less strong, as the genetic correlation with ED features was lower than for OEDs. This suggests that the conceptualization of psychiatric disorders as the extreme manifestation of etiologically continuous features might not apply to AN. This result is in line with previous research showing that AN differs from OEDs in a variety of important ways. These include (a) a higher stability of prevalence of AN over time and cultures compared to OEDs (Keel & Klump, 2003; Keski-Rahkonen & Mustelin, 2016), (b) potential differences in risk factors (e.g. childhood adversity; Larsen et al., 2017), (c) different coexisting disorders (Hudson et al., 2007), and (d) the fact that heritability estimates of AN decrease as definitions get broadened (Bulik et al., 2010; Dellava et al., 2011), while this has not been found for BN (Bulik et al., 2010). Furthermore, recent genome-wide association studies suggest that AN may have both psychiatric and metabolic genetic components (e.g. high-density lipoprotein cholesterol, fasting insulin, fasting glucose; Watson et al., 2019). Dimensional measures of ED features may capture the psychiatric components of AN, while they do not cover metabolic factors, which may have lowered the genetic correlation. An important direction for future research will be to evaluate how much of the variance in AN can be attributed to metabolic factors. An alternative explanation for the discontinuity is that the measured dimensions are not the […]
The non-shared environmental correlations between ED features and ED diagnoses were above 0.5. This is in contrast to other dimension-disorder relationships such as in autism spectrum disorder and ADHD, where the non-shared environmental correlations are lower (Taylor et al., 2019). In autism spectrum disorder and ADHD, factors such as maternal valproate use during pregnancy and paternal age (D'Onofrio et al., 2014; Modabbernia, Velthorst, & Reichenberg, 2017) might have a stronger relationship with the diagnoses than with the dimensions. In EDs, on the other hand, the higher non-shared environmental correlations between dimensions and diagnoses might reflect psychosocial factors such as pressure for thinness/weight-related teasing, peer dieting behavior, and media exposure to appearance ideals (Culbert, Racine, & Klump, 2015; Mazzeo & Bulik, 2009), which influence both dimensions and diagnoses.
Table 4. Model fit statistics of (a) the ACE and nested models for EDI-2 score in the full sample, (b) the DeFries-Fulker extremes analyses for EDI-2 score by threshold (percentile of the EDI-2 score), (c) the liability threshold models for EDI-2 score by threshold (percentile of the EDI-2 score), and (d) the joint categorical-continuous models for EDI-2 score with diagnoses of any ED, AN, and OED
Fig. 2. Variance component estimates in the full sample, the DeFries-Fulker extremes analyses and the liability threshold models. Extreme groups (probands) were defined using percentile-based cut-offs on the Eating Disorder Inventory-2 (1st, 3rd, 5th, and 10th percentile). Due to low power, the 1st percentile was not used in the liability threshold models. The numbers in larger font size within bars indicate the estimates; the numbers in smaller font size below each estimate indicate the 95% confidence interval for this estimate. Error bars visualize the 95% confidence intervals for the additive genetic contribution. A, additive genetic contribution; E, non-shared environmental contribution.

Implications
Our findings have important implications for future genomic studies of EDs and for how to conceptualize different ED presentations. First, our results suggest that the knowledge derived from studies of ED features may well generalize to diagnosable OEDs, but not necessarily to AN. Second, our data suggest that using ED features in genome-wide association studies may be a viable approach in order to increase sample size and improve statistical power for identifying common genetic variants of OEDs, as has been done for ADHD traits and depressive symptoms in the general population (Demontis et al., 2019;Direk et al., 2017;Middeldorp et al., 2016;Stergiakouli et al., 2015). However, our results indicate that genomic studies of AN should rely on categorical diagnoses rather than dimensional measurements of ED features.

Strengths and limitations
Our study has several strengths. We had a large sample size and used a validated measure to assess ED features. Uniquely, the linkage of the data with the NPR enabled us to estimate genetic correlations between diagnosed EDs and ED features, including differentiating between AN and OEDs. Additionally, we used parent- and self-reports to compensate for a likely underestimation of ED diagnoses in the NPR. Our findings also have to be considered in the light of several limitations. First, our results apply to females only, as we were underpowered to conduct the analyses for males. Previous research found evidence for quantitative and qualitative sex differences in genetic and environmental influences on ED features, that is, the magnitude of genetic and environmental effects as well as the type of genetic factors involved differs between males and females (Baker et al., 2009). Future studies therefore need to investigate a possible etiological continuum of EDs in males. Second, we were not able to differentiate BN from OEDs, due to the young age of the sample and the later onset of BN compared to AN (Micali et al., 2013; Zerwas et al., 2015). Therefore, studies with longer follow-up time are needed to investigate whether BN can be considered a continuously distributed phenomenon. Third, we were unable to distinguish between AN of the restricting v. binge-eating/purging subtype, which precluded our ability to identify factors that would render individuals with AN binge-eating/purging subtype more similar to individuals with BN than those with AN restricting subtype. Relatedly, individuals with self-reported purging and without any NPR diagnosis were collapsed into the OED group; however, a subset of these individuals could indeed have had AN of the binge-eating/purging subtype. The effect of this possible misclassification, given the obtained results, would have been an underestimation of the genetic correlation between features and diagnoses of OEDs. Finally, although the overall sample was large, the absolute number of registered ED diagnoses, especially BN, was low. However, we tried to compensate for this by using different methodological approaches as well as including parent- and self-reports in order to identify additional cases.

Conclusion
Our data suggest that a moderate proportion of genetic risks associated with OEDs are also associated with continuous variation in ED features, implying that OEDs can be considered the extreme manifestation of etiologically continuous ED features. Molecular genetic studies of OEDs could benefit from complementing the study of categorically defined OEDs with dimensional ED features, in order to increase statistical power to detect genetic variants. However, the evidence was less strong for AN, suggesting that AN might be more genetically distinct from ED features in the general population compared to OEDs.
Fig. 3. Variance components, correlations, and bivariate estimates from the joint categorical-continuous models. The numbers in larger font size within bars indicate the estimates; the numbers in smaller font size below each estimate indicate the 95% confidence interval for this estimate. Error bars visualize the 95% confidence intervals for the genetic and the non-shared environmental correlations. A, additive genetic contribution; E, non-shared environmental contribution; EDI-2, Eating Disorder Inventory-2 score; AN, anorexia nervosa; OED, other eating disorder.