We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
This chapter introduces a selection of methods applicable for identifying and extracting paradata from existing datasets and data documentation which can then be used to complement existing formal documentation of practices and processes. Data reuse, in its multiple forms, enables researchers to build upon the foundations laid by previous studies. Retrospective methods for eliciting paradata, including qualitative and quantitative backtracking and data forensics, provide means to get insights into past research practices and processes for data-driven analysis. The methods discussed in this chapter enhance understanding of data-related practices and processes, reproducibility of findings by facilitating the replication and verification of results through data reuse. Key references and further reading are provided after each method description.
This chapter introduces methods for generating and documenting paradata before and during data creation practices and processes (i.e. prospective and in-situ approaches, respectively). It introduces formal metadata-based paradata documentation using standards and controlled vocabularies to contribute to paradata consistency and interoperability. Narrative descriptions and recordings are advantageous for providing contextual richness and detailed documentation of data generation processes. Logging methods, including log files and blockchain technology, allow for automatic paradata generation and for maintaining the integrity of the record. Data management plans and registered reports are examples of measures to prospectively generate potential paradata on forthcoming activities. Finally, facilitative workflow-based approaches are introduced for step-by-step modelling of practices and processes. Rather than suggesting that a single approach to generating and documenting paradata will suffice, we encourage users to consider a selective combination of approaches, facilitated by adequate institutional resources, technical and subject expertise, to enhance the understanding, transparency, reproducibility and credibility of paradata describing practices and processes.
To make sense of data and use it effectively, it is essential to know where it comes from and how it has been processed and used. This is the domain of paradata, an emerging interdisciplinary field with wide applications. As digital data rapidly accumulates in repositories worldwide, this comprehensive introductory book, the first of its kind, shows how to make that data accessible and reusable. In addition to covering basic concepts of paradata, the book supports practice with coverage of methods for generating, documenting, identifying and managing paradata, including formal metadata, narrative descriptions and qualitative and quantitative backtracking. The book also develops a unifying reference model to help readers contextualise the role of paradata within a wider system of knowledge, practices and processes, and provides a vision for the future of the field. This guide to general principles and practice is ideal for researchers, students and data managers. This title is also available as open access on Cambridge Core.
Hidden Markov models (HMMs) are popular for modeling complex, longitudinal data. Existing identifiability theory for conventional HMMs assume emission probabilities are constant over time and the Markov chain governing transitions among the hidden states is irreducible, which are assumptions that may not be applicable in all educational and psychological research settings. We generalize existing conditions on homogeneous HMMs by considering heterogeneous HMMs with time-varying emission probabilities and the potential for absorbing states. Researchers are investigating a family of models known as restricted HMMs (RHMMs), which combine HMMs and restricted latent class models (RLCMs) to provide fine-grained classification of educationally and psychologically relevant attribute profiles over time. These RHMMs leverage the benefits of RLCMs and HMMs to understand changes in attribute profiles within longitudinal designs. The identifiability of RHMM parameters is a critical issue for ensuring successful applications and accurate statistical inference regarding factors that impact outcomes in intervention studies. We establish identifiability conditions for RHMMs. The new identifiability conditions for heterogeneous HMMs and RHMMs provide researchers insights for designing interventions. We discuss different types of assessment designs and the implications for practice. We present an application of a heterogeneous HMM to daily measures of positive and negative affect.
Femoral neck bone mineral density (FNBMD) is a high risk factor for femoral head fractures, and coffee intake affects bone mineral density, but the effect on FNBMD remains to be explored. First, we conducted an observational study in the National Health and Nutrition Examination Survey and collected data on coffee intake, FNBMD, and sixteen covariates. Weight linear regression was used to explore the association of coffee intake with FNBMD. Then, Mendelian randomisation (MR) was used to explore the causal relationship between coffee intake and FNBMD, the exposure factor was coffee intake, and the outcome factor was FNBMD. The inverse variance weighting (IVW) method was used for the analysis, while heterogeneity tests, sensitivity, and pleiotropy analysis were performed. A total of 5 915 people were included in the cross-sectional study, including 3 178 men and 2 737 women. In the completely adjusted model, no coffee was used as a reference. The ORs for the overall population at ‘< 1’, ‘1–<2’, ‘2–<4’, and ‘4+’ (95% CI) were 0.02 (–0.01, 0.04), 0.00 (–0.01, 0.02), –0.01 (–0.02, 0.00), and 0.00 (–0.01, 0.02), respectively. The male and female population showed no statistically significant differences in both univariate and multivariate linear regressions. In the MR study, the IVW results showed an OR (95% CI) of 1.06 (0.88–1.27), a P-value of 0.55, and an overall F-value of 80.31. The heterogeneity, sensitivity analyses, and pleiotropy had no statistical significance. Our study used cross-sectional studies and MR to demonstrate that there is no correlation or causal relationship between coffee intake and FNBMD.
Ultra-thin liquid sheets generated by impinging two liquid jets are crucial high-repetition-rate targets for laser ion acceleration and ultra-fast physics, and serve widely as barrier-free samples for structural biochemistry. The impact of liquid viscosity on sheet thickness should be comprehended fully to exploit its potential. Here, we demonstrate experimentally that viscosity significantly influences thickness distribution, while surface tension primarily governs shape. We propose a thickness model based on momentum exchange and mass transport within the radial flow, which agrees well with the experiments. These results provide deeper insights into the behaviour of liquid sheets and enable accurate thickness control for various applications, including atomization nozzles and laser-driven particle sources.
Depression has been linked to disruptions in resting-state networks (RSNs). However, inconsistent findings on RSN disruptions, with variations in reported connectivity within and between RSNs, complicate the understanding of the neurobiological mechanisms underlying depression.
Methods
A systematic literature search of PubMed and Web of Science identified studies that employed resting-state functional magnetic resonance imaging (fMRI) to explore RSN changes in depression. Studies using seed-based functional connectivity analysis or independent component analysis were included, and coordinate-based meta-analyses were performed to evaluate alterations in RSN connectivity both within and between networks.
Results
A total of 58 studies were included, comprising 2321 patients with depression and 2197 healthy controls. The meta-analysis revealed significant alterations in RSN connectivity, both within and between networks, in patients with depression compared with healthy controls. Specifically, within-network changes included both increased and decreased connectivity in the default mode network (DMN) and increased connectivity in the frontoparietal network (FPN). Between-network findings showed increased DMN–FPN and limbic network (LN)–DMN connectivity, decreased DMN–somatomotor network and LN–FPN connectivity, and varied ventral attention network (VAN)–dorsal attentional network (DAN) connectivity. Additionally, a positive correlation was found between illness duration and increased connectivity between the VAN and DAN.
Conclusions
These findings not only provide a comprehensive characterization of RSN disruptions in depression but also enhance our understanding of the neurobiological mechanisms underlying depression.
Psychostimulants and nonstimulants have partially overlapping pharmacological targets on attention-deficit/hyperactivity disorder (ADHD), but whether their neuroimaging underpinnings differ is elusive. We aimed to identify overlapping and medication-specific brain functional mechanisms of psychostimulants and nonstimulants on ADHD.
Methods
After a systematic literature search and database construction, the imputed maps of separate and pooled neuropharmacological mechanisms were meta-analyzed by Seed-based d Mapping toolbox, followed by large-scale network analysis to uncover potential coactivation patterns and meta-regression analysis to examine the modulatory effects of age and sex.
Results
Twenty-eight whole-brain task-based functional MRI studies (396 cases in the medication group and 459 cases in the control group) were included. Possible normalization effects of stimulant and nonstimulant administration converged on increased activation patterns of the left supplementary motor area (Z = 1.21, p < 0.0001, central executive network). Stimulants, relative to nonstimulants, increased brain activations in the left amygdala (Z = 1.30, p = 0.0006), middle cingulate gyrus (Z = 1.22, p = 0.0008), and superior frontal gyrus (Z = 1.27, p = 0.0006), which are within the ventral attention network. Neurodevelopmental trajectories emerged in activation patterns of the right supplementary motor area and left amygdala, with the left amygdala also presenting a sex-related difference.
Conclusions
Convergence in the left supplementary motor area may delineate novel therapeutic targets for effective interventions, and distinct neural substrates could account for different therapeutic responses to stimulants and nonstimulants.
Let $\overline {M}(a,c,n)$ denote the number of overpartitions of n with first residual crank congruent to a modulo c with $c\geq 3$ being odd and $0\leq a<c$. The central objective of this paper is twofold: firstly, to establish an asymptotic formula for the crank of overpartitions; and secondly, to establish several inequalities concerning $\overline {M}(a,c,n)$ that encompasses crank differences, positivity, and strict log-subadditivity.
Depression is associated with serious disease burden. Despite the multitude of antidepressant options available, the adherence rate is often low. Accounting for patient preferences can potentially boost adherence to antidepressant medication and elevate patient satisfaction. However, limited evidence exists regarding patient preferences for antidepressant selection. This study aims to elicit patient preferences regarding the benefits, risks, and cost attributes of antidepressants in China.
Methods
A best-worst scaling profile case experiment was conducted using a face-to-face survey administered to patients diagnosed with depression. Patients were recruited from general and psychiatric hospitals. We utilized a multiphase approach that integrated literature review, expert consultation, and best-worst scaling to develop attributes within choice sets. The attributes with each varying across two or three levels encompassed remission rate, sleep disorders, risk of headache or dizziness, risk of gastrointestinal adverse events, risk of liver or kidney injury, and monthly out-of-pocket costs. Each respondent answered seven choice tasks, including a dominant task. Data were analyzed using conditional logit, mixed logit, and generalized multinomial logit models. Subgroup analyses were conducted to explore preference heterogeneity.
Results
A total of 331 participants completed the survey and met the inclusion criteria. Almost all attribute levels were statistically significant. Overall, the most desirable characteristics of antidepressant medications were higher remission rates (80% and 55% rates; p<0.05), lower risk of liver or kidney injury (1% rate; p<0.05), and fewer monthly out-of-pocket costs (CNY100 [USD13.93, EUR12.75]; p<0.05). Risks of gastrointestinal adverse events (60% and 35% rates) and insomnia were the least preferred features. Regarding attributes, efficacy, the risk of gastrointestinal adverse events, and sleep disorders were relatively important factors influencing patient choice. Preferences differed slightly by age, degree of education, personal annual income, and treatments currently received.
Conclusions
Our study suggests that efficacy, gastrointestinal adverse effects, sleep disorders, and treatment costs are critical drivers behind medication choices among patients with depression. Preference heterogeneity also exists regarding individual and therapeutic characteristics, which need more samples and further analyses to identify. These discoveries hold the potential to enrich the shared decision-making process between physicians and patients within healthcare settings.
Computer-based interactive items have become prevalent in recent educational assessments. In such items, detailed human–computer interactive process, known as response process, is recorded in a log file. The recorded response processes provide great opportunities to understand individuals’ problem solving processes. However, difficulties exist in analyzing these data as they are high-dimensional sequences in a nonstandard format. This paper aims at extracting useful information from response processes. In particular, we consider an exploratory analysis that extracts latent variables from process data through a multidimensional scaling framework. A dissimilarity measure is described to quantify the discrepancy between two response processes. The proposed method is applied to both simulated data and real process data from 14 PSTRE items in PIAAC 2012. A prediction procedure is used to examine the information contained in the extracted latent variables. We find that the extracted latent variables preserve a substantial amount of information in the process and have reasonable interpretability. We also empirically prove that process data contains more information than classic binary item responses in terms of out-of-sample prediction of many variables.
Diagnostic classification models are confirmatory in the sense that the relationship between the latent attributes and responses to items is specified or parameterized. Such models are readily interpretable with each component of the model usually having a practical meaning. However, parameterized diagnostic classification models are sometimes too simple to capture all the data patterns, resulting in significant model lack of fit. In this paper, we attempt to obtain a compromise between interpretability and goodness of fit by regularizing a latent class model. Our approach starts with minimal assumptions on the data structure, followed by suitable regularization to reduce complexity, so that readily interpretable, yet flexible model is obtained. An expectation–maximization-type algorithm is developed for efficient computation. It is shown that the proposed approach enjoys good theoretical properties. Results from simulation studies and a real application are presented.
This paper establishes fundamental results for statistical analysis based on diagnostic classification models (DCMs). The results are developed at a high level of generality and are applicable to essentially all diagnostic classification models. In particular, we establish identifiability results for various modeling parameters, notably item response probabilities, attribute distribution, and Q-matrix-induced partial information structure. These results are stated under a general setting of latent class models. Through a nonparametric Bayes approach, we construct an estimator that can be shown to be consistent when the identifiability conditions are satisfied. Simulation results show that these estimators perform well under various model settings. We also apply the proposed method to a dataset from the National Epidemiological Survey on Alcohol and Related Conditions (NESARC).