We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
The theory of kernels offers a rich mathematical framework for the archetypical tasks of classification and regression. Its core insight consists of the representer theorem that asserts that an unknown target function underlying a dataset can be represented by a finite sum of evaluations of a singular function, the so-called kernel function. Together with the infamous kernel trick that provides a practical way of incorporating such a kernel function into a machine learning method, a plethora of algorithms can be made more versatile. This chapter first introduces the mathematical foundations required for understanding the distinguished role of the kernel function and its consequence in terms of the representer theorem. Afterwards, we show how selected popular algorithms, including Gaussian processes, can be promoted to their kernel variant. In addition, several ideas on how to construct suitable kernel functions are provided, before demonstrating the power of kernel methods in the context of quantum (chemistry) problems.
This chapter proposes a novel approach to the development and assessment of English disaster and environmental risk information for people from diverse language and cultural backgrounds who require readable, more accessible translations, regardless of education level, and cultural or linguistic background.¬¬¬ To illustrate the development of machine learning classifiers for the purpose of the predictive assessment of the likelihood that an English text will be translated into an accessible language, we will use Japanese as an illustrative case study, given the language and cultural contrast between English and Japanese.
Internet addiction (IA) refers to excessive internet use that causes cognitive impairment or distress. Understanding the neurophysiological mechanisms underpinning IA is crucial for enabling an accurate diagnosis and informing treatment and prevention strategies. Despite the recent increase in studies examining the neurophysiological traits of IA, their findings often vary. To enhance the accuracy of identifying key neurophysiological characteristics of IA, this study used the phase lag index (PLI) and weighted PLI (WPLI) methods, which minimize volume conduction effects, to analyze the resting-state electroencephalography (EEG) functional connectivity. We further evaluated the reliability of the identified features for IA classification using various machine learning methods.
Methods
Ninety-two participants (42 with IA and 50 healthy controls (HCs)) were included. PLI and WPLI values for each participant were computed, and values exhibiting significant differences between the two groups were selected as features for the subsequent classification task.
Results
Support vector machine (SVM) achieved an 83% accuracy rate using PLI features and an improved 86% accuracy rate using WPLI features. t-test results showed analogous topographical patterns for both the WPLI and PLI. Numerous connections were identified within the delta and gamma frequency bands that exhibited significant differences between the two groups, with the IA group manifesting an elevated level of phase synchronization.
Conclusions
Functional connectivity analysis and machine learning algorithms can jointly distinguish participants with IA from HCs based on EEG data. PLI and WPLI have substantial potential as biomarkers for identifying the neurophysiological traits of IA.
Electromyogram (EMG) has been a fundamental approach for prosthetic hand control. However it is limited by the functionality of residual muscles and muscle fatigue. Currently, exploring temporal shifts in brain networks and accurately classifying noninvasive electroencephalogram (EEG) for prosthetic hand control remains challenging. In this manuscript, it is hypothesized that the coordinated and synchronized temporal patterns within the brain network, termed as brain synergy, contain valuable information to decode hand movements. 32-channel EEGs were acquired from 10 healthy participants during hand grasp and open. Synergistic spatial distribution pattern and power spectra of brain activity were investigated using independent component analysis of EEG. Out of 32 EEG channels, 15 channels spanning the frontal, central and parietal regions were strategically selected based on the synergy of spatial distribution pattern and power spectrum of independent components. Time-domain and synergistic features were extracted from the selected 15 EEG channels. These features were employed to train a Bayesian optimizer-based support vector machine (SVM). The optimized SVM classifier could achieve an average testing accuracy of 94.39 $ \pm $ .84% using synergistic features. The paired t-test showed that synergistic features yielded significantly higher area under curve values (p < .05) compared to time-domain features in classifying hand movements. The output of the classifier was employed for the control of the prosthetic hand. This synergistic approach for analyzing temporal activities in motor control and control of prosthetic hands have potential contributions to future research. It addresses the limitations of EMG-based approaches and emphasizes the effectiveness of synergy-based control for prostheses.
Recent conceptualisations of bilingualism are moving away from strict categorisations, towards continuous approaches. This study supports this trend by combining empirical psycholinguistics data with machine learning classification modelling. Support vector classifiers were trained on two datasets of coded productions by Italian speakers to predict the class they belonged to (“monolingual”, “attriters” and “heritage”). All classes can be predicted above chance (>33%), even if the classifier's performance substantially varies, with monolinguals identified much better (f-score >70%) than attriters (f-score <50%), which are instead the most confusable class. Further analyses of the classification errors expressed in the confusion matrices qualify that attriters are identified as heritage speakers nearly as often as they are correctly classified. Cluster clitics are the most identifying features for the classification performance. Overall, this study supports a conceptualisation of bilingualism as a continuum of linguistic behaviours rather than sets of a priori established classes.
Coronaviruses of the human variety have been the culprit of global epidemics of varying levels of lethality, including COVID-19, which has impacted more than 200 countries and resulted in 5.7 million fatalities as of May 2022. Effective clinical management necessitates the allocation of sufficient resources and the employment of appropriately skilled personnel. The elderly population and individuals with diabetes are at increased risk of more severe manifestations of COVID-19. Countries with a higher gross domestic product (GDP) typically exhibit superior health outcomes and reduced mortality rates. Here, we suggest a predictive model for the density of medical doctors and nursing personnel for 134 countries using a support vector machine (SVM). The model was trained in 107 countries and tested in 27, with promising results shown by the kappa statistics and ROC analysis. The SVM model used for predictions showed promising results with a high level of agreement between actual and predicted cluster values.
Kernel methods provide an alternative family of non-linear methods to neural networks, with support vector machine being the best known among kernel methods. Almost all linear statistical methods have been non-linearly generalized by the kernel approach, including ridge regression, linear discriminant analysis, principal component analysis, canonical correlation analysis, and so on. The kernel method has also been extended to probabilisitic models, for example Gaussian processes.
Although aberrant brain regional responses are reported in social anxiety disorder (SAD), little is known about resting-state functional connectivity at the macroscale network level. This study aims to identify functional network abnormalities using a multivariate data-driven method in a relatively large and homogenous sample of SAD patients, and assess their potential diagnostic value.
Methods
Forty-six SAD patients and 52 demographically-matched healthy controls (HC) were recruited to undergo clinical evaluation and resting-state functional MRI scanning. We used group independent component analysis to characterize the functional architecture of brain resting-state networks (RSNs) and investigate between-group differences in intra-/inter-network functional network connectivity (FNC). Furtherly, we explored the associations of FNC abnormalities with clinical characteristics, and assessed their ability to discriminate SAD from HC using support vector machine analyses.
Results
SAD patients showed widespread intra-network FNC abnormalities in the default mode network, the subcortical network and the perceptual system (i.e. sensorimotor, auditory and visual networks), and large-scale inter-network FNC abnormalities among those high-order and primary RSNs. Some aberrant FNC signatures were correlated to disease severity and duration, suggesting pathophysiological relevance. Furthermore, intrinsic FNC anomalies allowed individual classification of SAD v. HC with significant accuracy, indicating potential diagnostic efficacy.
Conclusions
SAD patients show distinct patterns of functional synchronization abnormalities both within and across large-scale RSNs, reflecting or causing a network imbalance of bottom-up response and top-down regulation in cognitive, emotional and sensory domains. Therefore, this could offer insights into the neurofunctional substrates of SAD.
Although many previous studies reported structural plasticity of the hippocampus and amygdala induced by electroconvulsive therapy (ECT) in major depressive disorder (MDD), yet the exact roles of both areas for antidepressant effects are still controversial.
Methods
In the current study, segmentation of amygdala and hippocampal sub-regions was used to investigate the longitudinal changes of volume, the relationship between volume and antidepressant effects, and prediction performances for ECT in MDD patients before and after ECT using two independent datasets.
Results
As a result, MDD patients showed selectively and consistently increased volume in the left lateral nucleus, right accessory basal nucleus, bilateral basal nucleus, bilateral corticoamygdaloid transition (CAT), bilateral paralaminar nucleus of the amygdala, and bilateral hippocampus-amygdala transition area (HATA) after ECT in both datasets, whereas marginally significant increase of volume in bilateral granule cell molecular layer of the head of dentate gyrus, the bilateral head of cornu ammonis (CA) 4, and left head of CA 3. Correlation analyses revealed that increased volume of left HATA was significantly associated with antidepressant effects after ECT. Moreover, volumes of HATA in the MDD patients before ECT could be served as potential biomarkers to predict ECT remission with the highest accuracy of 86.95% and 82.92% in two datasets (The predictive models were trained on Dataset 2 and the sensitivity, specificity and accuracy of Dataset 2 were obtained from leave-one-out-cross-validation. Thus, they were not independent and very likely to be inflated).
Conclusions
These results not only suggested that ECT could selectively induce structural plasticity of the amygdala and hippocampal sub-regions associated with antidepressant effects of ECT in MDD patients, but also provided potential biomarkers (especially HATA) for effectively and timely interventions for ECT in clinical applications.
Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy.
Methods
The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400).
Results
We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240.
Conclusions
The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia.
Tourette disorder (TD), hallmarks of which are motor and vocal tics, has been related to functional abnormalities in large-scale brain networks. Using a fully data driven approach in a prospective, case–control study, we tested the hypothesis that functional connectivity of these networks carries a neural signature of TD. Our aim was to investigate (i) the brain networks that distinguish adult patients with TD from controls, and (ii) the effects of antipsychotic medication on these networks.
Methods
Using a multivariate analysis based on support vector machine (SVM), we developed a predictive model of resting state functional connectivity in 48 patients and 51 controls, and identified brain networks that were most affected by disease and pharmacological treatments. We also performed standard univariate analyses to identify differences in specific connections across groups.
Results
SVM was able to identify TD with 67% accuracy (p = 0.004), based on the connectivity in widespread networks involving the striatum, fronto-parietal cortical areas and the cerebellum. Medicated and unmedicated patients were discriminated with 69% accuracy (p = 0.019), based on the connectivity among striatum, insular and cerebellar networks. Univariate approaches revealed differences in functional connectivity within the striatum in patients v. controls, and between the caudate and insular cortex in medicated v. unmedicated TD.
Conclusions
SVM was able to identify a neuronal network that distinguishes patients with TD from control, as well as medicated and unmedicated patients with TD, holding a promise to identify imaging-based biomarkers of TD for clinical use and evaluation of the effects of treatment.
A few former studies suggested that there are partial overlaps in abnormal brain structure and cognitive function between hypochondriasis (HS) and schizophrenia (SZ). But their differences in brain activity and cognitive function were unclear.
Methods:
Twenty-one HS patients, 23 SZ patients, and 24 healthy controls (HC) underwent resting-state functional magnetic resonance imaging (rs-fMRI) with the regional homogeneity analysis (ReHo), subsequently exploring the relationship between ReHo value and cognitive functions. The support vector machines (SVM) were used on effectiveness evaluation of ReHo for differentiating HS from SZ.
Results:
Compared with HC, HS showed significantly increased ReHo values in right middle temporal gyrus (MTG), left inferior parietal lobe (IPL), and right fusiform gyrus (FG), while SZ showed increased ReHo in left insula, decreased ReHo values in right paracentral lobule. Additionally, HS showed significantly higher ReHo values in FG, MTG, and left paracentral lobule, but lower in insula than SZ. The higher ReHo values in insula were associated with worse performance in MATRICS consensus cognitive battery (MCCB) in HS group. SVM analysis showed a combination of the ReHo values in insula and FG was able to satisfactorily distinguish the HS and SZ patients.
Conclusion:
Our results suggested that the altered default mode network (DMN), of which abnormal spontaneous neural activity occurs in multiple brain regions, might play a key role in the pathogenesis of HS, and the resting-state alterations of insula are closely related to cognitive dysfunction in HS. Furthermore, the combination of the ReHo in FG and insula was a relatively ideal indicator to distinguish HS from SZ.
This book will help readers understand fundamental and advanced statistical models and deep learning models for robust speaker recognition and domain adaptation. This useful toolkit enables readers to apply machine learning techniques to address practical issues, such as robustness under adverse acoustic environments and domain mismatch, when deploying speaker recognition systems. Presenting state-of-the-art machine learning techniques for speaker recognition and featuring a range of probabilistic models, learning algorithms, case studies, and new trends and directions for speaker recognition based on modern machine learning and deep learning, this is the perfect resource for graduates, researchers, practitioners and engineers in electrical engineering, computer science and applied mathematics.
To improve speech emotion recognition, a U-acoustic words emotion dictionary (AWED) features model is proposed based on an AWED. The method models emotional information from acoustic words level in different emotion classes. The top-list words in each emotion are selected to generate the AWED vector. Then, the U-AWED model is constructed by combining utterance-level acoustic features with the AWED features. Support vector machine and convolutional neural network are employed as the classifiers in our experiment. The results show that our proposed method in four tasks of emotion classification all provides significant improvement in unweighted average recall.
A mobile manipulator system (MMS) consists of a robotic arm mounted on a mobile platform that is used in rescue and relief, space exploration, warehouse automation, etc. As the total system has 14 Degrees of Freedom (DOF), it does not have a closed-form inverse kinematics (IK) solution. A learning-based method is proposed, which uses the forward kinematics data to learn the IK relation for motion of an MMS on a rough terrain, using a one-class support vector machine (SVM) framework. Once trained, the model estimates the joint probability distribution of the MMS configuration and end-effector position. This distribution is used to find the MMS configuration for a given desired end-effector path. Past research using a Kohonen Self organizing map (KSOM) neural network-based open-loop control method has shown that the MMS deviates from its desired path while moving on an uneven terrain due to unknown disturbances such as wheel slip, slide, and terrain deformation. Therefore, a new sequential two-stage SVM-based end-effector path-tracking control scheme is proposed to control the end-effector path. In this scheme, the error in the end-effector path is continuously tracked with the help of a Microsoft Kinect 2.0 (Microsoft Regional Sales, Singapore 119968) and is sent as a feedback to the controller. Once the error reaches a threshold value, the error correction step of the controller gets activated to correct the error until the desired accuracy is reached. The effectiveness of the proposed approach is proved through extensive simulations and experiments conducted on 3D terrain in which it is shown that the end effector can follow the desired path with an average experimental error of around 2 cm between the desired and final corrected path.
We show that several machine learning estimators, including square-root least absolute shrinkage and selection and regularized logistic regression, can be represented as solutions to distributionally robust optimization problems. The associated uncertainty regions are based on suitably defined Wasserstein distances. Hence, our representations allow us to view regularization as a result of introducing an artificial adversary that perturbs the empirical distribution to account for out-of-sample effects in loss estimation. In addition, we introduce RWPI (robust Wasserstein profile inference), a novel inference methodology which extends the use of methods inspired by empirical likelihood to the setting of optimal transport costs (of which Wasserstein distances are a particular case). We use RWPI to show how to optimally select the size of uncertainty regions, and as a consequence we are able to choose regularization parameters for these machine learning estimators without the use of cross validation. Numerical experiments are also given to validate our theoretical findings.
Text readability assessment is a challenging interdisciplinary endeavor with rich practical implications. It has long drawn the attention of researchers internationally, and the readability models since developed have been widely applied to various fields. Previous readability models have only made use of linguistic features employed for general text analysis and have not been sufficiently accurate when used to gauge domain-specific texts. In view of this, this study proposes a latent-semantic-analysis (LSA)-constructed hierarchical conceptual space that can be used to train a readability model to accurately assess domain-specific texts. Compared with a baseline reference using a traditional model, the new model improves by 13.88% to achieve 68.98% of accuracy when leveling social science texts, and by 24.61% to achieve 73.96% of accuracy when assessing natural science texts. We then combine the readability features developed for the current study with general linguistic features, and the accuracy of leveling social science texts improves by an even higher degree of 31.58% to achieve 86.68%, and that of natural science texts by 26.56% to achieve 75.91%. These results indicate that the readability features developed in this study can be used both to train a readability model for leveling domain-specific texts and also in combination with the more common linguistic features to enhance the efficacy of the model. Future research can expand the generalizability of the model by assessing texts from different fields and grade levels using the proposed method, thus enhancing the practical applications of this new method.
An intelligent decision-making method was proposed for airport bird-repelling based on a Support Vector Machine (SVM) and bird-strike risk assessment. The bird-strike risk assessment model is established with two exponential functions to separate the risk levels, while the SVM method includes two steps of training and testing. After the risk assessment, the Bird-Repelling Strategy Classification Model (BRSCM) was trained based on the expert knowledge and large amount of historical bird information collected by the airport linkage system for bird detection, surveillance and repelling. Then, in the testing step, the BRSCM was continuously optimised according to the real-time intelligent bird-repelling strategy results. Through several bird-repelling examples of a certain airport, it is demonstrated that the decision accuracy of BRSCM is relatively high, and it could solve new problems by self-correction. The proposed method achieved the optimised operation of multiple bird-repelling devices against real-time bird information with great improvement of bird-repelling effects, overcoming the tolerance of birds to the bird-repelling devices due to their long-term repeated operation.
To elucidate the mechanisms of how snack foods may induce non-homeostatic food intake, we used resting state functional magnetic resonance imaging (fMRI), as resting state networks can individually adapt to experience after short time exposures. In addition, we used graph theoretical analysis together with machine learning techniques (support vector machine) to identifying biomarkers that can categorize between high-caloric (potato chips) vs. low-caloric (zucchini) food stimulation.
Methods
Seventeen healthy human subjects with body mass index (BMI) 19 to 27 underwent 2 different fMRI sessions where an initial resting state scan was acquired, followed by visual presentation of different images of potato chips and zucchini. There was then a 5-minute pause to ingest food (day 1=potato chips, day 3=zucchini), followed by a second resting state scan. fMRI data were further analyzed using graph theory analysis and support vector machine techniques.
Results
Potato chips vs. zucchini stimulation led to significant connectivity changes. The support vector machine was able to accurately categorize the 2 types of food stimuli with 100% accuracy. Visual, auditory, and somatosensory structures, as well as thalamus, insula, and basal ganglia were found to be important for food classification. After potato chips consumption, the BMI was associated with the path length and degree in nucleus accumbens, middle temporal gyrus, and thalamus.
Conclusion
The results suggest that high vs. low caloric food stimulation in healthy individuals can induce significant changes in resting state networks. These changes can be detected using graph theory measures in conjunction with support vector machine. Additionally, we found that the BMI affects the response of the nucleus accumbens when high caloric food is consumed.
The error of a distributed algorithm for big data classification with a support vector machine (SVM) is analysed in this paper. First, the given big data sets are divided into small subsets, on which the classical SVM with Gaussian kernels is used. Then, the classification error of the SVM for each subset is analysed based on the Tsybakov exponent, geometric noise, and width of the Gaussian kernels. Finally, the whole error of the distributed algorithm is estimated in terms of the error of each subset.