Search

8 - Supervised Learning
from Part III - Machine Learning for Data Science
Chirag Shah, University of Washington
Book:

A Hands-On Introduction to Data Science with R

Published online:

07 February 2026

Print publication:

22 January 2026, pp 203-256
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter provides a comprehensive introduction to supervised learning techniques for classification problems. It begins with logistic regression for binary classification, explaining the sigmoid function and gradient ascent optimization. The chapter then covers softmax regression for multi-class problems, followed by k-nearest neighbors (kNN) as an intuitive distance-based classifier.
Decision trees are explored in detail, including entropy, information gain, and the ID3 algorithm, along with derived decision rules and association rules. Random forests are presented as an ensemble method that addresses overfitting by combining multiple decision trees.
The chapter covers Naive Bayes classification based on Bayes’ theorem, despite its "naive" independence assumption. Finally, Support Vector Machines (SVMs) are introduced for both linear and non-linear classification using maximum margin hyperplanes.
Each technique includes hands-on R programming examples with real datasets, practical applications, and exercises to reinforce learning concepts.

Implementation of support vector machines to classify abnormal neuronal response during emotion regulation at an individual level in patients with newly diagnosed bipolar disorder – and its association with subsequent functional changes and mood episodes
Robert James Richard Blair, Alexander Tobias Ysbæk-Nielsen, Hanne Lie Kjærstad, Sahil Bajaj, Klara Coello, Maura Faurholt-Jepsen, Maj Vinberg, Lars Vedel Kessing, Julian Macoveanu, Kamilla Miskowiak
Journal:

Psychological Medicine / Volume 55 / 2025

Published online by Cambridge University Press:

11 November 2025, e338
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
In this study, a classifier (hyperplane) is determined to distinguish the neural responses during emotion regulation versus viewing images in healthy adults and then applied to determine (i) the effectiveness of the emotion regulation response (defined as emotion regulation distance from the hyperplane [DFHER]) in independent samples of healthy adults, patients with BD, and the patients’ unaffected relatives (URs) and (ii) the association of DFHER with the duration of future (hypo)manic and depressive episodes for patients with BD over a 16-month follow-up period.
Methods
Study participants (N = 226) included 65 healthy adults (35 used for support vector machine [SVM] learning [HCTrain] and 30 kept as an independent test sample [HCTest]), 87 patients with newly diagnosed BD (67% BD type 2) and 74 URs. BOLD response data came from an emotion regulation task. Clinical symptoms were assessed at baseline fMRI and after 16 months of specialized treatment.
Results
The SVM ML analysis identified a hyperplane with 75.7% accuracy. Patients with BD showed reduced DFHER relative to the HCTest and UR groups. Reduced DFHER was associated with reduced improvement in psychosocial functioning during the 16-month follow-up time (B = −1.663, p = 0.02).
Conclusions
The neural response during emotion regulation can be relatively well distinguished in healthy adults via ML. Patients with newly diagnosed BD show significant disruption in the recruitment of this emotion regulation response. Disrupted may indicate a reduced capacity for functional improvement during specialized treatment in a mood disorder clinic.

6 - Big Artificial Intelligence
Wilma A. Bainbridge, University of Chicago
Book:

Big Data in the Psychological Sciences

Published online:

23 October 2025

Print publication:

23 October 2025, pp 92-116
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter describes the important role of artificial intelligence (AI) in Big Data psychology research. First, we discuss the main goals of AI, and then delve into an example of machine learning and what is happening under the hood. The chapter then describes the Perceptron, a classic simple neural network, and how this has grown into deep learning AI which has become increasingly popular in recent years. Deep learning can be used both for prediction and generation, and has a multitude of applications for psychology and neuroscience. This chapter concludes with the ethical quandaries around fake data generated by AI and biases that exist in how we train systems, as well as some exciting clinical applications of AI relevant to psychology and neuroscience.

4 - Gaussian processes and other kernel methods
Anna Dawid, Uniwersytet Warszawski, Poland, Julian Arnold, Universität Basel, Switzerland, Borja Requena, ICFO - The Institute of Photonic Sciences, Alexander Gresch, Heinrich-Heine-Universität Düsseldorf, Marcin Płodzień, ICFO - The Institute of Photonic Sciences, Kaelan Donatella, Université de Paris VII (Denis Diderot), Kim A. Nicoli, University of Bonn, Paolo Stornati, ICFO - The Institute of Photonic Sciences, Rouven Koch, Aalto University, Finland, Miriam Büttner, Albert-Ludwigs-Universität Freiburg, Germany, Robert Okuła, Gdańsk University of Technology, Gorka Muñoz-Gil, Universität Innsbruck, Austria, Rodrigo A. Vargas-Hernández, McMaster University, Ontario, Alba Cervera-Lierta, Centro Nacional de Supercomputación, Juan Carrasquilla, Swiss Federal Institute of Technology in Zurich, Vedran Dunjko, Universiteit Leiden, Marylou Gabrié, Institut Polytechnique de Paris, Patrick Huembeli, Evert van Nieuwenburg, Universiteit Leiden, Filippo Vicentini, Institut Polytechnique de Paris, Lei Wang, Chinese Academy of Sciences, Beijing, Sebastian J. Wetzel, University of Waterloo, Ontario, Giuseppe Carleo, École Polytechnique Fédérale de Lausanne, Eliška Greplová, Technische Universiteit Delft, The Netherlands, Roman Krems, University of British Columbia, Vancouver, Florian Marquardt, Max-Planck-Institut für die Wissenschaft des Lichts, Michał Tomza, Uniwersytet Warszawski, Maciej Lewenstein, ICFO - Institute of Photonic Sciences, Alexandre Dauphin, Instituto de Ciencias Fotónicas
Book:

Machine Learning in Quantum Sciences

Published online:

13 June 2025

Print publication:

12 June 2025, pp 76-110
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The theory of kernels offers a rich mathematical framework for the archetypical tasks of classification and regression. Its core insight consists of the representer theorem that asserts that an unknown target function underlying a dataset can be represented by a finite sum of evaluations of a singular function, the so-called kernel function. Together with the infamous kernel trick that provides a practical way of incorporating such a kernel function into a machine learning method, a plethora of algorithms can be made more versatile. This chapter first introduces the mathematical foundations required for understanding the distinguished role of the kernel function and its consequence in terms of the representer theorem. Afterwards, we show how selected popular algorithms, including Gaussian processes, can be promoted to their kernel variant. In addition, several ideas on how to construct suitable kernel functions are provided, before demonstrating the power of kernel methods in the context of quantum (chemistry) problems.

5 - Improving the Readability of Japanese Translations of Natural Disaster Risks Through Predictive Automated English Information Design
Meng Ji, University of Sydney, Michael Oakes, University of Birmingham
Book:

Multilingual Environmental Communications

Published online:

16 May 2025

Print publication:

05 June 2025, pp 104-133
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter proposes a novel approach to the development and assessment of English disaster and environmental risk information for people from diverse language and cultural backgrounds who require readable, more accessible translations, regardless of education level, and cultural or linguistic background.¬¬¬ To illustrate the development of machine learning classifiers for the purpose of the predictive assessment of the likelihood that an English text will be translated into an accessible language, we will use Japanese as an illustrative case study, given the language and cultural contrast between English and Japanese.

Classification of internet addiction using machine learning on electroencephalography synchronization and functional connectivity
Hsu-Wen Huang, Po-Yu Li, Meng-Cin Chen, You-Xun Chang, Chih-Ling Liu, Po-Wei Chen, Qiduo Lin, Chemin Lin, Chih-Mao Huang, Shun-Chi Wu
Journal:

Psychological Medicine / Volume 55 / 2025

Published online by Cambridge University Press:

16 May 2025, e148
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
Internet addiction (IA) refers to excessive internet use that causes cognitive impairment or distress. Understanding the neurophysiological mechanisms underpinning IA is crucial for enabling an accurate diagnosis and informing treatment and prevention strategies. Despite the recent increase in studies examining the neurophysiological traits of IA, their findings often vary. To enhance the accuracy of identifying key neurophysiological characteristics of IA, this study used the phase lag index (PLI) and weighted PLI (WPLI) methods, which minimize volume conduction effects, to analyze the resting-state electroencephalography (EEG) functional connectivity. We further evaluated the reliability of the identified features for IA classification using various machine learning methods.
Methods
Ninety-two participants (42 with IA and 50 healthy controls (HCs)) were included. PLI and WPLI values for each participant were computed, and values exhibiting significant differences between the two groups were selected as features for the subsequent classification task.
Results
Support vector machine (SVM) achieved an 83% accuracy rate using PLI features and an improved 86% accuracy rate using WPLI features. t-test results showed analogous topographical patterns for both the WPLI and PLI. Numerous connections were identified within the delta and gamma frequency bands that exhibited significant differences between the two groups, with the IA group manifesting an elevated level of phase synchronization.
Conclusions
Functional connectivity analysis and machine learning algorithms can jointly distinguish participants with IA from HCs based on EEG data. PLI and WPLI have substantial potential as biomarkers for identifying the neurophysiological traits of IA.

Enhancing prosthetic hand control: A synergistic multi-channel electroencephalogram
Pooya Chanu Maibam, Dingyi Pei, Parthan Olikkal, Ramana Kumar Vinjamuri, Nayan M. Kakoty
Journal:

Wearable Technologies / Volume 5 / 2024

Published online by Cambridge University Press:

28 November 2024, e18
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Electromyogram (EMG) has been a fundamental approach for prosthetic hand control. However it is limited by the functionality of residual muscles and muscle fatigue. Currently, exploring temporal shifts in brain networks and accurately classifying noninvasive electroencephalogram (EEG) for prosthetic hand control remains challenging. In this manuscript, it is hypothesized that the coordinated and synchronized temporal patterns within the brain network, termed as brain synergy, contain valuable information to decode hand movements. 32-channel EEGs were acquired from 10 healthy participants during hand grasp and open. Synergistic spatial distribution pattern and power spectra of brain activity were investigated using independent component analysis of EEG. Out of 32 EEG channels, 15 channels spanning the frontal, central and parietal regions were strategically selected based on the synergy of spatial distribution pattern and power spectrum of independent components. Time-domain and synergistic features were extracted from the selected 15 EEG channels. These features were employed to train a Bayesian optimizer-based support vector machine (SVM). The optimized SVM classifier could achieve an average testing accuracy of 94.39 $ \pm $ .84% using synergistic features. The paired t-test showed that synergistic features yielded significantly higher area under curve values (p < .05) compared to time-domain features in classifying hand movements. The output of the classifier was employed for the control of the prosthetic hand. This synergistic approach for analyzing temporal activities in motor control and control of prosthetic hands have potential contributions to future research. It addresses the limitations of EMG-based approaches and emphasizes the effectiveness of synergy-based control for prostheses.

Moving to continuous classifications of bilingualism through machine learning trained on language production
M. I. Coco, G. Smith, R. Spelorzi, M. Garraffa
Journal:

Bilingualism: Language and Cognition / Volume 28 / Issue 1 / January 2025

Published online by Cambridge University Press:

24 May 2024, pp. 248-256
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Recent conceptualisations of bilingualism are moving away from strict categorisations, towards continuous approaches. This study supports this trend by combining empirical psycholinguistics data with machine learning classification modelling. Support vector classifiers were trained on two datasets of coded productions by Italian speakers to predict the class they belonged to (“monolingual”, “attriters” and “heritage”). All classes can be predicted above chance (>33%), even if the classifier's performance substantially varies, with monolinguals identified much better (f-score >70%) than attriters (f-score <50%), which are instead the most confusable class. Further analyses of the classification errors expressed in the confusion matrices qualify that attriters are identified as heritage speakers nearly as often as they are correctly classified. Cluster clitics are the most identifying features for the classification performance. Overall, this study supports a conceptualisation of bilingualism as a continuum of linguistic behaviours rather than sets of a priori established classes.

COVID-19 cluster identification and support vector machine classifier model construction using global healthcare and socio-economic features
Soumya Kanti Guha, Sandip Sadhukhan, Sougata Niyogi
Journal:

Epidemiology & Infection / Volume 151 / 2023

Published online by Cambridge University Press:

30 August 2023, e159
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Coronaviruses of the human variety have been the culprit of global epidemics of varying levels of lethality, including COVID-19, which has impacted more than 200 countries and resulted in 5.7 million fatalities as of May 2022. Effective clinical management necessitates the allocation of sufficient resources and the employment of appropriately skilled personnel. The elderly population and individuals with diabetes are at increased risk of more severe manifestations of COVID-19. Countries with a higher gross domestic product (GDP) typically exhibit superior health outcomes and reduced mortality rates. Here, we suggest a predictive model for the density of medical doctors and nursing personnel for 134 countries using a support vector machine (SVM). The model was trained in 107 countries and tested in 27, with promising results shown by the kappa statistics and ROC analysis. The SVM model used for predictions showed promising results with a high level of agreement between actual and predicted cluster values.

13 - Kernel Methods
William W. Hsieh, University of British Columbia, Vancouver
Book:

Introduction to Environmental Data Science

Published online:

23 March 2023

Print publication:

23 March 2023, pp 440-472
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Kernel methods provide an alternative family of non-linear methods to neural networks, with support vector machine being the best known among kernel methods. Almost all linear statistical methods have been non-linearly generalized by the kernel approach, including ridge regression, linear discriminant analysis, principal component analysis, canonical correlation analysis, and so on. The kernel method has also been extended to probabilisitic models, for example Gaussian processes.

Large-scale brain functional network abnormalities in social anxiety disorder
Xun Zhang, Xun Yang, Baolin Wu, Nanfang Pan, Min He, Song Wang, Graham J. Kemp, Qiyong Gong
Journal:

Psychological Medicine / Volume 53 / Issue 13 / October 2023

Published online by Cambridge University Press:

04 November 2022, pp. 6194-6204
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
Although aberrant brain regional responses are reported in social anxiety disorder (SAD), little is known about resting-state functional connectivity at the macroscale network level. This study aims to identify functional network abnormalities using a multivariate data-driven method in a relatively large and homogenous sample of SAD patients, and assess their potential diagnostic value.
Methods
Forty-six SAD patients and 52 demographically-matched healthy controls (HC) were recruited to undergo clinical evaluation and resting-state functional MRI scanning. We used group independent component analysis to characterize the functional architecture of brain resting-state networks (RSNs) and investigate between-group differences in intra-/inter-network functional network connectivity (FNC). Furtherly, we explored the associations of FNC abnormalities with clinical characteristics, and assessed their ability to discriminate SAD from HC using support vector machine analyses.
Results
SAD patients showed widespread intra-network FNC abnormalities in the default mode network, the subcortical network and the perceptual system (i.e. sensorimotor, auditory and visual networks), and large-scale inter-network FNC abnormalities among those high-order and primary RSNs. Some aberrant FNC signatures were correlated to disease severity and duration, suggesting pathophysiological relevance. Furthermore, intrinsic FNC anomalies allowed individual classification of SAD v. HC with significant accuracy, indicating potential diagnostic efficacy.
Conclusions
SAD patients show distinct patterns of functional synchronization abnormalities both within and across large-scale RSNs, reflecting or causing a network imbalance of bottom-up response and top-down regulation in cognitive, emotional and sensory domains. Therefore, this could offer insights into the neurofunctional substrates of SAD.

Volume of hippocampus-amygdala transition area predicts outcomes of electroconvulsive therapy in major depressive disorder: high accuracy validated in two independent cohorts
Jinping Xu, Wenfei Li, Tongjian Bai, Jiaying Li, Jinhuan Zhang, Qingmao Hu, Jiaojian Wang, Yanghua Tian, Kai Wang
Journal:

Psychological Medicine / Volume 53 / Issue 10 / July 2023

Published online by Cambridge University Press:

23 May 2022, pp. 4464-4473
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Background
Although many previous studies reported structural plasticity of the hippocampus and amygdala induced by electroconvulsive therapy (ECT) in major depressive disorder (MDD), yet the exact roles of both areas for antidepressant effects are still controversial.
Methods
In the current study, segmentation of amygdala and hippocampal sub-regions was used to investigate the longitudinal changes of volume, the relationship between volume and antidepressant effects, and prediction performances for ECT in MDD patients before and after ECT using two independent datasets.
Results
As a result, MDD patients showed selectively and consistently increased volume in the left lateral nucleus, right accessory basal nucleus, bilateral basal nucleus, bilateral corticoamygdaloid transition (CAT), bilateral paralaminar nucleus of the amygdala, and bilateral hippocampus-amygdala transition area (HATA) after ECT in both datasets, whereas marginally significant increase of volume in bilateral granule cell molecular layer of the head of dentate gyrus, the bilateral head of cornu ammonis (CA) 4, and left head of CA 3. Correlation analyses revealed that increased volume of left HATA was significantly associated with antidepressant effects after ECT. Moreover, volumes of HATA in the MDD patients before ECT could be served as potential biomarkers to predict ECT remission with the highest accuracy of 86.95% and 82.92% in two datasets (The predictive models were trained on Dataset 2 and the sensitivity, specificity and accuracy of Dataset 2 were obtained from leave-one-out-cross-validation. Thus, they were not independent and very likely to be inflated).
Conclusions
These results not only suggested that ECT could selectively induce structural plasticity of the amygdala and hippocampal sub-regions associated with antidepressant effects of ECT in MDD patients, but also provided potential biomarkers (especially HATA) for effectively and timely interventions for ECT in clinical applications.

Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
Part of
- Psychosis Spectrum Disorders
- EPA Editors' Choice
Lung-Hao Lee, Chang-Hao Chen, Wan-Chen Chang, Po-Lei Lee, Kuo-Kai Shyu, Mu-Hong Chen, Ju-Wei Hsu, Ya-Mei Bai, Tung-Ping Su, Pei-Chi Tu
Journal:

European Psychiatry / Volume 65 / Issue 1 / 2022

Published online by Cambridge University Press:

23 December 2021, e1
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy.
Methods
The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400).
Results
We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240.
Conclusions
The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia.

Multivariate classification provides a neural signature of Tourette disorder
Giuseppe A. Zito, Andreas Hartmann, Benoît Béranger, Samantha Weber, Selma Aybek, Johann Faouzi, Emmanuel Roze, Marie Vidailhet, Yulia Worbe
Journal:

Psychological Medicine / Volume 53 / Issue 6 / April 2023

Published online by Cambridge University Press:

03 November 2021, pp. 2361-2369
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Background
Tourette disorder (TD), hallmarks of which are motor and vocal tics, has been related to functional abnormalities in large-scale brain networks. Using a fully data driven approach in a prospective, case–control study, we tested the hypothesis that functional connectivity of these networks carries a neural signature of TD. Our aim was to investigate (i) the brain networks that distinguish adult patients with TD from controls, and (ii) the effects of antipsychotic medication on these networks.
Methods
Using a multivariate analysis based on support vector machine (SVM), we developed a predictive model of resting state functional connectivity in 48 patients and 51 controls, and identified brain networks that were most affected by disease and pharmacological treatments. We also performed standard univariate analyses to identify differences in specific connections across groups.
Results
SVM was able to identify TD with 67% accuracy (p = 0.004), based on the connectivity in widespread networks involving the striatum, fronto-parietal cortical areas and the cerebellum. Medicated and unmedicated patients were discriminated with 69% accuracy (p = 0.019), based on the connectivity among striatum, insular and cerebellar networks. Univariate approaches revealed differences in functional connectivity within the striatum in patients v. controls, and between the caudate and insular cortex in medicated v. unmedicated TD.
Conclusions
SVM was able to identify a neuronal network that distinguishes patients with TD from control, as well as medicated and unmedicated patients with TD, holding a promise to identify imaging-based biomarkers of TD for clinical use and evaluation of the effects of treatment.

Distinguishing hypochondriasis and schizophrenia using regional homogeneity: a resting-state fMRI study and support vector machine analysis
Kangyu Jin, Dongrong Xu, Zhe Shen, Guoxun Feng, Zhiyong Zhao, Jing Lu, Hailong Lyu, Fen Pan, Desheng Shang, Jingkai Chen, Shaohua Hu, Manli Huang
Journal:

Acta Neuropsychiatrica / Volume 33 / Issue 4 / August 2021

Published online by Cambridge University Press:

05 April 2021, pp. 182-190
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Objective:
A few former studies suggested that there are partial overlaps in abnormal brain structure and cognitive function between hypochondriasis (HS) and schizophrenia (SZ). But their differences in brain activity and cognitive function were unclear.
Methods:
Twenty-one HS patients, 23 SZ patients, and 24 healthy controls (HC) underwent resting-state functional magnetic resonance imaging (rs-fMRI) with the regional homogeneity analysis (ReHo), subsequently exploring the relationship between ReHo value and cognitive functions. The support vector machines (SVM) were used on effectiveness evaluation of ReHo for differentiating HS from SZ.
Results:
Compared with HC, HS showed significantly increased ReHo values in right middle temporal gyrus (MTG), left inferior parietal lobe (IPL), and right fusiform gyrus (FG), while SZ showed increased ReHo in left insula, decreased ReHo values in right paracentral lobule. Additionally, HS showed significantly higher ReHo values in FG, MTG, and left paracentral lobule, but lower in insula than SZ. The higher ReHo values in insula were associated with worse performance in MATRICS consensus cognitive battery (MCCB) in HS group. SVM analysis showed a combination of the ReHo values in insula and FG was able to satisfactorily distinguish the HS and SZ patients.
Conclusion:
Our results suggested that the altered default mode network (DMN), of which abnormal spontaneous neural activity occurs in multiple brain regions, might play a key role in the pathogenesis of HS, and the resting-state alterations of insula are closely related to cognitive dysfunction in HS. Furthermore, the combination of the ReHo in FG and insula was a relatively ideal indicator to distinguish HS from SZ.

Machine Learning for Speaker Recognition

Man-Wai Mak, Jen-Tzung Chien
Published online:

26 June 2020

Print publication:

19 November 2020
- Book
- - Get access
    
    Buy a print copy
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This book will help readers understand fundamental and advanced statistical models and deep learning models for robust speaker recognition and domain adaptation. This useful toolkit enables readers to apply machine learning techniques to address practical issues, such as robustness under adverse acoustic environments and domain mismatch, when deploying speaker recognition systems. Presenting state-of-the-art machine learning techniques for speaker recognition and featuring a range of probabilistic models, learning algorithms, case studies, and new trends and directions for speaker recognition based on modern machine learning and deep learning, this is the perfect resource for graduates, researchers, practitioners and engineers in electrical engineering, computer science and applied mathematics.

Improving speech emotion recognition based on acoustic words emotion dictionary
Wang Wei, Xinyi Cao, He Li, Lingjie Shen, Yaqin Feng, Paul A. Watters
Journal:

Natural Language Engineering / Volume 27 / Issue 6 / November 2021

Published online by Cambridge University Press:

10 June 2020, pp. 747-761
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
To improve speech emotion recognition, a U-acoustic words emotion dictionary (AWED) features model is proposed based on an AWED. The method models emotional information from acoustic words level in different emotion classes. The top-list words in each emotion are selected to generate the AWED vector. Then, the U-AWED model is constructed by combining utterance-level acoustic features with the AWED features. Support vector machine and convolutional neural network are employed as the classifiers in our experiment. The results show that our proposed method in four tasks of emotion classification all provides significant improvement in unweighted average recall.

Kinematics-based end-effector path control of a mobile manipulator system on an uneven terrain using a two-stage Support Vector Machine
Hitesh Jangid, Subham Jain, Beteley Teka, Rekha Raja, Ashish Dutta
Journal:

Robotica / Volume 38 / Issue 8 / August 2020

Published online by Cambridge University Press:

22 November 2019, pp. 1415-1433
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
A mobile manipulator system (MMS) consists of a robotic arm mounted on a mobile platform that is used in rescue and relief, space exploration, warehouse automation, etc. As the total system has 14 Degrees of Freedom (DOF), it does not have a closed-form inverse kinematics (IK) solution. A learning-based method is proposed, which uses the forward kinematics data to learn the IK relation for motion of an MMS on a rough terrain, using a one-class support vector machine (SVM) framework. Once trained, the model estimates the joint probability distribution of the MMS configuration and end-effector position. This distribution is used to find the MMS configuration for a given desired end-effector path. Past research using a Kohonen Self organizing map (KSOM) neural network-based open-loop control method has shown that the MMS deviates from its desired path while moving on an uneven terrain due to unknown disturbances such as wheel slip, slide, and terrain deformation. Therefore, a new sequential two-stage SVM-based end-effector path-tracking control scheme is proposed to control the end-effector path. In this scheme, the error in the end-effector path is continuously tracked with the help of a Microsoft Kinect 2.0 (Microsoft Regional Sales, Singapore 119968) and is sent as a feedback to the controller. Once the error reaches a threshold value, the error correction step of the controller gets activated to correct the error until the desired accuracy is reached. The effectiveness of the proposed approach is proved through extensive simulations and experiments conducted on 3D terrain in which it is shown that the end effector can follow the desired path with an average experimental error of around 2 cm between the desired and final corrected path.

Robust Wasserstein profile inference and applications to machine learning
Part of
- Limit theorems
- Linear inference, regression
Jose Blanchet, Yang Kang, Karthyek Murthy
Journal:

Journal of Applied Probability / Volume 56 / Issue 3 / September 2019

Published online by Cambridge University Press:

01 October 2019, pp. 830-857

Print publication:

September 2019
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
We show that several machine learning estimators, including square-root least absolute shrinkage and selection and regularized logistic regression, can be represented as solutions to distributionally robust optimization problems. The associated uncertainty regions are based on suitably defined Wasserstein distances. Hence, our representations allow us to view regularization as a result of introducing an artificial adversary that perturbs the empirical distribution to account for out-of-sample effects in loss estimation. In addition, we introduce RWPI (robust Wasserstein profile inference), a novel inference methodology which extends the use of methods inspired by empirical likelihood to the setting of optimal transport costs (of which Wasserstein distances are a particular case). We use RWPI to show how to optimally select the size of uncertainty regions, and as a consequence we are able to choose regularization parameters for these machine learning estimators without the use of cross validation. Numerical experiments are also given to validate our theoretical findings.

Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain-specific texts
Hou-Chiang Tseng, Berlin Chen, Tao-Hsing Chang, Yao-Ting Sung
Journal:

Natural Language Engineering / Volume 25 / Issue 3 / May 2019

Published online by Cambridge University Press:

05 April 2019, pp. 331-361
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Text readability assessment is a challenging interdisciplinary endeavor with rich practical implications. It has long drawn the attention of researchers internationally, and the readability models since developed have been widely applied to various fields. Previous readability models have only made use of linguistic features employed for general text analysis and have not been sufficiently accurate when used to gauge domain-specific texts. In view of this, this study proposes a latent-semantic-analysis (LSA)-constructed hierarchical conceptual space that can be used to train a readability model to accurately assess domain-specific texts. Compared with a baseline reference using a traditional model, the new model improves by 13.88% to achieve 68.98% of accuracy when leveling social science texts, and by 24.61% to achieve 73.96% of accuracy when assessing natural science texts. We then combine the readability features developed for the current study with general linguistic features, and the accuracy of leveling social science texts improves by an even higher degree of 31.58% to achieve 86.68%, and that of natural science texts by 26.56% to achieve 75.91%. These results indicate that the readability features developed in this study can be used both to train a readability model for leveling domain-specific texts and also in combination with the more common linguistic features to enhance the efficacy of the model. Future research can expand the generalizability of the model by assessing texts from different fields and grade levels using the proposed method, thus enhancing the practical applications of this new method.

Search Results

Refine search

Refine search

Actions for selected content:

26 results

8 - Supervised Learning

Summary

Implementation of support vector machines to classify abnormal neuronal response during emotion regulation at an individual level in patients with newly diagnosed bipolar disorder – and its association with subsequent functional changes and mood episodes

6 - Big Artificial Intelligence

Summary

4 - Gaussian processes and other kernel methods

Summary

5 - Improving the Readability of Japanese Translations of Natural Disaster Risks Through Predictive Automated English Information Design

Summary

Classification of internet addiction using machine learning on electroencephalography synchronization and functional connectivity

Enhancing prosthetic hand control: A synergistic multi-channel electroencephalogram

Moving to continuous classifications of bilingualism through machine learning trained on language production

COVID-19 cluster identification and support vector machine classifier model construction using global healthcare and socio-economic features

13 - Kernel Methods

Summary

Large-scale brain functional network abnormalities in social anxiety disorder

Volume of hippocampus-amygdala transition area predicts outcomes of electroconvulsive therapy in major depressive disorder: high accuracy validated in two independent cohorts

Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants

Multivariate classification provides a neural signature of Tourette disorder

Distinguishing hypochondriasis and schizophrenia using regional homogeneity: a resting-state fMRI study and support vector machine analysis

Machine Learning for Speaker Recognition

Improving speech emotion recognition based on acoustic words emotion dictionary

Kinematics-based end-effector path control of a mobile manipulator system on an uneven terrain using a two-stage Support Vector Machine

Robust Wasserstein profile inference and applications to machine learning

Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain-specific texts

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

26 results

Summary

Summary

Summary

Summary

Summary

Machine Learning for Speaker Recognition