Assessing language abnormalities using NLP methods in speech excerpts of individuals at ultra-high risk for bipolar disorder

E. Kizilay; B. Arslan; E. Bora

doi:10.1192/j.eurpsy.2025.647

Published online by Cambridge University Press: 26 August 2025

Affiliations:
E. Kizilay*: Department of Neurosciences
B. Arslan: Department of Neurosciences
E. Bora: Department of Neurosciences; Department of Psychiatry, Dokuz Eylul University, Izmir, Türkiye; Department of Psychiatry, University of Melbourne and Melbourne Health, Melbourne, Australia
*Corresponding author.

Abstract

Introduction

Detecting individuals at ultra-high risk for bipolar disorder (UHR-BD) is crucial because it allows potential biomarkers, including language abnormalities, to be explored at the early stages of bipolar disorder (BD). Formal thought disorder (FTD) is an important symptom of BD and may be only mildly noticeable early in the illness. Automated methods have been shown to evaluate FTD in psychotic disorders and could also be applied to the speech of individuals at UHR-BD.

Objectives

This study aimed to investigate differences in language between individuals at UHR-BD and healthy controls (HC) using natural language processing (NLP) methods.

Methods

We collected speech samples from 20 individuals at UHR-BD and 20 HC as they described eight Thematic Apperception Test (TAT) pictures. The recordings were manually transcribed, and word2vec was used to convert the transcribed words into vectors. Semantic similarity between words was calculated with a moving-window approach, using window sizes of 5 to 10 words. Finally, the mean and variance of the similarities were determined.
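The moving-window step can be illustrated with a minimal Python sketch. The abstract does not say how similarity is aggregated within a window, so this sketch assumes the mean pairwise cosine similarity of all words in the window, with the window advanced one word at a time. The function names and the `embeddings` mapping (a plain word-to-vector dictionary standing in for a trained word2vec model) are hypothetical and are not taken from the study.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def window_similarities(vectors, window):
    """One score per window position: mean cosine similarity over all word pairs.

    Assumption: pairwise averaging inside each window; the study may have
    aggregated differently.
    """
    scores = []
    for start in range(len(vectors) - window + 1):
        chunk = vectors[start:start + window]
        scores.append(mean(cosine(u, v) for u, v in combinations(chunk, 2)))
    return scores


def summarize(tokens, embeddings, window):
    """Mean and sample variance of window scores; None if fewer than two windows fit."""
    # Words missing from the embedding table are skipped (an assumption).
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    scores = window_similarities(vectors, window)
    if len(scores) < 2:
        return None
    return mean(scores), variance(scores)
```

Under these assumptions, calling `summarize(tokens, embeddings, window)` for each `window` in `range(5, 11)` would give one mean and one variance per window size for a single transcript.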

Results

The variance of semantic similarity was higher in the UHR-BD group than in HC for window sizes 5 to 9 (p=0.004, p=0.005, p=0.01, p=0.02, and p=0.037, respectively). Mean similarity did not differ significantly between the groups.
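The abstract does not name the statistical test behind these p-values. As one possible illustration only, the sketch below compares two groups on a per-participant measure with a two-sided permutation test on the difference in group means. The choice of test, the function name, and its parameters are all assumptions rather than the authors' method.

```python
import random
from statistics import mean


def permutation_p_value(group_a, group_b, n_permutations=10000, seed=0):
    """Two-sided permutation test on the absolute difference in group means.

    Assumption: this is an illustrative stand-in; the study's actual test
    is not stated in the abstract.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs give the same p-value
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)  # random relabelling of participants
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_permutations + 1)
```

Applied once per window size to two hypothetical lists of per-participant similarity variances (one list per group), such a function would return one p-value per window, matching the layout of the results reported above.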

Conclusions

To our knowledge, this is the first study to evaluate language with NLP methods in individuals at UHR-BD. The variance of semantic similarity differed between the two groups, whereas the mean did not. This suggests that NLP methods may be useful for detecting FTD in individuals at UHR-BD.

Disclosure of Interest

None Declared

Information

Type: Abstract
Information: European Psychiatry, Volume 68, Special Issue S1: Abstracts of the 33rd European Congress of Psychiatry, April 2025, p. S296

DOI: https://doi.org/10.1192/j.eurpsy.2025.647
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
