Abstractive summarization is an approach to document summarization that is not limited to selecting sentences from the document but can generate new sentences as well. We address the two main challenges in abstractive summarization: how to evaluate the performance of a summarization model and what constitutes a good training objective. We first introduce new evaluation measures based on the semantic similarity of the input and the corresponding summary. The similarity scores are obtained from a BERTurk model fine-tuned with either a cross-encoder or a bi-encoder architecture. The fine-tuning is done on the Turkish Natural Language Inference and Semantic Textual Similarity benchmark datasets. We show that these measures correlate better with human evaluations than Recall-Oriented Understudy for Gisting Evaluation (ROUGE) scores and BERTScore. We then introduce a deep reinforcement learning algorithm that uses the proposed semantic similarity measures as rewards, together with a mixed training objective, in order to generate more natural summaries in terms of human readability. We show that training with a mixed training objective, rather than the maximum-likelihood objective alone, improves similarity scores.
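As a rough illustration of how such a bi-encoder similarity reward could be computed, the sketch below embeds the source document and a candidate summary and scores their cosine similarity. A public multilingual sentence-transformers checkpoint stands in for the fine-tuned BERTurk bi-encoder, and the mixing weight in the final comment is a hypothetical placeholder; neither reflects the authors' exact setup.

```python
# Hedged sketch of a bi-encoder similarity reward (not the authors' exact model).
from sentence_transformers import SentenceTransformer, util

# Placeholder checkpoint; the paper instead fine-tunes BERTurk on Turkish NLI/STS data.
model = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")

def similarity_reward(document: str, summary: str) -> float:
    """Cosine similarity between document and summary embeddings,
    usable as a scalar reward in the reinforcement learning objective."""
    doc_emb, sum_emb = model.encode([document, summary], convert_to_tensor=True)
    return util.cos_sim(doc_emb, sum_emb).item()

# Mixed objective, schematically: loss = gamma * rl_loss + (1 - gamma) * ml_loss,
# where gamma is an illustrative mixing weight, not a value from the paper.
```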
Privacy-preserving computing aims to protect the personal information of users while capitalizing on the possibilities unlocked by big data. This practical introduction for students, researchers, and industry practitioners is the first cohesive and systematic presentation of the field's advances over four decades. The book shows how to use privacy-preserving computing in real-world problems in data analytics and AI, and includes applications in statistics, database queries, and machine learning. It begins by introducing cryptographic techniques such as secret sharing, homomorphic encryption, and oblivious transfer, and then broadens its focus to more widely applicable techniques such as differential privacy, trusted execution environments, and federated learning. The book closes with privacy-preserving computing in practice, covering areas such as finance, online advertising, and healthcare, and offers a vision for the future of the field.
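To make one of the cryptographic building blocks mentioned above concrete, here is a minimal sketch of additive secret sharing; the field modulus, the three-party split, and the toy salary value are illustrative choices, not parameters taken from the book.

```python
# Additive secret sharing sketch: shares are random field elements that sum to the secret.
import secrets

PRIME = 2_147_483_647  # illustrative field modulus (a Mersenne prime)

def share(secret: int, n_parties: int = 3) -> list[int]:
    """Split a secret into n additive shares modulo PRIME."""
    shares = [secrets.randbelow(PRIME) for _ in range(n_parties - 1)]
    shares.append((secret - sum(shares)) % PRIME)
    return shares

def reconstruct(shares: list[int]) -> int:
    """Recover the secret by summing all shares modulo PRIME."""
    return sum(shares) % PRIME

salary = 42_000
pieces = share(salary)
print(pieces)               # any single piece alone reveals nothing useful about the salary
print(reconstruct(pieces))  # 42000, recovered only when all parties cooperate
```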
Morphological re-inflection generation is one of the most challenging tasks in the natural language processing (NLP) domain, especially with morphologically rich, low-resource languages like Arabic. In this research, we investigate the ability of transformer-based models to handle the singular-to-plural Arabic noun conversion task. We start by pretraining a Character-BERT model on a masked language modeling task using 1,134,950 Arabic words and then, in the first proposed setting, adopt a fusion technique to transfer the knowledge gained by the pretrained model to a full encoder–decoder transformer model. The second proposed setting fuses the output Character-BERT embeddings directly into the decoder. We then analyze and compare the performance of the two architectures and provide an interpretability section in which we track the model's attention patterns. We perform the interpretation on both the macro and micro levels, providing individual examples. Moreover, we provide a thorough error analysis showing the strengths and weaknesses of the proposed framework. To the best of our knowledge, this is the first effort in the Arabic NLP domain to develop an end-to-end fused-transformer deep learning model for the problem of singular-to-plural conversion.
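A rough sketch of the fusion idea in the second setting follows: pretrained character-level states (standing in for Character-BERT output) are projected and added to the decoder's own token embeddings before decoding. All module names, dimensions, and dummy tensors are illustrative, not the paper's actual configuration.

```python
# Hedged PyTorch sketch of fusing pretrained character-level states into a decoder input.
import torch
import torch.nn as nn

class FusedDecoderInput(nn.Module):
    def __init__(self, vocab_size=64, d_model=128, d_pretrained=256):
        super().__init__()
        self.token_embedding = nn.Embedding(vocab_size, d_model)
        self.fusion_proj = nn.Linear(d_pretrained, d_model)  # map pretrained dims to d_model

    def forward(self, token_ids, pretrained_states):
        # token_ids: (batch, seq_len); pretrained_states: (batch, seq_len, d_pretrained)
        fused = self.token_embedding(token_ids) + self.fusion_proj(pretrained_states)
        return fused  # would feed a standard transformer decoder downstream

fusion = FusedDecoderInput()
ids = torch.randint(0, 64, (2, 10))   # dummy target-side token ids
states = torch.randn(2, 10, 256)      # dummy Character-BERT hidden states
print(fusion(ids, states).shape)      # torch.Size([2, 10, 128])
```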
Machine translation technology has increasing applications in health and medical settings that involve communication and interaction between people from diverse language and cultural backgrounds. Machine translation tools offer low-cost and accessible solutions to help close the gap in cross-lingual health communication. The risks of machine translation need to be effectively controlled and properly managed to boost health professionals' confidence in this developing technology. This study applies machine learning to machine translation quality evaluation and, more importantly, to the prediction of clinically relevant machine translation errors based on linguistic features of the English source texts.
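As a purely illustrative sketch of this kind of feature-based error prediction, the snippet below trains a classifier on a few hypothetical source-text features; the feature set, toy data, and labels are invented for illustration and do not come from the study.

```python
# Illustrative only: predicting clinically relevant MT errors from source-text features.
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Hypothetical features per sentence: [length, passive clauses, medical terms, avg word length]
X = [
    [12, 0, 1, 4.2],
    [34, 2, 5, 5.8],
    [ 8, 0, 0, 3.9],
    [28, 1, 4, 5.1],
]
y = [0, 1, 0, 1]  # 1 = a clinically relevant error was observed in the MT output

clf = make_pipeline(StandardScaler(), LogisticRegression())
clf.fit(X, y)
print(clf.predict([[30, 2, 3, 5.5]]))  # flags a long, passive, term-dense sentence
```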
Access to health-related information is vital in our society. Official health websites provide essential and beneficial information to the general population. In particular, they can represent a crucial public service when this information is fundamental to fighting a new health threat, such as the COVID-19 pandemic. Yet, for these websites to achieve their ultimate informative goal, they need to ensure that their content is accessible to all users, especially people with disabilities. Many of these websites, especially those from institutions operating in multilingual countries, offer their content in several languages, which is, by definition, an accessibility best practice. However, the level of accessibility achieved might not always be the same in all the language versions available. In fact, previous studies focusing on other types of multilingual websites have shown that localized versions are usually less accessible than the original ones. In this chapter, we present a research study that examined seventy-four official multilingual health sites to understand the current situation in terms of accessibility compliance. In particular, the home pages in two languages (English, the original version, and Spanish, the localized version) were checked against two specific success criteria (SC) from the current Web Content Accessibility Guidelines (WCAG) standard, using both automatic and manual evaluation methods. We observed that although overall accessibility scores were similar, the localized pages obtained worse results in the two SC analyzed in more depth, namely the language and the title of the page. We contend that this finding could be explained by a lack of accessibility awareness or knowledge among those participating in the localization process. We thus advocate for web professionals with an interdisciplinary background who could create accessible multilingual sites, providing an inclusive web experience for all.
In emergency care settings, there is a crucial need for automated translation tools. We focus here on the BabelDr system, a speech-enabled fixed-phrase translator used to improve communication in emergency settings between doctors and allophone patients. The aim of the chapter is twofold. First, we assess whether a bidirectional version of the phraselator, which allows patients to answer doctors’ questions by selecting pictures from open-source databases, improves user satisfaction. Second, we evaluate pictograph usability in this context. Our hypotheses are that images will help to improve patient satisfaction and that multiple factors influence pictograph usability. Factors of interest include not only the comprehensibility of the pictographs per se, but also how the images are presented to the user with respect to their number and ordering. We show that most respondents prefer to use the interface with pictographs and that multiple factors influence participants’ ability to find a pictograph based on a written form, but that the comprehensibility of the individual pictographs is probably the most important.
This chapter explains significant speech and translation technologies for healthcare professionals. We first examine the progress of automatic speech recognition (ASR) and text-to-speech (TTS). Turning to machine translation (MT), we briefly cover fixed-phrase-based translation systems (“phraselators”), with consideration of their advantages and disadvantages. The major types of full (wide-ranging, relatively unrestricted) MT – symbolic, statistical, and neural – are then explained in some detail. As an optional bonus, we provide an extended explanation of transformer-based neural translation. We postpone discussion of the practical applications of speech and translation technologies in healthcare contexts to a separate chapter.
In this chapter, we review random encoding models that directly reduce the dimensionality of distributional data without first building a co-occurrence matrix. While matrix distributional semantic models (DSMs) output either explicit or implicit distributional vectors, random encoding models only produce low-dimensional embeddings, and emphasize efficiency, scalability, and incrementality in building distributional representations. We discuss the mathematical foundation for models based on random encoding, the Johnson-Lindenstrauss lemma. We introduce Random Projection, before turning to Random Indexing and BEAGLE, a random encoding model that encodes sequential information in distributional vectors. Then, we introduce a variant of Random Indexing that uses random permutations to represent the position of the context lexemes with respect to the target, similarly to BEAGLE. Finally, we discuss Self-Organizing Maps, a kind of unsupervised neural network that shares important similarities with random encoding models.
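As a small illustration of the Random Projection idea grounded in the Johnson-Lindenstrauss lemma, the sketch below multiplies toy high-dimensional count vectors by a random Gaussian matrix and checks that pairwise distances are roughly preserved. For clarity the counts are materialized here, whereas the models discussed in the chapter accumulate such projections incrementally; all dimensions and data are invented for illustration.

```python
# Hedged sketch of Random Projection (Johnson-Lindenstrauss style dimensionality reduction).
import numpy as np

rng = np.random.default_rng(0)
n_words, high_dim, low_dim = 100, 10_000, 300

counts = rng.poisson(0.05, size=(n_words, high_dim)).astype(float)  # toy sparse-ish counts
projection = rng.normal(0.0, 1.0 / np.sqrt(low_dim), size=(high_dim, low_dim))
embeddings = counts @ projection  # (100, 300) low-dimensional distributional vectors

# Pairwise distances before and after projection should be close.
d_original = np.linalg.norm(counts[0] - counts[1])
d_projected = np.linalg.norm(embeddings[0] - embeddings[1])
print(round(d_original, 2), round(d_projected, 2))
```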
Distributional semantics is the study of how distributional information can be used to model semantic facts. Its theoretical foundation has become known as the Distributional Hypothesis: Lexemes with similar linguistic contexts have similar meanings. This chapter presents the epistemological principles of distributional semantics. First, we explore the historical roots of the Distributional Hypothesis, tracing them in several different theoretical traditions, including European structuralism, American distributionalism, the later philosophy of Ludwig Wittgenstein, corpus linguistics, and behaviorist and cognitive psychology. Then, we discuss the place of distributional semantics in theoretical and computational linguistics.
The most recent development in distributional semantics is represented by models based on artificial neural networks. In this chapter, we focus on the use of neural networks to build static embeddings. Like random encoding models, neural networks incrementally learn embeddings by reducing the high dimensionality of distributional data without building an explicit co-occurrence matrix. Unlike the first generation of distributional semantic models (DSMs), also termed count models, the distributional representations produced by neural DSMs are the by-product of training the network to predict neighboring words, hence the name predict models. Since semantically similar words tend to co-occur with similar contexts, the network learns to encode similar lexemes with similar distributional vectors. After introducing the basic concepts of neural computation, we illustrate neural language models and their use to learn distributional representations. We then describe the most popular static neural DSMs, CBOW and Skip-Gram. We conclude the chapter with a comparison between count and predict models.
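For readers who want to experiment, the snippet below trains a tiny predict model with gensim's Word2Vec implementation, which covers both architectures mentioned above (sg=1 selects Skip-Gram, sg=0 selects CBOW); the toy corpus and hyperparameters are illustrative only.

```python
# Hedged sketch: training Skip-Gram embeddings on a toy corpus with gensim.
from gensim.models import Word2Vec

corpus = [
    ["the", "doctor", "examined", "the", "patient"],
    ["the", "nurse", "examined", "the", "patient"],
    ["the", "doctor", "prescribed", "a", "treatment"],
]

model = Word2Vec(sentences=corpus, vector_size=50, window=2, min_count=1, sg=1)
print(model.wv.most_similar("doctor", topn=2))  # nearest lexemes by cosine similarity
```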
Cross-language communication in healthcare is urgently needed. However, while development of the relevant linguistic technologies and related infrastructure has been accelerating, widespread adoption of translation- and speech-enabled systems remains slow. This chapter examines obstacles to adoption and directions for overcoming them, with emphasis on reliability and customization issues; discusses two major types of speech translation systems and their respective approaches to the same obstacles; surveys some healthcare-oriented communication systems, past and future; and concludes with an optimistic forecast for speech and translation applications in healthcare, tempered by due cautions.
This book offers state-of-the-art research on the design and evaluation of assistive translation tools and systems that facilitate cross-cultural and cross-lingual communication in health and medical settings. Using case studies, it illustrates important principles for designing assistive health communication tools: (1) detectability of errors, to boost the confidence of health professionals; (2) adaptability or customizability for health and medical domains; (3) inclusivity of translation modalities (written, speech, sign language) to serve people with disabilities; and (4) equality of accessibility standards for localised multilingual websites of health content. To summarize these key principles for the promotion of accessible and reliable translation technology, we use the acronym I-D-E-A.