Coreference resolution is an important part of natural language processing used in machine translation, semantic search, and various other information retrieval and understanding systems. One of the challenges in this field is the evaluation of resolution approaches. Many different metrics have been proposed, but most of them rely on certain assumptions, such as equivalence between different mentions of the same discourse-world entity, and do not account for the overrepresentation of certain types of coreference in the evaluation data. In this paper, a new coreference evaluation strategy that focuses on linguistic and semantic information is presented to address some of these shortcomings. The evaluation model was developed in the broader context of developing coreference resolution capabilities for the Lithuanian language; therefore, the experiment was also carried out using Lithuanian language resources, but the proposed evaluation strategy is not language-dependent.
The recent progress of deep learning techniques has produced models capable of achieving high scores on traditional Natural Language Inference (NLI) datasets. To understand the generalization limits of these powerful models, an increasing number of adversarial evaluation schemes have appeared. These works use a similar evaluation method: they construct a new NLI test set based on sentences with known logic and semantic properties (the adversarial set), train a model on a benchmark NLI dataset, and evaluate it on the new set. Poor performance on the adversarial set is identified as a model limitation. The weakness of this evaluation procedure is that it may only indicate a sampling problem: a machine learning model can perform poorly on a new test set because the text patterns present in the adversarial set are not well represented in the training sample. To address this problem, we present a new evaluation method, the Invariance under Equivalence test (IE test). The IE test trains a model with sufficient adversarial examples and checks the model’s performance on two equivalent datasets. As a case study, we apply the IE test to state-of-the-art NLI models using synonym substitution as the form of adversarial examples. The experiment shows that, despite their high predictive power, these models usually produce different inference outputs for equivalent inputs, and, more importantly, this deficiency cannot be solved by adding adversarial observations to the training data.
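The core check behind such an invariance test can be sketched in a few lines. The snippet below is an illustrative reconstruction, not the authors' implementation: the synonym-substitution rule, the `predict` callable, and the toy data are assumptions introduced only to show how one would measure how often a model's label survives an equivalence-preserving rewrite.

```python
# Hypothetical sketch of an invariance-under-equivalence check; the exact
# procedure, data, and function names are assumptions, not the paper's code.
from typing import Callable, Dict, List, Tuple

def synonym_variant(sentence: str, synonyms: Dict[str, str]) -> str:
    """Build an equivalent input by replacing words with synonyms (toy rule)."""
    return " ".join(synonyms.get(tok, tok) for tok in sentence.split())

def invariance_rate(predict: Callable[[str, str], str],
                    pairs: List[Tuple[str, str]],
                    synonyms: Dict[str, str]) -> float:
    """Fraction of premise/hypothesis pairs whose predicted NLI label is
    unchanged when the hypothesis is rewritten with synonyms."""
    agree = 0
    for premise, hypothesis in pairs:
        original = predict(premise, hypothesis)
        perturbed = predict(premise, synonym_variant(hypothesis, synonyms))
        agree += int(original == perturbed)
    return agree / len(pairs)

# Toy usage with a dummy classifier that always predicts "entailment".
if __name__ == "__main__":
    pairs = [("A man is sleeping.", "A person is asleep.")]
    synonyms = {"asleep": "sleeping", "person": "human"}
    print(f"invariance rate: {invariance_rate(lambda p, h: 'entailment', pairs, synonyms):.2f}")
```

A model that is truly invariant under equivalence would score close to 1.0 regardless of how the synonym dictionary is chosen.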
Named entity recognition (NER) aims to identify mentions of named entities in unstructured text and classify them into predefined named entity classes. While deep learning-based pre-trained language models help to achieve good predictive performance in NER, many domain-specific NER applications still call for a substantial amount of labeled data. Active learning (AL), a general framework for the label acquisition problem, has been used for NER tasks to minimize the annotation cost without sacrificing model performance. However, the heavily imbalanced class distribution of tokens introduces challenges in designing effective AL querying methods for NER. We propose several AL sentence query evaluation functions that pay more attention to potential positive tokens and evaluate these proposed functions with both sentence-based and token-based cost evaluation strategies. We also propose an improved data-driven normalization approach to penalize sentences that are too long or too short. Our experiments on three datasets from different domains reveal that the proposed approach reduces the number of annotated tokens while achieving prediction performance better than or comparable to that of conventional methods.
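To make the idea concrete, the sketch below shows one possible sentence query score of this flavor: it sums token uncertainty only over tokens the current model considers likely to be positive (non-"O"), and divides by a length penalty derived from a data-driven mean sentence length. The scoring rule, threshold, and penalty are illustrative assumptions, not the functions proposed in the paper.

```python
# Illustrative sketch of a positive-token-focused AL query score with
# length normalization; all specifics here are assumptions for exposition.
import math
from typing import Dict, List

def token_entropy(probs: Dict[str, float]) -> float:
    """Predictive entropy of one token's label distribution."""
    return -sum(p * math.log(p) for p in probs.values() if p > 0)

def sentence_query_score(token_probs: List[Dict[str, float]],
                         mean_len: float,
                         positive_threshold: float = 0.1) -> float:
    """Sum entropies of tokens with appreciable non-'O' probability mass,
    penalizing sentences whose length deviates from the corpus mean."""
    score = sum(token_entropy(probs) for probs in token_probs
                if 1.0 - probs.get("O", 0.0) > positive_threshold)
    length_penalty = 1.0 + abs(len(token_probs) - mean_len) / mean_len
    return score / length_penalty

# Toy usage: one confident "O" token and one uncertain entity token.
probs = [{"O": 0.95, "B-PER": 0.05},
         {"O": 0.40, "B-PER": 0.35, "B-ORG": 0.25}]
print(sentence_query_score(probs, mean_len=20.0))
```

Sentences with the highest scores would then be sent to annotators in each AL round.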
Recognition skills refer to the ability of a practitioner to rapidly size up a situation and know what actions to take. We describe approaches to training recognition skills through the lens of naturalistic decision-making. Specifically, we link the design of training to key theories and constructs, including the recognition-primed decision model, which describes expert decision-making; the data-frame model of sensemaking, which describes how people make sense of a situation and act; and macrocognition, which encompasses complex cognitive activities such as problem solving, coordination, and anticipation. This chapter also describes the components of recognition skills to be trained and defines scenario-based training.
This article reports on designing and implementing a multiclass sentiment classification approach to handle the imbalanced class distribution of Arabic documents. The proposed approach, sentiment classification of Arabic documents (SCArD), combines the advantages of a clustering-based undersampling (CBUS) method and an ensemble learning model to aid machine learning (ML) classifiers in building accurate models against highly imbalanced datasets. The CBUS method applies two standard clustering algorithms, K-means and expectation–maximization, to balance the ratio between the major and the minor classes by decreasing the number of major class instances and maintaining the number of minor class instances at the cluster level. The merit of the proposed approach is that it neither removes the majority class instances from the dataset nor injects the dataset with artificial minority class instances. The resulting balanced datasets are used to train two ML classifiers, random forest and updateable Naïve Bayes, to develop prediction data models. The best prediction data models are selected based on F1-score rates. We applied two techniques to test SCArD and generate new predictions from the imbalanced test dataset. The first technique uses the best prediction data models. The second technique uses the majority voting ensemble learning model, which combines the best prediction data models to generate the final predictions. The experimental results showed that SCArD is promising and outperformed the other comparative classification models based on F1-score rates.
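The following sketch illustrates the general shape of clustering-based undersampling with K-means only: the majority class is clustered and a per-cluster sample is retained so that the kept majority instances roughly match the minority class size. The cluster count, sampling rule, and function name are assumptions for illustration, not the SCArD implementation.

```python
# Minimal sketch of clustering-based undersampling (CBUS-style) with K-means;
# all parameters and the sampling rule are illustrative assumptions.
import numpy as np
from sklearn.cluster import KMeans

def cbus_undersample(X_major, X_minor, n_clusters=10, random_state=0):
    """Cluster the majority class and keep a per-cluster sample so the
    retained majority instances roughly match the minority class size."""
    km = KMeans(n_clusters=n_clusters, random_state=random_state, n_init=10)
    labels = km.fit_predict(X_major)
    per_cluster = max(1, len(X_minor) // n_clusters)
    rng = np.random.default_rng(random_state)
    keep_idx = []
    for c in range(n_clusters):
        members = np.flatnonzero(labels == c)
        take = min(per_cluster, len(members))
        keep_idx.extend(rng.choice(members, size=take, replace=False))
    return X_major[np.array(keep_idx)]  # combine with X_minor afterwards

# Toy usage with random vectors standing in for document features.
rng = np.random.default_rng(0)
X_major = rng.normal(size=(1000, 50))
X_minor = rng.normal(size=(100, 50))
print(cbus_undersample(X_major, X_minor).shape)
```

The balanced majority sample would then be concatenated with the minority class before training the downstream classifiers.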
Learner engagement is the foundation for effective training. This chapter describes two design principles for creating engaging augmented reality-based recognition skills training. The Immersion Principle describes ways in which training designers can create a sense of learner presence in the training through cognitive and physical engagement. The Hot Seat Principle describes a strategy to increase engagement by making the learner feel a sense of responsibility for training outcomes. This is particularly useful for team and small group training. The discussions of both principles include examples, theoretical links, and implications for people designing augmented reality training.
This chapter revisits each of the design principles, summarizing and drawing connections between them. Many of the principles are based on empirical evidence from traditional learning environments; a discussion of the boundary conditions of the design principles explores the extrapolation of this evidence to training recognition skills in dynamic, high-stakes environments. The chapter closes with a discussion of the contributions and challenges of augmented reality for training.
The Learn, Experience, Reflect framework is discussed as an overarching guide to training design. The Learn component focuses on the declarative information that learners need to fully learn from the Experience and Reflect portions of training; this often includes didactic presentation of information. The Experience component is generally scenario based and should be used to support learners in applying new knowledge and abstract concepts to realistic situations. The Reflect component employs strategies to encourage learners to reflect on what they have learned and how to apply their new knowledge in the future. Examples and links to theoretical models are provided for each component, along with discussions of how best to employ the capabilities of augmented reality in designing training elements for each component.
Despite recent breakthroughs in Machine Learning for Natural Language Processing, Natural Language Inference (NLI) problems still constitute a challenge. To this end, we contribute a new dataset that focuses exclusively on the factivity phenomenon; however, our task remains the same as in other NLI tasks, that is, the prediction of entailment, contradiction, or neutral (ECN). In this paper, we describe the LingFeatured NLI corpus and present the results of analyses designed to characterize the factivity/non-factivity opposition in natural language. The dataset contains entirely natural language utterances in Polish and gathers 2432 verb-complement pairs and 309 unique verbs. The dataset is based on the National Corpus of Polish (NKJP) and is a representative subcorpus with regard to the syntactic construction [V][że][cc]. We also present an extended version of the set (3035 sentences) containing more sentences with internal negations. We prepared deep learning benchmarks for both sets. We found that transformer BERT-based models working on sentences obtained relatively good results ($\approx 89\%$ F1 score on the base dataset). Even though better results were achieved using linguistic features ($\approx 91\%$ F1 score on the base dataset), this model requires more human labor (humans in the loop) because the features were prepared manually by expert linguists. BERT-based models that consume only the input sentences nevertheless capture most of the complexity of NLI/factivity. Complex cases in the phenomenon—for example, cases with entailment (E) and non-factive verbs—still remain an open issue for further research.
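A sentence-only ECN classifier of the kind benchmarked above can be sketched with a standard transformer stack. The checkpoint name, label order, and example sentence below are placeholders, not the paper's actual setup, and the classification head is untrained until fine-tuned on the corpus.

```python
# Hedged sketch of a sentence-only ECN (entailment/contradiction/neutral)
# classifier; checkpoint and hyperparameters are assumptions, not the paper's.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "allegro/herbert-base-cased"  # assumed Polish BERT checkpoint
LABELS = ["entailment", "contradiction", "neutral"]

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=3)

def classify(premise: str, hypothesis: str) -> str:
    """Predict an ECN label for a premise/hypothesis pair; before fine-tuning
    on the factivity corpus, the output is meaningless (structural sketch only)."""
    inputs = tokenizer(premise, hypothesis, return_tensors="pt",
                       truncation=True, padding=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return LABELS[int(torch.argmax(logits, dim=-1))]

print(classify("Jan wie, że Maria wyszła.", "Maria wyszła."))
```

Fine-tuning this model on the verb-complement pairs would reproduce the sentence-only benchmark setting described above, while the linguistic-feature model would instead feed expert-annotated features to a separate classifier.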
Building scenarios for training recognition skills in complex domains requires the addition of hard-to-detect cues and unexpected events. This chapter describes the Periphery Principle, which emphasizes the importance of including critical cues in nonobvious ways so trainees learn how to seek them, and the Perturbation Principle, which encourages training designers to incorporate unexpected events into training scenarios so trainees learn to adapt to novel situations. The chapter presents methods for identifying peripheral cues and important perturbations for a particular domain or task, and gives examples of critical cue inventories and complexity tables that can be useful tools for training designers.