Hostname: page-component-77f85d65b8-8wtlm Total loading time: 0 Render date: 2026-03-28T18:45:26.940Z Has data issue: false hasContentIssue false

Predictive authoring for Brazilian Portuguese augmentative and alternative communication

Published online by Cambridge University Press:  27 May 2024

Jayr Pereira*
Affiliation:
Centro de Informática, Universidade Federal de Pernambuco, Recife, PE, Brazil Universidade Federal do Cariri (UFCA), Juazeiro do Norte, CE, Brazil
Rodrigo Nogueira
Affiliation:
Universidade Estadual de Campinas (UNICAMP), Campinas, SP, Brazil
Cleber Zanchettin
Affiliation:
Centro de Informática, Universidade Federal de Pernambuco, Recife, PE, Brazil
Robson Fidalgo
Affiliation:
Centro de Informática, Universidade Federal de Pernambuco, Recife, PE, Brazil
*
Corresponding author: Jayr Pereira; Email: jap2@cin.ufpe.br
Rights & Permissions [Opens in a new window]

Abstract

Individuals with complex communication needs often rely on augmentative and alternative communication (AAC) systems to have conversations and communicate their wants. Such systems allow message authoring by arranging pictograms in sequence. However, the difficulty of finding the desired item to complete a sentence can increase as the user’s vocabulary increases. This paper proposes using BERTimbau, a Brazilian Portuguese version of Bidirectional Encoder Representations from Transformers (BERT), for pictogram prediction in AAC systems. To fine-tune BERTimbau, we constructed an AAC corpus for Brazilian Portuguese to use as a training corpus. We tested different approaches to representing a pictogram for prediction: as a word (using pictogram captions), as a concept (using a dictionary definition), and as a set of synonyms (using related terms). We also evaluated the usage of images for pictogram prediction. The results demonstrate that using embeddings computed from the pictograms’ caption, synonyms, or definitions have a similar performance. Using synonyms leads to lower perplexity, but using captions leads to the highest accuracies. This paper provides insight into how to represent a pictogram for prediction using a BERT-like model and the potential of using images for pictogram prediction.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Example of a high-tech augmentative and alternative communication system using communication cards with pictograms from the Aragonese Center of Augmentative and Alternative Communication (ARASAAC). The screenshot depicts the interface of the reaact.com.br tool, where the user can easily select communication cards from the content grid (large bottom rectangle) and arrange them sequentially to construct sentences (e.g., cat wants). Additional functionalities are accessible through the buttons located in the right sidebar, enabling utilities such as text-to-speech functionality provided by the voice synthesizer.

Figure 1

Figure 2. Flow chart for model construction.

Figure 2

Figure 3. Sentence collection participants summary.

Figure 3

Figure 4. GPT-3 text prompt for sentence generation based on examples of human-composed sentences.

Figure 4

Figure 5. GPT-3 text prompt based on a controlled vocabulary for sentence generation.

Figure 5

Figure 6. The sentence Ele quer fazer xixi (he wants to pee) is represented using ARASAAC pictograms.

Figure 6

Table 1. Portguese dataset summary

Figure 7

Figure 7. Frequency distribution of words in the constructed corpus.

Figure 8

Figure 8. Frequency distribution of N-gram in the constructed corpus.

Figure 9

Figure 9. Word and n-gram frequency distribution in the human-composed corpus.

Figure 10

Figure 10. Coverage of the automatically generated corpus over the human-composed sentences.

Figure 11

Table 2. Evaluation results by model version descending sorted by ACC@1. ACC@{1, 9, 18, 25, 36} simulate the different grid sizes an augmentative and alternative communication (AAC) system can have

Figure 12

Figure 11. Example of predictions made by the captions model.