Hostname: page-component-5db58dd55d-h5th4 Total loading time: 0 Render date: 2026-05-31T18:09:07.393Z Has data issue: false hasContentIssue false

Comparative analysis of the mobilome yields new insights into its diversity, dynamics and evolution in parasites of the Trypanosomatidae family

Published online by Cambridge University Press:  13 June 2025

Percy Omar Tullume-Vergara
Affiliation:
Department of Parasitology, Institute for Biomedical Sciences, University of Sao Paulo, Sao Paulo, Brazil
Adriana Ludwig
Affiliation:
Department of Evolution, Ecology and Behaviour, University of Liverpool, Liverpool, UK
Vyacheslav Yurchenko
Affiliation:
Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
Adriano Cappellazzo Coelho
Affiliation:
Departamento de Biologia Animal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, Brazil
Elizabeth Magiolo Coser
Affiliation:
Departamento de Biologia Animal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, Brazil
Marco Aurélio Krieger
Affiliation:
Vice Presidency of Production and Innovation in Health (VPPIS), Oswaldo Cruz Foundation (FIOCRUZ), Rio de Janeiro, Brazil
Marta M.G. Teixeira
Affiliation:
Department of Parasitology, Institute for Biomedical Sciences, University of Sao Paulo, Sao Paulo, Brazil
Jeffrey Jon Shaw
Affiliation:
Department of Parasitology, Institute for Biomedical Sciences, University of Sao Paulo, Sao Paulo, Brazil
Joao M.P. Alves*
Affiliation:
Department of Parasitology, Institute for Biomedical Sciences, University of Sao Paulo, Sao Paulo, Brazil
*
Corresponding author: Joao M.P. Alves; Email: jotajj@usp.br

Abstract

Transposable elements (TEs) have the ability to move and amplify inside the host genome, making them a pivotal source of genome plasticity. Presently, only 4 TE clades (all classified as Class I retrotransposons) have been identified in trypanosomatids. We predicted repeat content and manually curated TEs across the genomes of 57 trypanosomatids, shedding light on their proportions, diversity and dynamics. Our analysis yielded 214 TE consensus sequence models across the dataset, with abundance ranging from 0.1% to 7.2%. We found evidence of recent transposon activity in most species, with notable bursts in the Vickermania, Lafontella, Porcisia and Angomonas spp., along with Leishmania (Mundinia) chancei, L. (M.) orientalis and L. (M.) procaviensis. We confirmed that the 4 TE clades have colonized virtually all lineages of trypanosomatids, potentially playing a role in shaping their genome architecture. The effort of this work culminated in the establishment of the Trypanosomatid TE Database 1.0, a resource designed to standardize the TE annotation process that can serve as a foundation for future studies on trypanosomatid TEs.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press.
Figure 0

Figure 1. Schematic structure of autonomous elements from each known clade of TE in trypanosomatids. ORFs are shown as long rectangles, and the size of elements varies within clades (not drawn to scale). (A) CRE clade, containing: reverse transcriptase (RT), restriction enzyme-like endonuclease motif (RLE), a poly-A tail, 1 or 2 internal repeat regions (RPT); most copies present 1 or 2 zinc finger-like structures. (B) Autonomous INGI elements, containing: apurinic/apyrimidinic endonuclease (APE), RT and RNase h (RH), a highly conserved 77-nt sequence (pr77), a variable-length poly(A) tail. (C) VIPER (vestigial interposed retroelement) and (D) TATE (telomerase-associated transposable element) clades have similar structures, containing: a putative gag-like gene, 2 overlapped ORFs encoding for tyrosine recombinase (YR) and RT/RH, split direct repeats (SDRs), represented as arrows A1 at the 5′ end and B1, A2 and B2 at the 3′ end.

Figure 1

Table 1. Diversity of Class I TEs in trypanosomatid genomes

Figure 2

Figure 2. Overview of the steps to curate the final TE database and comparison among RE types. (A) Sankey plot displaying the raw RE libraries built by RepeatModeler (RM) and dnaPipeTe pipelines. For each phase, the grey portion indicates the number of consensus models removed by the filtering process, while the coloured segment depicts the number that continued to the next step. (1) Initial clustering reduced the number of family copies (5960 for RM and 1470 for dnaPipeTE) to streamline curation and minimize redundant TE models. (2) Multicopy genes (3797 for RM and 3432 for dnaPipeTE) were identified and removed using a homology-based approach. (3) Potential satellite sequences (415 for RM and 102 for dnaPipeTE) and ‘unknown’ sequences (1500 for RM and 5126 for dnaPipeTE) were excluded. (4) RNA-related families (171 for RM and 207 for dnaPipeTE) were detected and separated from the TE dataset. Following this pipeLINE, 436 TE sequences were manually curated, resulting in 214 canonical TE models. additionally, a small number of TEs were incorporated based on Tblastn results. (B) Violin plot representing the genome occupancy of 2 TE orders as DIRS, LINE, along with unknown from 57 assessed trypanosomatid genomes. Pairwise Wilcoxon rank test with Bonferroni correction was used for comparison among classes, where * (P-value < 0.01) indicates significance (Supplementary Table S3).

Figure 3

Figure 3. Contribution of REes and TEs to trypanosomatid genomes. (A) A cladogram displays the relationship of the 57 trypanosomatid species of 7 subfamilies used in this study. Monoxenous and dixenous parasites are marked by light and dark blue circles, respectively. (B) Genome sizes are shown in Mbp. (C) Proportion of repetitive and non-repetitive content in each species. (D) Total proportion of each TE order, DIRS (green) and LINE (blue).

Figure 4

Figure 4. Scatterplots of correlation between genome size vs REs and TEs [%] across 57 trypanosomatid genomes. (A) Linear regression plot between genome size and the percentage of REs. (B) Linear regression plot between assembly genome size and the percentage of TEs. (C) Correlation plot between genome size and the percentage of REs. (D) correlation plot between genome size and the percentage of TEs. Lines: linear regression, shaded area: confidence interval.

Figure 5

Figure 5. Phylogenetic relationships across 57 species of trypanosomatids. The bold letters show the position of the 7 subfamilies that currently constitute the family Trypanosomatidae. Bootstrap supports are shown at nodes, and maximum bootstrap support (100%) is shown with black circles. The distribution of the 4 retrotransposon clades is shown on the right, with coloured circles indicating the presence and white circles indicating the absence of a given element. Bodo saltans, a free-living phagotroph, was added as an outgroup. The scale bar indicates the number of substitutions per site.

Figure 6

Figure 6. TE age distribution in trypanosomatid genomes based on K2P divergence analysis. The y-axis displays the percentage of the genome (abundance) for different clades of TEs, and the x-axis shows the Kimura substitution level (k-value from 0 to 50) of copies with their respective consensus sequences. Likewise, a low degree of divergence indicates recent activity (<5%), whereas higher divergence scores suggest that the copies derive from older transposition events.

Supplementary material: File

Tullume-Vergara et al. supplementary material 1

Tullume-Vergara et al. supplementary material
Download Tullume-Vergara et al. supplementary material 1(File)
File 2.8 MB
Supplementary material: File

Tullume-Vergara et al. supplementary material 2

Tullume-Vergara et al. supplementary material
Download Tullume-Vergara et al. supplementary material 2(File)
File 403.9 KB
Supplementary material: File

Tullume-Vergara et al. supplementary material 3

Tullume-Vergara et al. supplementary material
Download Tullume-Vergara et al. supplementary material 3(File)
File 36 KB