Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-28T09:18:52.481Z Has data issue: false hasContentIssue false

Human DNA hijacking microbiota surveys: causes and consequences in colon related 16s rRNA amplicon sequencing

Published online by Cambridge University Press:  19 August 2025

Leandro Di Gloria
Affiliation:
Department of Experimental and Clinical Biomedical Sciences, University of Florence, Florence, Italy
Lorenzo Casbarra
Affiliation:
Department of Experimental and Clinical Biomedical Sciences, University of Florence, Florence, Italy
Marta Bastiani
Affiliation:
Department of Experimental and Clinical Biomedical Sciences, University of Florence, Florence, Italy
Gabriele Memoli
Affiliation:
Department of Experimental and Clinical Biomedical Sciences, University of Florence, Florence, Italy
Matteo Ramazzotti*
Affiliation:
Department of Experimental and Clinical Biomedical Sciences, University of Florence, Florence, Italy
*
Corresponding author: Matteo Ramazzotti; Email: matteo.ramazzotti@unifi.it

Abstract

The efficiency of polymerase chain reaction (PCR) decreases under suboptimal conditions, such as low template concentration combined with high concentrations of similar sequences. Under these circumstances, mis-priming can occur, leading to the generation of erroneous copies. Specifically, in 16S amplicon sequencing of human intestinal biopsy samples, host off-target sequences are frequently generated and subsequently sequenced, particularly when the commonly used V3-V4 primers are employed. This issue not only introduces errors in data interpretation but also results in the unnecessary consumption of sequencing depth. In response to this challenge, we analysed over 1,300 publicly available V3-V4 amplicon sequences related to the human colon, profiling the colon microbiota while elucidating the biases introduced by host off-targets. Briefly, our findings reveal that unaddressed host DNA contamination can lead to false bacterial identifications and obscure significant differences in microbiota composition. Furthermore, we identified human sequences on chromosomes 5, 11, and 17 as the main contributors to the majority of off-target sequences. Finally, we suggest practical approaches to mitigate this issue without altering the original protocol design, retaining the widely used V3–V4 primers. In particular, using a C3 spacer-modified nucleotide targeting the off-target sequence is here proposed as a promising strategy acting upstream of the off-target generation.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press in association with The Nutrition Society
Figure 0

Table 1. List of the V3-V4 16S colon microbiota sourced amplicon datasets analysed in this study

Figure 1

Table 2. Overview of the three PCR batches, detailing the samples and experimental settings

Figure 2

Figure 1. Bar plot representing the percentage abundances of the 24 most abundant genera across every analysed colon related bioproject. Each vertical bar represents the abundances within a sample. The samples names (along the X axes) were not included in the plot due to visibility issue caused by the high number of samples. The samples are clustered according to their nature or patient disease.

Figure 3

Table 3. Percentage abundances of the most abundant genera across every analysed colon related bioproject

Figure 4

Figure 2. Bar plots showing the off-target reads total count (Y axis) across the bioprojects, clustered according to the human chromosome of origin (X axis).

Figure 5

Figure 3. Logo of the consensus delineated at the 5′ end of the most abundant and recurrent off-target ASVs, excluding the forward primer sequence.

Figure 6

Figure 4. Graphical alignment of the 5′ consensus sequence, primers 338F and 341F, and the corresponding sequences from chromosomes 5, 11, and 17 and the mitochondrion (“Mitoc”). Nucleotides in bold denote mismatches in the alignment. The IUPAC letters in 5′ end consensus and primers are coloured in red. The blue line highlights the position after which the consensus sequence is located on the human chromosomes, excluding the primers sequences.

Figure 7

Figure 5. Fluorescence value curves (A) and melting curves (B) of the second PCR samples (listed in Table 2). The colours legend resumes the template type (negative, bacteria, human) and the PCR design (primers and eventual inhibitor). The label “5′ cons” refers to the oligonucleotide designed according to the off-target 5′ consensus and used as forward primer, while “Inhibitor” refers to the same oligonucleotide modified with the 3’ C3-spacer.

Supplementary material: File

Di Gloria et al. supplementary material

Di Gloria et al. supplementary material
Download Di Gloria et al. supplementary material(File)
File 2.9 MB