
Design of a new multi-epitope vaccine against Brucella based on T and B cell epitopes using bioinformatics methods

Published online by Cambridge University Press:  25 May 2021

Zhiqiang Chen
Affiliation:
Department of Immunology, College of Basic Medicine, Xinjiang Medical University, Urumqi, 830011Xinjiang, China
Yuejie Zhu
Affiliation:
Department of Immunology, College of Basic Medicine, Xinjiang Medical University, Urumqi, 830011Xinjiang, China Department of Reproductive Assistance, Center for Reproductive Medicine, The First Affiliated Hospital of Xinjiang Medical University, No. 393, Xinyi Road, Urumqi, 830011Xinjiang, China
Tong Sha
Affiliation:
Department of Immunology, College of Basic Medicine, Xinjiang Medical University, Urumqi, 830011Xinjiang, China
Zhiwei Li
Affiliation:
Clinical Laboratory Center, Xinjiang Uygur Autonomous Region People's Hospital, Urumqi, 830001Xinjiang, China
Yujiao Li
Affiliation:
Department of Immunology, College of Basic Medicine, Xinjiang Medical University, Urumqi, 830011Xinjiang, China
Fengbo Zhang*
Affiliation:
Department of Clinical Laboratory, The First Affiliated Hospital of Xinjiang Medical University, No. 393, Xinyi Road, Urumqi, 830011Xinjiang, China State Key Laboratory of Pathogenesis, Prevention, Treatment of Central Asian High Incidence Diseases, the First Affiliated Hospital of Xinjiang Medical University, No. 393, Xinyi Road, Urumqi, 830011Xinjiang, China
Jianbing Ding*
Affiliation:
Department of Immunology, College of Basic Medicine, Xinjiang Medical University, Urumqi, 830011Xinjiang, China State Key Laboratory of Pathogenesis, Prevention, Treatment of Central Asian High Incidence Diseases, the First Affiliated Hospital of Xinjiang Medical University, No. 393, Xinyi Road, Urumqi, 830011Xinjiang, China
*
Author for correspondence: Jianbing Ding, E-mail: 1601379937@qq.com; Fengbo Zhang, E-mail: 765219598@qq.com

Abstract

Brucellosis is one of the most serious and widespread zoonotic diseases and seriously threatens human health and the national economy. In this study, bioinformatics methods were used to design a safe and effective multi-epitope vaccine based on the T/B dominant epitopes of Brucella outer membrane protein 22 (Omp22), outer membrane protein 19 (Omp19) and outer membrane protein 28 (Omp28). The amino acid sequences of the proteins were retrieved from the National Center for Biotechnology Information (NCBI) database, and the signal peptides were predicted by the SignalP-5.0 server. The surface accessibility and hydrophilic regions of the proteins were analysed with the ProtScale software, and the tertiary structure models of the proteins were predicted by the I-TASSER software and labelled with the UCSF Chimera software. COBEpro, SVMTriP and BepiPred were used to predict B cell epitopes of the proteins. SYFPEITHI, RANKpep and IEDB were employed to predict T cell epitopes of the proteins. The T/B dominant epitopes of the three proteins were combined with HEYGAEALERAG and GGGS linkers, and carrier sequences were linked to the N- and C-termini of the vaccine construct with EAAAK linkers. Finally, the tertiary structure and the physical and chemical properties of the multi-epitope vaccine construct were analysed. The allergenicity (similarity to known allergens), antigenicity and solubility scores of the multi-epitope vaccine construct were 7.37–11.30%, 0.788 and 0.866, respectively. The Ramachandran plot of the vaccine construct showed 96.0% of residues within the favoured and allowed regions. Collectively, our results showed that this multi-epitope vaccine construct has a high-quality structure and suitable characteristics, which may provide a theoretical basis for future laboratory experiments.

Type
Original Paper
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press

Introduction

Brucella is a Gram-negative intracellular pathogen that causes brucellosis [Reference Corbel1]. The genus is usually divided into 12 species, including six so-called classic Brucella species, namely B. melitensis, B. abortus, B. suis, B. canis, B. ovis and B. neotomae, and six more recently discovered Brucella species from wild mammals, amphibians and fish, namely B. microti, B. pinnipedialis, B. ceti, B. inopinata, B. papionis and B. vulpis. Within the genus, B. melitensis, B. abortus and B. suis are of major clinical significance [Reference Eisenberg2–Reference Kumar4]. Brucellosis in animals manifests as miscarriage and reduced fertility and is transmitted to humans by inhalation of aerosolised bacteria or ingestion of contaminated animal products. Clinical symptoms of human brucellosis include undulant fever, arthritis and general weakness [Reference Jia5, Reference Aslam6]. With current medical treatment, it is difficult to eliminate Brucella completely [Reference Zhang7]. Therefore, vaccination is an ideal way to prevent Brucella infection [Reference Masjedian Jezi8]. Currently, there are no Brucella vaccines for humans, and the live-attenuated vaccines designed for animals have many defects, including interference with serological testing and infectivity for humans [Reference Shojaei9]. Therefore, subunit vaccines, which avoid these risks while offering a good protective effect, have become a new focus of brucellosis research. Research on Brucella subunit vaccines mainly covers deoxyribonucleic acid (DNA) vaccines, lipopolysaccharide (LPS) vaccines and protein vaccines [Reference Sha10, Reference Li11]. With the rapid development of bioinformatics technology, epitopes of different antigens can be combined into a novel vaccine with good immune effects.

In previous studies, a series of different proteins from Brucella have been used to identify immunodominant antigens against Brucella infection, including outer membrane proteins [Reference Vishnu12], flagellar proteins [Reference Li13–Reference Terwagne15], L7/L12 ribosomal proteins [Reference Du, Li and Wang16] and Cu−Zn superoxide dismutase (Cu/Zn SOD) [Reference Pratt17]. The Omp22 protein is an immunodominant antigen belonging to the Omp25/Omp31 family of proteins. It is highly conserved among various species of Brucella and is related to the infectivity of Brucella. Studies have shown that the Omp22 protein is similar to the LPS of Brucella and induces an immune response in the body [Reference Martin-Martin18]. Omp19 is exposed at the cell surface of Brucella spp. and can be employed for protection against Brucella [Reference Tibor, Decelle and Letesson19]. Omp28 is also an important outer membrane protein of Brucella. It is highly conserved among various genera. It has been reported that the Omp28 peptide with CpG oligonucleotide as an adjuvant can induce an immune response mediated by IgG2a, indicating that Omp28 can induce the body to produce large amounts of IgG antibodies [Reference Kaushik20]. It is known that vaccines constructed from a single protein stimulate poor immune responses and that combinations of multiple proteins enhance the immune response to the vaccine [Reference Golshani21]. Therefore, in this study, a multi-epitope vaccine against Brucella was constructed based on the T/B epitopes of Omp22, Omp19 and Omp28. To verify the suitability of the vaccine construct, its tertiary structure, secondary structure, physical and chemical properties, solubility, antigenicity and allergenicity were analysed with various bioinformatics tools. The results indicated that the multi-epitope vaccine construct could be used as a candidate protein against Brucella.

Methods

Amino acid sequence of the protein

The amino acid sequences of Omp22, Omp19 and Omp28 were retrieved from the GenBank database (https://www.ncbi.nlm.nih.gov/genbank/).

Prediction of signal peptide

The SignalP-5.0 server [Reference Armenteros22] was used to predict the signal peptide of each protein sequence. In the output, SP (Sec/SPI) denotes the type of signal peptide predicted, CS denotes the cleavage site, and Other is the probability that the sequence has no signal peptide of any kind.

Identification of hydrophilic residues and surface accessibility

Immunoglobulins usually bind to the water-accessible regions of antigens. Thus, the predicted epitopes should ideally be located in highly hydrophilic regions with many accessible residues. The surface-accessible and hydrophilic regions of each protein were determined using the ProtScale software [Reference Wilkins23], and these areas were marked on the structure using the UCSF Chimera software [Reference Yang24].
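The sliding-window analysis that ProtScale performs can be illustrated with a minimal sketch. The text does not state which amino acid scale or window size was used, so the Kyte-Doolittle scale, the window of 9 residues and the cutoff of 1.6 below are assumptions chosen only for illustration, and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
# The choice of this scale is an assumption; the paper does not name one.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=9):
    """Return (centre_position, mean_hydropathy) for every full window.

    Positions are 1-based and refer to the central residue of the window.
    """
    if window < 1 or window > len(seq):
        return []
    half = window // 2
    profile = []
    for start in range(len(seq) - window + 1):
        segment = seq[start:start + window]
        mean = sum(KYTE_DOOLITTLE[aa] for aa in segment) / window
        profile.append((start + half + 1, mean))
    return profile


def hydrophobic_positions(seq, window=9, cutoff=1.6):
    """Return window centres whose mean hydropathy exceeds the cutoff."""
    return [pos for pos, score in hydropathy_profile(seq, window) if score > cutoff]
```

Residues flagged by such a profile would be candidates for exclusion from epitope selection, in the same spirit as the hydrophobic segments reported in the Results.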

Prediction of tertiary structure

The I-TASSER server [Reference Yang25] automatically generates high-quality tertiary structure models of protein molecules from amino acid sequences. I-TASSER predicts the structure and function of proteins using a hierarchical approach. Here, we used the confidence score (C score) to evaluate the quality of the predicted models. The C score ranges from −5 to 2, and a higher C score indicates higher confidence in the model. The template modelling (TM) score was used alongside the root mean-square deviation (RMSD) because the RMSD is sensitive to local errors. A TM score <0.17 indicates random similarity, and only a TM score >0.5 indicates a model with correct topology. These cutoff values do not depend on the length of the protein.
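To make the length independence of these cutoffs concrete, the following sketch evaluates the standard TM-score expression for a single, already superimposed pair of structures. It is an illustration only: the published TM-score is the maximum over all superpositions, I-TASSER reports an estimated value for its models rather than a measured one, and the function names and the lower bound of 0.5 Å on d0 are assumptions of this sketch.

```python
def tm_score(distances, target_length):
    """TM-score for one fixed superposition.

    distances: distance in angstroms between each aligned residue pair.
    target_length: number of residues in the target protein.
    The length-dependent scale d0 is what removes the dependence on protein size.
    """
    if target_length <= 15:
        raise ValueError("d0 is only defined for targets longer than 15 residues")
    d0 = 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8
    d0 = max(d0, 0.5)  # assumed floor to keep d0 positive for short chains
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length


def interpret_tm_score(score):
    """Apply the cutoffs quoted in the text (0.17 and 0.5)."""
    if score > 0.5:
        return "correct topology"
    if score < 0.17:
        return "random similarity"
    return "intermediate"
```

For example, `interpret_tm_score(0.36)` falls in the intermediate band between the two cutoffs, whereas a score of 0.64 or 0.84 is classified as a correct topology.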

B cell epitope prediction

To improve the accuracy of B cell epitope prediction, we used several prediction tools, including COBEpro (http://scratch.proteomics.ics.uci.edu/), SVMTriP (http://sysbio.unl.edu/SVMTriP/prediction.php) and BepiPred (http://www.cbs.dtu.dk/services/BepiPred-1.0/). Sequences that overlapped among the top 10 predictions of at least two tools were chosen as B cell dominant epitopes.
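One way to express this consensus rule is to count, for every residue position, how many tools place it inside a predicted epitope, and then keep contiguous stretches supported by at least two tools. The sketch below assumes each tool's top predictions are available as 1-based inclusive (start, end) intervals; the function name and the example intervals are hypothetical and are not taken from the tables of this study.

```python
from collections import Counter


def consensus_regions(predictions, min_tools=2):
    """Return merged (start, end) regions predicted by at least min_tools tools.

    predictions: dict mapping a tool name to a list of (start, end) intervals,
    with 1-based inclusive residue positions.
    """
    support = Counter()
    for intervals in predictions.values():
        covered = set()
        for start, end in intervals:
            covered.update(range(start, end + 1))
        # Each tool contributes at most one vote per position.
        for pos in covered:
            support[pos] += 1

    positions = sorted(p for p, n in support.items() if n >= min_tools)
    regions = []
    for pos in positions:
        if regions and pos == regions[-1][1] + 1:
            regions[-1][1] = pos  # extend the current region
        else:
            regions.append([pos, pos])  # start a new region
    return [tuple(r) for r in regions]


# Hypothetical intervals, for illustration only.
example = {
    "COBEpro": [(30, 45)],
    "SVMTriP": [(40, 59)],
    "BepiPred": [(100, 110)],
}
print(consensus_regions(example))
```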

T cell epitope prediction

T cells can be divided into CD4+ T cells and CD8+ T cells, both of which are restricted by the major histocompatibility complex (MHC) when recognising epitopes. The human MHC is called the human leucocyte antigen (HLA) gene complex. CD4+ T cells recognise antigenic epitopes of 9–22 amino acid residues presented by HLA-II molecules and differentiate into T helper cells after activation. CD8+ T cells recognise epitopes of 8–12 amino acid residues presented by HLA-I molecules and differentiate into cytotoxic T lymphocytes (CTLs) after activation. Therefore, CD4+ and CD8+ T cell epitopes had to be predicted separately. We selected the HLA-I alleles HLA-A*0201 and HLA-A*2402 and the HLA-II alleles HLA-DRB*0701 and HLA-DRB*0901, which were the four most common alleles in North China. Several online tools were used to predict T cell epitopes, including IEDB (http://www.iedb.org/home_v3.php), SYFPEITHI (http://www.syfpeithi.de/bin/mhcserver.dll/epitopeprediction) and RANKPEP (http://imed.med.ucm.es/Tools/rankpep.html). We listed the top 10 highest-scoring epitopes from each tool and selected the sequences that overlapped across all three tools as the T cell dominant epitopes of the proteins.
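As a rough illustration of the bookkeeping involved, the sketch below enumerates all candidate peptides of a fixed length from a sequence and intersects the top lists returned by several tools. Treating "overlapping" as exact peptide identity is a simplifying assumption, as are the function names and the optional offset argument used to shift positions when a signal peptide has been removed.

```python
def enumerate_peptides(seq, length, offset=0):
    """List every peptide of the given length as (start, end, peptide).

    Positions are 1-based and inclusive; offset is added to both positions,
    for example the number of residues removed from the N-terminus.
    """
    return [
        (i + 1 + offset, i + length + offset, seq[i:i + length])
        for i in range(len(seq) - length + 1)
    ]


def shared_by_all(*top_lists):
    """Return peptides present in every tool's top list, sorted alphabetically."""
    if not top_lists:
        return []
    common = set(top_lists[0])
    for peptides in top_lists[1:]:
        common &= set(peptides)
    return sorted(common)
```

A sequence of n residues yields n − 8 candidate 9-mers for HLA-I prediction; longer windows within the 9–22 residue range would be enumerated in the same way for HLA-II.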

Predicting immunogenicity and antigenicity of CD8+ T cell epitopes

Epitope/HLA complexes should be capable of eliciting strong immune responses. Therefore, we used the HLA-I immunogenicity prediction tool of the IEDB server (http://tools.immuneepitope.org/immunogenicity/) with default parameters. The antigenicity of all CD8+ T cell epitopes was analysed using the VaxiJen 2.0 server [Reference Doytchinova and Flower26] with a threshold of 0.5. Finally, we selected the CD8+ T cell epitopes that were both immunogenic and antigenic for the next step.
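The two-criterion filter can be sketched as follows. The antigenicity threshold of 0.5 comes from the text; treating an immunogenicity score above zero as positive is an assumption of this sketch, and the peptides and scores in the example are invented placeholders rather than results from this study.

```python
from dataclasses import dataclass


@dataclass
class EpitopeScore:
    peptide: str
    immunogenicity: float  # score from an HLA-I immunogenicity predictor
    antigenicity: float    # score from an antigenicity predictor


def select_epitopes(scores, antigenicity_threshold=0.5, immunogenicity_threshold=0.0):
    """Keep peptides that pass both the immunogenicity and antigenicity criteria."""
    return [
        s.peptide
        for s in scores
        if s.immunogenicity > immunogenicity_threshold
        and s.antigenicity >= antigenicity_threshold
    ]


# Placeholder peptides and scores, for illustration only.
candidates = [
    EpitopeScore("AAAAAAAAA", 0.12, 0.81),
    EpitopeScore("CCCCCCCCC", -0.05, 0.92),
    EpitopeScore("DDDDDDDDD", 0.30, 0.41),
]
print(select_epitopes(candidates))
```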

Multi-epitope vaccine sequence construction

The predicted T/B dominant antigen epitopes were joined by amino acid linkers. To enhance the immunogenicity of the epitope vaccine, conserved heparin-binding haemagglutinin (HBHA) sequences were added to the epitope sequences as a carrier [Reference Rana and Akhter27]. PADRE peptides are also known to induce CD4+ T cells and enhance the immune function of the vaccine construct [Reference Ghaffari-Nazari28].

Assessment of allergenicity, antigenicity and solubility

The allergenicity of the vaccine construct was predicted by SDAP [Reference Ivanciuc, Schein and Braun29]. SDAP predicts the cross-reactivity of candidate vaccines with known allergens according to the WHO allergenicity rules. We used the default parameters: an E cutoff of 0.01 for full-length sequence alignments, a similarity of more than 35% over a sliding window of 80 amino acids, and a match of 6 contiguous amino acids with known allergens. The antigenicity of the vaccine construct was predicted by the VaxiJen 2.0 server. This server relies on auto cross covariance (ACC) transformation and predicts antigens in an alignment-independent manner from the physicochemical properties of proteins. SOLpro [Reference Magnan, Randall and Baldi30] was used to predict the solubility of the vaccine construct. SOLpro adopts a two-stage SVM architecture based on multiple representations of the primary amino acid sequence. The final prediction is reported together with its corresponding probability (≥0.5), with an overall accuracy of 74%.
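The short-match criterion can be illustrated with a simple scan for shared 6-residue stretches, together with a check of a similarity value against the 35% threshold. This is a sketch of the rule only, not of SDAP itself: the function names are hypothetical, and the sliding-window alignment that produces the similarity percentage is not reproduced here.

```python
def shared_kmers(query, allergen, k=6):
    """Return (position, kmer) for each k-residue stretch of the query
    that also occurs exactly in the allergen. Positions are 1-based."""
    allergen_kmers = {allergen[i:i + k] for i in range(len(allergen) - k + 1)}
    hits = []
    for i in range(len(query) - k + 1):
        kmer = query[i:i + k]
        if kmer in allergen_kmers:
            hits.append((i + 1, kmer))
    return hits


def exceeds_similarity_threshold(similarity_percent, threshold=35.0):
    """True if a window similarity (in percent) is above the threshold."""
    return similarity_percent > threshold
```

Under this rule, a construct would be flagged as potentially cross-reactive if either function reports a hit against any known allergen.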

The secondary structure prediction

The proportions and distributions of Alpha helix, Beta turn, Random coil, Extended strand in vaccine construct sequence were predicted and analysed using SOPMA online analysis software [Reference Deleage31].

Prediction of various physicochemical properties

The online tool ProtParam from Expasy (http://www.expasy.org/protparam/) was used to analyse the physicochemical properties of the vaccine, including the theoretical isoelectric point, molecular weight, hydrophilicity, atomic composition and extinction coefficient. ProtParam calculates these physical and chemical properties from the pK values of the amino acids.

Construction of the tertiary structure of the vaccine construct

The I-TASSER online software was used to construct the tertiary structure of the vaccine, which was validated with a Ramachandran plot generated by the RAMPAGE web server (http://mordred.bioc.cam.ac.uk/~rapper/rampage.php). The Ramachandran plot shows the allowed and disallowed dihedral angles psi (ψ) and phi (ϕ) of amino acid residues. It is calculated according to the van der Waals radii of the side chains.
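For readers unfamiliar with how ϕ and ψ are obtained, the sketch below computes a dihedral angle from four atomic positions. It is a generic geometric routine written for illustration and is not the RAMPAGE implementation; ϕ of residue i would use the atoms C(i−1), N(i), Cα(i), C(i), and ψ would use N(i), Cα(i), C(i), N(i+1).

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dihedral(p0, p1, p2, p3):
    """Dihedral angle in degrees defined by four points, in the range (-180, 180]."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    b1 = tuple(c / norm for c in b1)
    # Project b0 and b2 onto the plane perpendicular to the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * c for c in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * c for c in b1))
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.degrees(math.atan2(y, x))
```

Each residue of a model contributes one (ϕ, ψ) pair, and the validation step then counts how many pairs fall within the favoured, allowed and outlier regions.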

Results

Amino acid sequence of protein

The Omp22 protein sequence was obtained from GenBank (Accession: AAS84601.1):
MFKRSITAAALGAAVMAFAGSAFAADMMGGTDYTYNDPVAAGPHDWSGNYVGAQVGGSSSKFPSPFASRTGALGGIVVGKNMQNGNIVFGAELEGNFAEAEHRIGHGGTLQQSWNGNAKGKVGYTFDKTLVYGTAGYGVTRFKAKDNTTSASGWEGGVLIGAGVEQALSGPLSVKAEYDFQRFNDVKSQVNGIEQRNNLKNHSIKAGLNYKF

The Omp19 protein sequence was obtained from GenBank (Accession: ERU25360.1):
MGISKASLLSLAAAGIVLAGCQSSRLGNLDNVSPPPPPAPVNAVPAGTVQKGNLDSPTQFPNAPSTDMSAQSGTQVASLPPASAPDLTPGAVAGVWNASLGGQSCKIATPQTKYGQGYRAGPLRCPGELANLASWAVNGKQLVLYDANGGTVASLYSSGQGRFDGQTTGGQAVTLSR

The Omp28 protein sequence was obtained from GenBank (Accession: AEF59021.1):
MNTRASNFLAASFSTIMLVGAFSLPAFAQENQMTTQPARIAVTGEGMMTASPDMAILNLSVLRQAKTAREAMTANNEAMTKVLDAMKKAGIEDRDLQTGGINIQPIYVYPDDKNNLKEPTITGYSVSTSLTVRVRELANVGKILDESVTLGVNQGGDLNLVNDNPSAVINEARKRAVANAIAKAKTLADAAGVGLGRVVEISELSRPPMPMPIARGQFRTMLAAAPDNSVPIAAGENSYNVSVNVVFEIK

Signal peptide of proteins

The signal peptides of Omp22, Omp19 and Omp28 were predicted separately by the SignalP-5.0. Signal peptide sequence of the Omp19 was MGISKASLLSLAAAGIVLA (Fig. 1a); signal peptide sequence of the Omp22 was MFKRSITAAALGAAVMAFAGSAFA (Fig. 1b); signal peptide sequence of the Omp28 was MNTRASNFLAASFSTIMLVGAFSLPAFA (Fig. 1c). All of the three signal peptide sequences were removed from the epitope prediction of Omp22, Omp19 and Omp28.
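Removing a predicted signal peptide amounts to trimming a known N-terminal prefix, as the following sketch shows for Omp19 using the sequence and signal peptide reported here. The function names are hypothetical, and the position-conversion helper is included only because the text does not state whether epitope positions are numbered on the full-length or the mature sequence.

```python
# Omp19 sequence (GenBank ERU25360.1) and its predicted signal peptide, as reported in the text.
OMP19 = (
    "MGISKASLLSLAAAGIVLAGCQSSRLGNLDNVSPPPPPAPVNAVPAGTVQKGNLDSPTQFPNAPSTDMSAQSGTQ"
    "VASLPPASAPDLTPGAVAGVWNASLGGQSCKIATPQTKYGQGYRAGPLRCPGELANLASWAVNGKQLVLYDANGG"
    "TVASLYSSGQGRFDGQTTGGQAVTLSR"
)
OMP19_SIGNAL = "MGISKASLLSLAAAGIVLA"


def remove_signal_peptide(full_sequence, signal_peptide):
    """Return the mature sequence after trimming the N-terminal signal peptide."""
    if not full_sequence.startswith(signal_peptide):
        raise ValueError("signal peptide is not an N-terminal prefix of the sequence")
    return full_sequence[len(signal_peptide):]


def to_full_length_position(mature_position, signal_length):
    """Convert a 1-based position on the mature sequence to full-length numbering."""
    return mature_position + signal_length


mature_omp19 = remove_signal_peptide(OMP19, OMP19_SIGNAL)
print(len(OMP19_SIGNAL), mature_omp19[:5])
```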

Fig. 1. Signal peptide of proteins using SignalP-5.0 analysis. SP (Sec/SPI): type of signal peptide predicted; CS: the cleavage site; Other: the probability that the sequence does not have any kind of signal peptide. (a) The signal peptide prediction of Omp19: MGISKASLLSLAAAGIVLA. (b) The signal peptide prediction of Omp22: MFKRSITAAALGAAVMAFAGSAFA. (c) The signal peptide prediction of Omp28: MNTRASNFLAASFSTIMLVGAFSLPAFA.

Accessible and hydrophilic region of the proteins

The accessibility and hydrophilic regions of the Omp22 protein residues and their locations in the three-dimensional structure are shown in Figure 2a and c. Amino acids 61–62 and 75–82 are considered highly inaccessible residues because their surface accessibility scores are below 5.0 (Fig. 2b). Amino acids 49–54 and 132–140 were estimated to be highly hydrophobic fragments (Fig. 2d). In the prediction of T/B epitopes, hydrophobic and inaccessible regions were excluded because they are unlikely to bind to specific antibodies.

Fig. 2. Solvent accessible and hydrophilic regions of Omp22. (a) The blue residues show the surface-accessible regions of Omp22 in the tertiary structure. (b) The accessible residues are displayed as a ProtScale plot. Residues exceeding the threshold (6.0) are considered surface-accessible residues. (c) The green residues show the hydrophilic regions of Omp22 in the tertiary structure. (d) Two highly hydrophobic areas (aa45–54 and aa132–140) are marked in brown on the ProtScale hydrophobicity plot.

The accessibility and hydrophobic residues of the Omp19 protein and their positions in the 3D structure are shown in Figure 3a and c. Amino acids 102–107 were considered highly inaccessible areas (Fig. 3b). The residues between amino acids 70–78 were predicted to be highly hydrophobic fragments (Fig. 3d).

Fig. 3. Solvent accessible and hydrophilic regions of Omp19. (a) The blue residues show the surface-accessible regions of Omp19 as tertiary structure. (b) The accessible residues are displayed as a ProtScale plot. The residues exceeding the threshold (6.0) will be considered surface accessible residues. (c) The green residues displayed the hydrophilic regions of Omp19 as tertiary structure. (d) A highly hydrophobic area (aa70–78) is marked in brown on the ProtScale hydrophobic plot.

The accessibility and hydrophobic residues of the Omp28 protein and their positions in the 3D structure are shown in Figure 4a and c. Amino acids 75–79 and amino acids 179–194 were considered highly inaccessible areas (Fig. 4b). The residues between amino acids 27–33 were predicted to be highly hydrophobic fragments (Fig. 4d).

Fig. 4. Solvent accessible and hydrophilic regions of Omp28. (a) The blue residues indicate the surface-accessible regions of Omp28 as tertiary structure. (b) The accessible residues are shown as a ProtScale plot. The residues exceeding the threshold (6.0) will be considered surface accessible residues. (c) The green residues displayed the hydrophilic regions of Omp28 as tertiary structure. (d) A highly hydrophobic area (aa27–33) is marked in brown on the ProtScale hydrophobic plot.

Tertiary structure of proteins

The I-TASSER online program was employed to model the 3D structures of the proteins (Fig. 5). The C score of the Omp22 prediction model was −0.56, and the TM score and RMSD of the model were 0.64 ± 0.13 and 6.4 ± 3.9 Å, respectively, so the model had high reliability (Fig. 5a). The prediction model of Omp19 was not highly reliable (Fig. 5b): its C score was −3.12, and its TM score and RMSD were 0.36 ± 0.12 and 12.0 ± 4.4 Å, respectively. The prediction model of Omp28 was very reliable, with a C score of 0.94 and a TM score and RMSD of 0.84 ± 0.08 and 3.7 ± 2.5 Å, respectively (Fig. 5c).

Fig. 5. Tertiary structure of protein. Multi-coloured ribbon and coil structure represents the helix, sheets and coiled secondary structure component of the 3D model obtained for the protein. (a) Omp22. (b) Omp19. (c) Omp28.

Prediction of B-cell epitopes

We used the online tools COBEpro, SVMTriP and BepiPred to predict the B cell epitopes and present the results in tabular form. The B cell epitopes of Omp22, Omp19 and Omp28 are shown in Tables 1–3, respectively.

Table 1. B cell epitopes of Omp22

Table 2. B cell epitopes of Omp19

Table 3. B cell epitopes of Omp28

Prediction of T cell epitopes

CD8+ T cell epitopes

Online software SYFPEITHI, IEDB and RANKpep were employed to analyse the CD8+ T cell epitope of the proteins. The analysis results of the Omp22 are shown in Tables 4–6. The analysis results of the Omp19 are shown in Tables 7–9. The analysis results of the Omp28 are shown in Tables 10–12.

Table 4. The CD8+ T cell epitopes of Omp22 by SYFPEITHI

Table 5. The CD8+ T cell epitopes of Omp22 by IEDB

Table 6. The CD8+ T cell epitopes of Omp22 by RANKPEP

Table 7. The CD8+ T cell epitopes of Omp19 by SYFPEITHI

Table 8. The CD8+ T cell epitopes of Omp19 by IEDB

Table 9. The CD8+ T cell epitopes of Omp19 by RANKPEP

Table 10. The CD8+ T cell epitopes of Omp28 by SYFPEITHI

Table 11. The CD8+ T cell epitopes of Omp28 by IEDB

Table 12. The CD8+ T cell epitopes of Omp28 by RANKPEP

CD4+ T cell epitopes

Online software SYFPEITHI, IEDB and RANKpep were employed to analyse the CD4+ T cell epitope of the proteins. The analysis results of the Omp22 are shown in Tables 13–15. The analysis results of the Omp19 are shown in Tables 16–18. The analysis results of the Omp28 are shown in Tables 19–21.

Table 13. The CD4+ T cell epitopes of Omp22 by SYFPEITHI

Table 14. The CD4+ T cell epitopes of Omp22 by IEDB

Table 15. The CD4+ T cell epitopes of Omp22 by RANKPEP

Table 16. The CD4+ T cell epitopes of Omp19 by SYFPEITHI

Table 17. The CD4+ T cell epitopes of Omp19 by IEDB

Table 18. The CD4+ T cell epitopes of Omp19 by RANKPEP

Table 19. The CD4+ T cell epitopes of Omp28 by SYFPEITHI

Table 20. The CD4+ T cell epitopes of Omp28 by IEDB

Table 21. The CD4+ T cell epitopes of Omp28 by RANKPEP

Overlapping epitopes

These results were compared to find selected sequences with overlapping regions that were identified as dominant B and T epitopes of proteins (Tables 22–24).

Table 22. The dominant linear B and T epitopes of Omp22

Table 23. The dominant linear B and T epitopes of Omp19

Table 24. The dominant linear B and T epitopes of Omp28

Class I immunogenicity and antigenic prediction

The immunogenicity and antigenicity of the CD8+ T cell dominant epitopes were analysed with the MHC I immunogenicity prediction tool of the IEDB server and the VaxiJen 2.0 server, respectively. Epitopes that were positive for both immunogenicity and antigenicity were retained for further analysis. The results are shown in Tables 22–24.

Design of the multi-epitope vaccine construct

To construct the final chimaeric subunit vaccine sequence, the predicted epitope of B cells was used as a template and compared with the T cell epitope. Epitopes whose sequences overlap with the B cell epitope were preferentially chosen for the final vaccine construct (Tables 25–27). Finally, six sequences were selected as constructs, including sequences 120–138, 154–174 of Omp22, and epitope sequences 24–47, 109–130, 142–153 of Omp19 and sequence 41–73 of Omp28. These epitopes were connected by amino acid linkers. HEYGAEALERAG and GGGS linkers bind the T-epitopes and the B-epitopes, while the carrier sequences connect the N-terminal and C-terminal via the EAAAK linker (Table 28).

Table 25. Comparative analysis of all predicted B cell, HLA-I and HLA-II epitopes of Omp22

Table 26. Comparative analysis of all predicted B cell, HLA-I and HLA-II epitopes of Omp19

Table 27. Comparative analysis of all predicted B cell, HLA-I and HLA-II epitopes of Omp28

Table 28. Predicted allergenicity, antigenicity and solubility of the vaccine construct
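The assembly of carriers, linkers and epitopes into a single sequence can be sketched as simple string concatenation. The linker sequences below are those named in the text; the order of the epitopes, the assignment of linkers to junctions and the carrier sequences themselves are given in Table 28 and are not reproduced here, so the arguments in the example are placeholders and the function name is hypothetical.

```python
def assemble_construct(n_carrier, epitopes, c_carrier, epitope_linkers, carrier_linker="EAAAK"):
    """Concatenate carrier, linkers and epitopes into one construct sequence.

    epitope_linkers must contain one linker per junction between epitopes,
    that is len(epitopes) - 1 entries (for example "GGGS" or "HEYGAEALERAG").
    """
    if not epitopes:
        raise ValueError("at least one epitope is required")
    if len(epitope_linkers) != len(epitopes) - 1:
        raise ValueError("need exactly one linker per epitope junction")
    body = epitopes[0]
    for linker, epitope in zip(epitope_linkers, epitopes[1:]):
        body += linker + epitope
    return n_carrier + carrier_linker + body + carrier_linker + c_carrier


# Placeholder carriers and epitopes, for illustration only.
construct = assemble_construct(
    "MMM", ["AAA", "TTT", "VVV"], "WWW", ["GGGS", "HEYGAEALERAG"]
)
print(construct, len(construct))
```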

Allergenicity, antigenicity and solubility evaluation

The online software SDAP was employed to predict the allergenicity of the vaccine construct. The similarity (%) between the sequence of the construct and the most similar template was between 7.37 and 11.30, which was lower than the threshold of 35%, so the vaccine construct was considered non-allergenic. The antigenicity of this vaccine sequence was analysed using VaxiJen 2.0 service software. The antigen value of 0.788 was predicted, which was higher than the threshold value of 0.5, so the vaccine construct was considered to have good antigenicity. The vaccine construct was soluble with SOLpro SVM value 0.866, which was considered to have good solubility (Table 28).

Prediction of the secondary structure of the vaccine construct

The secondary structure of the vaccine construct was analysed with the SOPMA server. The results showed that the structure comprised 67.32% alpha helix, 7.62% extended strand, 3.93% beta turn and 21.13% random coil (Fig. 6).

Fig. 6. Analysis of the secondary structure of the vaccine construct by SOPMA. The sequence length of the vaccine construct is 407 amino acids. The blue h is alpha helix and accounts for 67.32%, the red e is extended strand and accounts for 7.62%, the yellow c is random coil and accounts for 21.13%, and the green t is beta turn and accounts for 3.93%.
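As a quick consistency check, the percentages given in the caption of Fig. 6 can be converted back into residue counts for a 407-residue construct. The short sketch below does this; the function name is hypothetical, and rounding to the nearest integer is an assumption about how the percentages were derived.

```python
LENGTH = 407
PERCENTAGES = {
    "alpha helix": 67.32,
    "extended strand": 7.62,
    "random coil": 21.13,
    "beta turn": 3.93,
}


def residue_counts(length, percentages):
    """Convert percentage composition into whole-residue counts."""
    return {state: round(length * pct / 100) for state, pct in percentages.items()}


counts = residue_counts(LENGTH, PERCENTAGES)
print(counts, sum(counts.values()))
```

The four counts (274, 31, 86 and 16 residues) add up to 407, so these percentages are mutually consistent and account for the whole sequence.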

Physicochemical analysis of the vaccine construct

The vaccine construct was made up of 407 amino acids and had a molecular weight of approximately 43 kDa. The molecular formula was C1855H2947N551O612S4. The theoretical pI was 4.95, and the construct contained 45 positively charged residues (K, R) and 61 negatively charged residues (D, E). The instability index was 29.87 (below the threshold of 40 for stable proteins), so the construct was predicted to be a stable protein. The grand average of hydropathicity (GRAVY) was −0.439 (GRAVY ranges from −2 to 2, and negative values indicate hydrophilic proteins), so the construct was classified as hydrophilic. These results suggest that the vaccine construct has properties favourable for initiating an immunogenic response.
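The reported molecular weight can be cross-checked against the molecular formula with standard average atomic masses. The sketch below does this and also compares the counts of charged residues; the function name is hypothetical, and the atomic masses are rounded standard values rather than the exact figures used by ProtParam.

```python
import re

# Rounded standard average atomic masses in daltons.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "S": 32.06}


def formula_mass(formula):
    """Sum atomic masses for a formula such as 'C1855H2947N551O612S4'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_MASS[element] * (int(count) if count else 1)
    return total


mass = formula_mass("C1855H2947N551O612S4")
net_charged_residues = 45 - 61  # (K + R) minus (D + E), as reported in the text
print(round(mass / 1000, 1), net_charged_residues)
```

The formula gives a mass of about 42.9 kDa, in line with the reported value of approximately 43 kDa, and the excess of 16 acidic residues is consistent with an acidic theoretical pI.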

Prediction and verification of the 3D structure of the vaccine construct

The C-score of the three-dimensional (3D) model was −1.52, and the TM score and RMSD of the model were 0.53 ± 0.15 and 10.4 ± 4.6 Å, respectively (Fig. 7a). The structure was validated by Ramachandran plot analysis, which showed 86.4%, 9.6% and 4.0% of residues in the favoured, allowed and outlier regions, respectively (Fig. 7b).

Fig. 7. The 3D structure prediction and validation of the vaccine construct. (a) The 3D structure of the model construct. (b) Ramachandran plot of the vaccine construct, showing 96.0% of residues in the favoured and allowed regions. The Ramachandran plot takes the angles phi and psi as the abscissa and ordinate. Phi is the rotation angle about the N−Cα bond on the left side of the α carbon in a peptide unit, and psi is the rotation angle about the Cα−C bond on the right side of the α carbon. The area inside the yellow contour is fully allowed, the area inside the blue contour is allowed and the area outside the blue contour is not allowed. When more than 90% of the points fall inside the blue and yellow contours, the tertiary structure of the model conforms to the rules of stereochemistry.

Discussion

Brucellosis is a serious infectious disease with a low cure rate and complex clinical symptoms. At present, vaccination is considered the most effective measure to prevent brucellosis [Reference Hou, Liu and Peng32]. In this study, we used bioinformatics methods to design a multi-epitope vaccine construct, which provided detailed insights into the initial stages of vaccine development.

Based on previous studies, Omp22, Omp19 and Omp28 were evaluated as having good immunogenicity and inducing immune protection in vivo. Moreover, Omp22 and Omp28 are highly conserved, stable and not easily degraded, which provides favourable conditions for later vaccine construction. Therefore, it is of great significance to select Omp22, Omp19 and Omp28 as candidate proteins for constructing a multi-epitope vaccine against Brucella. As far as we know, no multi-epitope vaccine design based on these three candidate proteins has been reported.

The SignalP-5.0 server was employed to predict the signal peptides of Omp22, Omp19 and Omp28. We removed the signal peptide sequences of Omp22, Omp19 and Omp28, namely 1–24 aa (MFKRSITAAALGAAVMAFAGSAFA), 1–19 aa (MGISKASLLSLAAAGIVLA) and 1–28 aa (MNTRASNFLAASFSTIMLVGAFSLPAFA), respectively. Signal peptides are usually located at the start of the translated region of a protein and affect protein expression. To predict T/B epitopes more accurately, it was necessary to remove the signal peptides. It has been found that highly hydrophilic regions of an antigen are conducive to interaction with antibody binding sites, and that the more accessible residues there are on the surface of the antigen, the more readily the antibody binds [Reference Pourseif33]. Therefore, we screened out the highly hydrophobic and inaccessible regions of these outer membrane proteins to ensure that the epitopes were located on the more hydrophilic and accessible residues (Figs 1 and 2).

The key to preparing an epitope vaccine is to obtain the epitopes of the relevant antigen [Reference Li34]. Therefore, in this study, the T cell and B cell epitopes of the three candidate proteins were screened with a variety of epitope prediction software, which improved the accuracy of epitope prediction [Reference Wang35], and dominant epitopes recognised by both T and B cells were then selected. The immune response against Brucella mainly depends on activated T cells. However, many vaccines can only induce B cell immunity [Reference Verma36]. CD8+ T cells can effectively lyse and kill infected cells, thus exposing Brucella to the extracellular environment and triggering other bactericidal mechanisms. Therefore, we analysed the antigenicity and immunogenicity of the CD8+ T cell epitopes to ensure that the epitope vaccine can effectively activate CD8+ T cells. Considering that CD4+ T cells are needed to induce an appropriate antibody response, we also included CD4+ T cell epitopes in the vaccine construct. Because our multi-epitope vaccine was designed based on T/B cell epitopes, it may selectively activate specific B cells, CTLs and T helper cells to achieve a fully protective and sustained immune response. Finally, six dominant epitopes were identified by a series of comparisons. There were two dominant epitopes (120–138, 154–174) from Omp22, three dominant epitopes (24–47, 109–130, 142–153) from Omp19 and one dominant epitope (41–73) from Omp28.

As shown in Table 28, our vaccine construct is composed of HBHA and PADRE (as carriers) located at the N- and C-terminal ends of the vaccine sequence and six sets of T/B cell epitopes in the middle of the construct, which are connected to each other by appropriate linkers. GGGS and HEYGAEALERAG cleavable linkers were used to separate these domains from each other to enhance the expression of the epitopes. These linkers have two key roles in the structure of epitope vaccines: firstly, to prevent the generation of junctional epitopes (new epitopes) in the designed epitope vaccine; secondly, to promote immune processing and presentation of HLA-II binding epitopes [Reference Nezafat37]. In addition, glycine (G) and serine (S) are commonly used as the component amino acids of linker sequences [Reference van Rosmalen, Krom and Merkx38]. Glycine and serine are small, flexible amino acids (glycine is the smallest and has no chiral carbon), so linkers built from them can be placed between epitope sequences without affecting the conformation and function of either side. Moreover, because of the functional characteristics of HBHA, the EAAAK linker was used to connect HBHA to the N-terminus of the vaccine construct as a carrier, which minimises the interaction between HBHA and the other vaccine fragments and provides better separation [Reference Arai39]. The PADRE sequence has been reported to reduce the effect of human HLA-DR polymorphism and can enhance the long-term immune response by inducing CD4+ T cells [Reference Solanki and Tiwari40]. Therefore, we added the PADRE sequence to the vaccine construct.

Finally, we constructed a multi-epitope vaccine with a length of 407 amino acids. The physicochemical, structural and immunological properties of the vaccine construct were predicted by various bioinformatics methods. It is necessary to examine any possible allergenicity at an early stage of vaccine design [Reference Validi41]. Our vaccine construct was predicted to be non-allergenic by SDAP, which supports its suitability as a candidate vaccine. The multi-epitope vaccine construct showed a high antigenicity score on the VaxiJen 2.0 server. The predicted solubility score of the construct (0.886) was above the 0.5 threshold, indicating that the construct is likely to be highly soluble during heterologous expression in E. coli. The solubility of a recombinant protein in E. coli is key to many biochemical and functional studies. In short, the vaccine construct is predicted to be soluble, non-allergenic and antigenic.

The molecular weight (MW) of the final protein is estimated to be 43 kDa. The estimated theoretical pI was 4.95, indicating that the vaccine construct is acidic. The predicted instability index was 29.87, below the threshold of 40 commonly used to classify a protein as stable, which indicated that the protein should be stable after expression and further supported its feasibility. The predicted GRAVY score was −0.439, which shows that the protein would be a hydrophilic construct. These results suggest that the vaccine construct has favourable characteristics for initiating an immune response. Analysis of the secondary structure showed that the protein mainly consisted of alpha helices (67.32%), with 7.62% extended strand, and these have been identified as important 'structural antigen' types. The 3D structure of the vaccine construct was modelled by I-TASSER. RMSD and TM-score are indices used to evaluate the reliability and accuracy of the predicted model. A TM-score above 0.5 generally indicates a model with correct topology, and the C-score was used to indicate its confidence. The expected TM-score of 0.53 ± 0.15 supported the accuracy of the model. The chimaeric structure displayed appropriate characteristics based on the Ramachandran plot. Ramachandran plot analysis indicated that 96% of the residues are located in the favoured and allowed regions, with only 4% of residues in the outlier region. This indicated that the quality of the whole model is acceptable. In this study, a multi-epitope vaccine construct against brucellosis was designed from immunodominant epitopes of the Brucella antigens Omp22, Omp19 and Omp28 using a combination of online bioinformatics servers. However, the protective efficacy of the vaccine construct has not yet been confirmed in animal models. Further in vivo and in vitro studies will be designed in the future to assess the potency of the vaccine construct.
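For readers unfamiliar with the GRAVY score, it is conventionally the mean Kyte–Doolittle hydropathy value over all residues, with negative values indicating a hydrophilic sequence. The sketch below illustrates that calculation on the short linker sequences named in this article; it is not the server used in the study, the function name `gravy` is hypothetical, and the full vaccine sequence is not reproduced here, so the reported value of −0.439 is not recomputed.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue.

    Raises ValueError for an empty sequence and KeyError for
    non-standard residue letters.
    """
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Linkers named in the text; both come out hydrophilic (negative GRAVY).
print(round(gravy("GSSS"), 3))   # -0.7
print(round(gravy("EAAAK"), 3))  # -0.4
```

The same function applied to a full construct sequence would give a value comparable with the GRAVY score reported by standard protein parameter servers.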

In conclusion, this study used a range of immunoinformatic approaches to design a vaccine construct against Brucella infection, which provides a theoretical basis for future laboratory experiments.

Acknowledgements

The authors are thankful to the State Key Laboratory of Pathogenesis, Prevention, Treatment of Central Asian High Incidence Diseases, The First Affiliated Hospital of Xinjiang Medical University, PR China.

Authors' contributions

This study was conceived and designed by Jianbing Ding and Fengbo Zhang. Bioinformatic analysis was performed by Zhiqiang Chen and Tong Sha. The manuscript was drafted by Zhiqiang Chen and Fengbo Zhang and edited by Jianbing Ding.

Financial support

This study was supported by grants (No. 81860352, No. 81860375, No. 81560322) from the National Natural Science Foundation of China and funds for the Xinjiang Key Construction Project of the 13th Five-Year Plan (basic medicine).

Conflict of interest

The authors declared no potential conflicts of interest.

Ethical standards

This article does not contain any studies using human participants or animals.

Data availability statement

The authors confirm that the data supporting the findings of this study are available within the article and its references.

Footnotes

* These authors contributed equally to this work.

References

Corbel, MJ (1997) Brucellosis: an overview. Emerging Infectious Diseases 3, 213–221.
Eisenberg, T et al. (2020) Expanding the host range: infection of a reptilian host (Furcifer pardalis) by an atypical Brucella strain. Antonie van Leeuwenhoek 113, 1531–1537.
Waldrop, SG and Sriranganathan, N (2019) Intracellular invasion and survival of Brucella neotomae, another possible zoonotic Brucella species. PLoS One 14, e0213601.
Kumar, S et al. (2011) Rapid multiplex PCR assay for the simultaneous detection of the Brucella genus, B. abortus, B. melitensis, and B. suis. Journal of Microbiology and Biotechnology 21, 89–92.
Jia, B et al. (2017) Brucella endocarditis: clinical features and treatment outcomes of 10 cases from Xinjiang, China. Journal of Infection 74, 512–514.
Aslam, M et al. (2020) Potential druggable proteins and chimeric vaccine construct prioritization against Brucella melitensis from species core genome data. Genomics 112, 1734–1745.
Zhang, FB et al. (2019) The immunogenicity of OMP31 peptides and its protection against Brucella melitensis infection in mice. Scientific Reports 9, 3512.
Masjedian Jezi, F et al. (2019) Immunogenic and protective antigens of Brucella as vaccine candidates. Comparative Immunology, Microbiology and Infectious Diseases 65, 29–36.
Shojaei, M et al. (2018) Immunogenicity evaluation of plasmids encoding Brucella melitensis Omp25 and Omp31 antigens in BALB/c mice. Iranian Journal of Basic Medical Sciences 21, 957–964.
Sha, T et al. (2020) Bioinformatics analysis of candidate proteins Omp2b, P39 and BLS for Brucella multivalent epitope vaccines. Microbial Pathogenesis 147, 104318.
Li, ZW et al. (2019) Immunoinformatics prediction of OMP2b and BCSP31 for designing multi-epitope vaccine against Brucella. Molecular Immunology 114, 651–660.
Vishnu, US et al. (2015) Novel vaccine candidates against Brucella melitensis identified through reverse vaccinology approach. Omics: A Journal of Integrative Biology 19, 722–729.
Li, X et al. (2012) Vaccination with recombinant flagellar proteins FlgJ and FliN induce protection against Brucella abortus 544 infection in BALB/c mice. Veterinary Microbiology 161, 137–144.
Coloma-Rivero, RF et al. (2020) The role of the flagellar protein FlgJ in the virulence of Brucella abortus. Frontiers in Cellular and Infection Microbiology 10, 178.
Terwagne, M et al. (2013) Innate immune recognition of flagellin limits systemic persistence of Brucella. Cellular Microbiology 15, 942–960.
Du, ZQ, Li, X and Wang, JY (2016) Immunogenicity analysis of a novel subunit vaccine candidate molecule-recombinant L7/L12 ribosomal protein of Brucella suis. Applied Biochemistry and Biotechnology 179, 1445–1455.
Pratt, AJ et al. (2015) Structural, functional, and immunogenic insights on Cu,Zn superoxide dismutase pathogenic virulence factors from Neisseria meningitidis and Brucella abortus. Journal of Bacteriology 197, 3834–3847.
Martin-Martin, AI et al. (2008) Importance of the Omp25/Omp31 family in the internalization and intracellular replication of virulent B. ovis in murine macrophages and HeLa cells. Microbes & Infection 10, 706–710.
Tibor, A, Decelle, B and Letesson, JJ (1999) Outer membrane proteins Omp10, Omp16, and Omp19 of Brucella spp. are lipoproteins. Infection & Immunity 67, 4960–4962.
Kaushik, P et al. (2010) Protection of mice against Brucella abortus 544 challenge by vaccination with recombinant OMP28 adjuvanted with CpG oligonucleotides. Veterinary Research Communications 34, 119–132.
Golshani, M et al. (2018) Comparison of the protective immunity elicited by a Brucella cocktail protein vaccine (rL7/L12 + rTOmp31 + rSOmp2b) in two different adjuvant formulations in BALB/c mice. Molecular Immunology 103, 306–311.
Armenteros, JJA et al. (2019) SignalP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology 37, 420–423.
Wilkins, MR et al. (1999) Protein identification and analysis tools in the ExPASy server. Methods in Molecular Biology 112, 531–552.
Yang, Z et al. (2012) UCSF Chimera, MODELLER, and IMP: an integrated modeling system. Journal of Structural Biology 179, 269–278.
Yang, JY et al. (2015) The I-TASSER suite: protein structure and function prediction. Nature Methods 12, 7–8.
Doytchinova, IA and Flower, DR (2007) VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics 8, 4.
Rana, A and Akhter, Y (2016) A multi-subunit based, thermodynamically stable model vaccine using combined immunoinformatics and protein structure based approach. Immunobiology 221, 544–557.
Ghaffari-Nazari, H et al. (2015) Improving multi-epitope long peptide vaccine potency by using a strategy that enhances CD4+ T help in BALB/c mice. PLoS One 10, e0142563.
Ivanciuc, O, Schein, CH and Braun, W (2003) SDAP: database and computational tools for allergenic proteins. Nucleic Acids Research 31, 359–362.
Magnan, CN, Randall, A and Baldi, P (2009) SOLpro: accurate sequence-based prediction of protein solubility. Bioinformatics (Oxford, England) 25, 2200–2207.
Deleage, G (2017) ALIGNSEC: viewing protein secondary structure predictions within large multiple sequence alignments. Bioinformatics (Oxford, England) 33, 3991–3992.
Hou, H, Liu, X and Peng, Q (2019) The advances in brucellosis vaccines. Vaccine 37, 3981–3988.
Pourseif, MM et al. (2018) A novel B- and helper T-cell epitopes-based prophylactic vaccine against Echinococcus granulosus. BioImpacts: BI 8, 39–52.
Li, Y et al. (2013) Bioinformatic prediction of epitopes in the Emy162 antigen of Echinococcus multilocularis. Experimental & Therapeutic Medicine 6, 335–340.
Wang, H et al. (2014) Prokaryotic expression and identification of B- and T-cell combined epitopes of Em95 antigen of Echinococcus multilocularis. International Journal of Clinical & Experimental Pathology 7, 5117–5122.
Verma, S et al. (2018) Multi-epitope DnaK peptide vaccine against S. typhi: an in silico approach. Vaccine 36, 4014–4022.
Nezafat, N et al. (2016) Designing an efficient multi-epitope peptide vaccine against Vibrio cholerae via combined immunoinformatics and protein interaction based approaches. Computational Biology & Chemistry 62, 82–95.
van Rosmalen, M, Krom, M and Merkx, M (2017) Tuning the flexibility of glycine-serine linkers to allow rational design of multidomain proteins. Biochemistry 56, 6565–6574.
Arai, R et al. (2001) Design of the linkers which effectively separate domains of a bifunctional fusion protein. Protein Engineering 14, 529–532.
Solanki, V and Tiwari, V (2018) Subtractive proteomics to identify novel drug targets and reverse vaccinology for the development of chimeric vaccine against Acinetobacter baumannii. Scientific Reports 8, 9044.
Validi, M et al. (2018) Immuno-informatics based approaches to design a novel multi epitope-based vaccine for immune response reinforcement against Leptospirosis. Molecular Immunology 104, 128–138.

Fig. 1. Signal peptide prediction for the proteins using SignalP-5.0. SP (Sec/SPI): type of signal peptide predicted; CS: the cleavage site; Other: the probability that the sequence does not have any kind of signal peptide. (a) The signal peptide prediction of Omp19: MGISKASLLSLAAAGIVLA. (b) The signal peptide prediction of Omp22: MFKRSITAAALGAAVMAFAGSAFA. (c) The signal peptide prediction of Omp28: MNTRASNFLAASFSTIMLVGAFSLPAFA.

Fig. 2. Solvent-accessible and hydrophilic regions of Omp22. (a) The blue residues show the surface-accessible regions of Omp22 on the tertiary structure. (b) The accessible residues are displayed as a ProtScale plot. Residues exceeding the threshold (6.0) are considered surface-accessible. (c) The green residues show the hydrophilic regions of Omp22 on the tertiary structure. (d) Two highly hydrophobic areas (aa45–54 and aa132–140) are marked in brown on the ProtScale hydrophobicity plot.

Fig. 3. Solvent-accessible and hydrophilic regions of Omp19. (a) The blue residues show the surface-accessible regions of Omp19 on the tertiary structure. (b) The accessible residues are displayed as a ProtScale plot. Residues exceeding the threshold (6.0) are considered surface-accessible. (c) The green residues show the hydrophilic regions of Omp19 on the tertiary structure. (d) A highly hydrophobic area (aa70–78) is marked in brown on the ProtScale hydrophobicity plot.

Fig. 4. Solvent-accessible and hydrophilic regions of Omp28. (a) The blue residues show the surface-accessible regions of Omp28 on the tertiary structure. (b) The accessible residues are displayed as a ProtScale plot. Residues exceeding the threshold (6.0) are considered surface-accessible. (c) The green residues show the hydrophilic regions of Omp28 on the tertiary structure. (d) A highly hydrophobic area (aa27–33) is marked in brown on the ProtScale hydrophobicity plot.

Fig. 5. Tertiary structure of the proteins. The multi-coloured ribbon and coil structures represent the helix, sheet and coil secondary structure components of the 3D model obtained for each protein. (a) Omp22. (b) Omp19. (c) Omp28.

Table 1. B cell epitopes of Omp22

Table 2. B cell epitopes of Omp19

Table 3. B cell epitopes of Omp28

Table 4. The CD8+ T cell epitopes of Omp22 by SYFPEITHI

Table 5. The CD8+ T cell epitopes of Omp22 by IEDB

Table 6. The CD8+ T cell epitopes of Omp22 by RANKPEP

Table 7. The CD8+ T cell epitopes of Omp19 by SYFPEITHI

Table 8. The CD8+ T cell epitopes of Omp19 by IEDB

Table 9. The CD8+ T cell epitopes of Omp19 by RANKPEP

Table 10. The CD8+ T cell epitopes of Omp28 by SYFPEITHI

Table 11. The CD8+ T cell epitopes of Omp28 by IEDB

Table 12. The CD8+ T cell epitopes of Omp28 by RANKPEP

Table 13. The CD4+ T cell epitopes of Omp22 by SYFPEITHI

Table 14. The CD4+ T cell epitopes of Omp22 by IEDB

Table 15. The CD4+ T cell epitopes of Omp22 by RANKPEP

Table 16. The CD4+ T cell epitopes of Omp19 by SYFPEITHI

Table 17. The CD4+ T cell epitopes of Omp19 by IEDB

Table 18. The CD4+ T cell epitopes of Omp19 by RANKPEP

Table 19. The CD4+ T cell epitopes of Omp28 by SYFPEITHI

Table 20. The CD4+ T cell epitopes of Omp28 by IEDB

Table 21. The CD4+ T cell epitopes of Omp28 by RANKPEP

Table 22. The dominant linear B and T epitopes of Omp22

Table 23. The dominant linear B and T epitopes of Omp19

Table 24. The dominant linear B and T epitopes of Omp28

Table 25. Comparative analysis of all predicted B cell, HLA-I and HLA-II epitopes of Omp22

Table 26. Comparative analysis of all predicted B cell, HLA-I and HLA-II epitopes of Omp19

Table 27. Comparative analysis of all predicted B cell, HLA-I and HLA-II epitopes of Omp28

Table 28. Predicted allergenicity, antigenicity and solubility of the vaccine construct

Fig. 6. Analysis of the secondary structure of the vaccine construct by SOPMA. The sequence length of the vaccine construct is 407 amino acids. The blue h is alpha helix and accounts for 67.32%, the red e is extended strand and accounts for 7.62%, the yellow c is random coil and accounts for 21.13%, and the green t is beta turn and accounts for 3.93%.

Fig. 7. The 3D structure prediction and validation of the vaccine construct. (a) The 3D structure of the model construct. (b) Ramachandran plot of the modelled vaccine, showing 96.0% of residues in the allowed range. The Ramachandran plot takes the angles Phi and Psi as the abscissa and ordinate. Phi is the rotation angle of the C−N bond on the left side of the α carbon in a peptide unit, and Psi is the rotation angle of the C−C bond on the right side of the α carbon. The area inside the yellow contour is fully allowed, the area inside the blue contour is allowed and the area outside the blue contour is not allowed. When more than 90% of the points fall within the blue and yellow contours, the tertiary structure of the model conforms to the rules of stereochemistry.