BIOINFORMATICS
BIOINFORMATICS
Mutation Master: Profiles of substitutions in hepatitis C virus RNA of the core, alternate reading frame, and NS2 coding regions
- JOSÉ L. WALEWSKI, JULIO A. GUTIERREZ, WESTYN BRANCH-ELLIMAN, DECHERD D. STUMP, TOBY R. KELLER, ALFREDO RODRIGUEZ, GARY BENSON, ANDREA D. BRANCH
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 557-571
-
- Article
- Export citation
-
The RNA genome of the hepatitis C virus (HCV) undergoes rapid evolutionary change. Efforts to control this virus would benefit from the advent of facile methods to identify characteristic features of HCV RNA and proteins, and to condense the vast amount of mutational data into a readily interpretable form. Many HCV sequences are available in GenBank. To facilitate analysis, consensus sequences were constructed to eliminate the overrepresentation of certain genotypes, such as genotype 1, and a novel package of sequence analysis tools was developed. Mutation Master generates profiles of point mutations in a population of sequences and produces a set of visual displays and tables indicating the number, frequency, and character of substitutions. It can be used to analyze hundreds of sequences at a time. When applied to 255 HCV core protein sequences, Mutation Master identified variable domains and a series of mutations meriting further investigation. It flagged position 4, for example, where 90% or more of all sequences in genotypes 1, 2, 4, and 5, have N4, whereas those in genotypes 3, 6, 7, 8, 9, and 10 have L4. This pattern is noteworthy: L (hydrophobic) to N (polar) substitutions are generally rare, and genotypes 1, 2, 4, and 5 do not form a recognized super family of sequences. Thus, the L4N substitution probably arose independently several times. Moreover, not one member of genotypes 1, 2, 4, or 5 has L4 and not one member of genotypes 3, 6, 7, 8, 9, or 10 has N4. This nonoverlapping pattern suggests that coordinated changes at position 4 and a second site are required to yield a viable virus. The package generated a table of genotype-specific substitutions whose future analysis may help to identify interacting amino acids. Three substitutions were present in 100% of genotype 2 members and absent from all others: A68D, R74K, and R114H. Finally, this study revealed that ARFP, a novel protein encoded in an overlapping reading frame, is as conserved as conventional HCV proteins, a result supporting a role for ARFP in the viral life cycle. Whereas most conventional programs for phylogenetic analysis of sequences provide information about overall relatedness of genes or genomes, this program highlights and profiles point mutations. This is important because determinants of pathogenicity and drug susceptibility are likely to result from changes at only one or two key nucleotides or amino acid sites, and would not be detected by the type of pairwise comparisons that have usually been performed on HCV to date. This study is the first application of Mutation Master, which is now available upon request (http://tandem.biomath.mssm.edu/mutationmaster.html).
REPORT
REPORT
Surprising flexibility of leader RNA determinants for r-protein L4-mediated transcription termination in the Escherichia coli S10 operon
- JANICE M. ZENGEL, YIZHONG SHA, LASSE LINDAHL
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 572-578
-
- Article
- Export citation
-
Escherichia coli ribosomal protein L4 autogenously regulates transcription of the S10 operon, which encodes L4 and 10 other ribosomal proteins. Regulation results from L4-stimulated premature transcription termination at a U-rich site in the untranslated leader. The process requires transcription factor NusA. Here we report a detailed analysis of the RNA requirements for NusA-dependent, L4-mediated transcription control. We found that efficient regulation requires multiple features of the S10 leader, including two hairpins, called HD and upper HE, a connecting tether, and a U-rich sequence at the distal side of HE. As expected, regulation was optimal when all 7 Us were maintained in the U4CGU3 sequence at the termination site. However, despite the apparent specificity of L4 action on only the S10 operon, there is surprising flexibility at the primary sequence level for the HD-tether-HE region. Changes in the sequence of non-base-paired nucleotides flanking the HD hairpin or an A at the second position of the HD loop reduced L4 regulation, but other changes had little or no effect. Furthermore, generic hairpins from other RNAs could replace the natural HD and upper HE hairpins with little or no reduction of L4 control, suggesting that the secondary structure elements are also relatively generic. The lack of specific sequence requirements suggests that L4 may recognize multiple elements within this region of the nascent leader.
Branchpoint selection in the splicing of U12-dependent introns in vitro
- TIMOTHY S. MCCONNELL, SOO-JIN CHO, MIKKO J. FRILANDER, JOAN A. STEITZ
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 579-586
-
- Article
- Export citation
-
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.
Research Article
Metal ion binding and the folding of the hairpin ribozyme
- TIMOTHY J. WILSON, DAVID M.J. LILLEY
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 587-600
-
- Article
- Export citation
-
The hairpin ribozyme comprises two formally unpaired loops carried on two arms of a four-way helical RNA junction. Addition of divalent metal ions brings about a conformational transition into an antiparallel structure in which there is an intimate association between the loops to generate the active form of the ribozyme. In this study, we have used fluorescence resonance energy transfer to analyze the global folding of the complete ribozyme, and the simple four-way junction derived from it, over a wide concentration range of divalent and monovalent metal ions. The simple junction undergoes an ion-induced rotation into an antiparallel form. In the presence of a constant background concentration of sodium ions, the magnesium-ion-induced transition is characterized by noncooperative binding with a Hill coefficient n = 1. By contrast, the magnesium-ion-induced folding of the complete ribozyme is more complex, involving two distinct binding phases. The first phase occurs in the micromolar range, and involves the cooperative binding of at least three magnesium ions. This can also be achieved by high concentrations of sodium ions, and is therefore likely to be due to diffuse binding of cations at the junction and the interface of the loop–loop interaction. The second phase occurs in the millimolar range, and can only be induced by divalent metal ions. This transition occurs in response to the noncooperative, site-specific binding of magnesium ions. We observe a good correlation between the extent of ion-induced folding and cleavage activity.
Identification of an essential pseudoknot in the putative downstream internal ribosome entry site in giardiavirus transcript
- SRINIVAS GARLAPATI, CHING C. WANG
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 601-611
-
- Article
- Export citation
-
Enhanced translation of giardiavirus-luciferase chimeric mRNA in Giardia lamblia requires the initial 264-nt viral capsid coding region as a putative internal ribosomal entry site (IRES). Essential structural elements in this site include (1) a downstream box (DB) complementary to the anti-DB at the 3′ end of 16S-like rRNA, (2) stem-loops I, II, III, and IVA, and (3) a pentanucleotide 5′-UCUCC-3′ immediately downstream from stem loop IVA. A search for the structural role of the pentanucleotide suggested that it may form a pseudoknot with another pentanucleotide 5′-GGAGA-3′ in loop II. Alteration of the two pentanucleotides by site-directed mutagenesis resulted in a drastic reduction in translation of the transcript. But the loss was recovered by compensatory changes in the two sequences, suggesting Watson–Crick base pairings between them. Results from in vitro enzymatic and chemical structural probing supported the presence of such a pseudoknot 143 nt downstream from the initiation codon. Minor repositioning of this codon led invariably to a complete loss of translation, suggesting that the initiation site is confined within a rigid position defined by all the structural elements in the IRES including the pseudoknot. This is the first pseudoknot of its kind shown to play an important role in a downstream IRES of a viral transcript. The finding is particularly interesting because it could reflect a unique feature of translation initiation in Giardia, which is known to have exceedingly short (1–6 nt) 5′ untranslated regions in its mRNAs.
Crosslinking of 4.5S RNA to the Escherichia coli ribosome in the presence or absence of the protein Ffh
- JUTTA RINKE-APPEL, MONIKA OSSWALD, KLAUS VON KNOBLAUCH, FLORIAN MUELLER, RICHARD BRIMACOMBE, PETR SERGIEV, OLGA AVDEEVA, ALEXEY BOGDANOV, OLGA DONTSOVA
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 612-625
-
- Article
- Export citation
-
Radioactively labeled 4.5S RNA containing statistically distributed 4-thiouridine residues in place of normal uridine was prepared by T7 transcription. The ability of this modified 4.5S RNA to form a complex with the protein Ffh was demonstrated by a gel shift assay. The modified 4.5S RNA, with or without Ffh, was added to Escherichia coli ribosomes under various conditions, and crosslinking from the thiouridine residues was induced by irradiation at 350 nm. The crosslinked ribosomal components were analyzed by our standard procedures. Two clearly defined types of crosslinking were observed. The first was a crosslink to 23S rRNA, which was entirely dependent both on the presence of Ffh and a nascent protein chain in the 50S subunit. This crosslink was localized to nt ∼ 2828–2837 of the 23S rRNA, from position 84 of the 4.5S molecule. The second type of crosslinking, to the 30S ribosomal subunit, was independent of the presence of Ffh, and was found both with vacant 70S ribosomes or isolated 30S subunits. Here the crosslink was localized to the 3′-terminal region of the 16S rRNA, from positions 29–50 of the 4.5S RNA. Crosslinking to ribosomal protein S1 was also observed. The known crystal structure of the protein Ffh/4.5S RNA fragment complex was extrapolated by computer modeling so as to include the whole 4.5S molecule, and this was docked onto the ribosome using the crosslinking data. The results are discussed in terms of multiple functions and binding sites of the 4.5S RNA.
Yeast Pescadillo is required for multiple activities during 60S ribosomal subunit synthesis
- MARLENE OEFFINGER, ANTHONY LUENG, ANGUS LAMOND, DAVID TOLLERVEY
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 626-636
-
- Article
- Export citation
-
The Pescadillo protein was identified via a developmental defect and implicated in cell cycle progression. Here we report that human Pescadillo and its yeast homolog (Yph1p or Nop7p) are localized to the nucleolus. Depletion of Nop7p leads to nuclear accumulation of pre-60S particles, indicating a defect in subunit export, and it interacts genetically with a tagged form of the ribosomal protein Rpl25p, consistent with a role in subunit assembly. Two pre-rRNA processing pathways generate alternative forms of the 5.8S rRNA, designated 5.8SL and 5.8SS. In cells depleted for Nop7p, the 27SA3 pre-rRNA accumulated, whereas later processing intermediates and the mature 5.8SS rRNA were depleted. Less depletion was seen for the 5.8SL pathway. TAP-tagged Nop7p coprecipitated precursors to both 5.8SL and 5.8SS but not the mature rRNAs. We conclude that Nop7p is required for efficient exonucleolytic processing of the 27SA3 pre-rRNA and has additional functions in 60S subunit assembly and transport. Nop7p is a component of at least three different pre-60S particles, and we propose that it carries out distinct functions in each of these complexes.
Hepatitis delta virus ribonucleoproteins shuttle between the nucleus and the cytoplasm
- JOÃO P. TAVANEZ, CELSO CUNHA, MARIA C.A. SILVA, EZIO DAVID, JOÃO MONJARDINO, MARIA CARMO-FONSECA
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 637-646
-
- Article
- Export citation
-
Hepatitis delta virus (HDV) infection of individuals infected with hepatitis B virus (HBV) is associated with more severe liver damage and an increased risk of fulminant disease. HDV is a single-stranded RNA virus that encodes a single protein, the delta antigen, which is expressed in two forms, small (S-HDAg) and large (L-HDAg). Here we show that although HDV ribonucleoproteins are mainly detected in the nucleus, they are also present in the cytoplasm of cells infected with HDV or transfected with HDV cDNA. Making use of an heterokaryon assay, we demonstrate that HDV ribonucleoproteins shuttle continuously between the nucleus and the cytoplasm. In the absence of HDV RNA, both forms of the delta antigen are retained in the nucleus, whereas in the absence of the delta antigen, HDV RNA is predominantly detected in the cytoplasm. Coexpression of HDV RNA and S-HDAg (which binds to the viral RNA and contains a nuclear localization signal) results in nuclear accumulation of the viral RNA. This suggests that HDV RNA mediates export of viral particles to the cytoplasm whereas the delta antigen triggers their reimport into the nucleus.
In vivo selection of better self-splicing introns in Escherichia coli: The role of the P1 extension helix of the Tetrahymena intron
- FENG GUO, THOMAS R. CECH
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 647-658
-
- Article
- Export citation
-
In vivo selection was used to improve the activity of the Tetrahymena pre-rRNA self-splicing intron in the context of heterologous exons. The intron was engineered into a kanamycin nucleotidyltransferase gene, with the pairing between intron bases and the 5′ and 3′ splice sites maintained. The initial construct failed to confer kanamycin resistance on Escherichia coli, although the pre-mRNA was active in splicing in vitro. Random mutation libraries were constructed to identify active intron variants in E. coli. All the active mutants sequenced contained mutations disrupting a base-paired region above the paired region P1 (referred to as the P1 extension region or P1ex) that involves the very 5′ end of the intron. Subsequent site-directed mutagenesis confirmed that these P1ex mutations are responsible and sufficient to activate the intron splicing in E. coli. Thus, it appears that too strong of a secondary structure in the P1ex element can be inhibitory to splicing in vivo. In vitro splicing assays demonstrated that two P1ex mutant constructs splice six to eight times faster than the designed construct at 40 μM GTP concentration. The relative reaction rates of the mutant constructs compared to the original design are further increased at a lower GTP concentration. Possible mechanisms by which the disrupted P1ex structure could influence splicing rates are discussed. This study emphasizes the value of using libraries of random mutations to improve the activity of ribozymes in heterologous contexts in vivo.
Efficient aminoacylation of the tRNAAla acceptor stem: Dependence on the 2:71 base pair
- PENNY J. BEUNING, MARIA C. NAGAN, CHRISTOPHER J. CRAMER, KARIN MUSIER-FORSYTH, JOSEP-LLUIS GELPÍ, DONALD BASHFORD
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 659-670
-
- Article
- Export citation
-
Specific aminoacylation by aminoacyl-tRNA synthetases requires accurate recognition of cognate tRNA substrates. In the case of alanyl-tRNA synthetase (AlaRS), RNA duplexes that mimic the acceptor stem of the tRNA are efficient substrates for aminoacylation in vitro. It was previously shown that recognition by AlaRS is severely affected by a simple base pair transversion of the G2:C71 pair at the second position in the RNA helix. In this study, we determined the aminoacylation efficiencies of 50 variants of the tRNAAla acceptor stem containing substitutions at the 2:71 position. We find that there is not a single functional group of the wild-type G2:C71 base pair that is critical for positive recognition. Rather, we observed that base-pair orientation plays an important role in recognition. In particular, pyrimidine2:purine71 combinations generally resulted in decreased aminoacylation efficiency compared to the corresponding purine:pyrimidine pair. Moreover, the activity of a pyrimidine:purine variant could be partially restored by the presence of a major groove amino group at position 71. In an attempt to understand this result further, dielectric continuum electrostatic calculations were carried out, in some cases with additional inclusion of van der Waals interaction energies, to determine interaction potentials of the wild-type duplexAla and seven 2:71 variants. This analysis revealed a positive correlation between major groove negative electrostatic potential in the vicinity of the 3:70 base pair and measured aminoacylation efficiency.
Region-specific alternative splicing in the nervous system: Implications for regulation by the RNA-binding protein NAPOR
- WENQING ZHANG, HAIYING LIU, KYOUNGHA HAN, PAULA J. GRABOWSKI
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 671-685
-
- Article
- Export citation
-
Alternative RNA splicing generates extensive proteomic diversity in the nervous system, yet few neural-specific RNA binding proteins have been implicated in splicing control. Here we show that the biochemical properties and spatial expression of mouse neuroblastoma apoptosis-related RNA-binding protein (NAPOR; also called NAPOR-1) are consistent with its roles in the regulation of the exon 5 and exon 21 splicing events of the N-methyl-D-aspartate (NMDA) receptor R1 transcript. NAPOR, which is closely related to CUG binding protein 2 (CUG-BP2), promotes exon 21 and represses exon 5 splicing in functional coexpression assays. These NMDA mRNA isoforms are distributed, in vivo, in a region-specific manner in rat brain, such that high levels of exon 21 selection and exon 5 skipping coincide with high NAPOR mRNA expression in the forebrain. Within the forebrain, this spatial correspondence is most striking in the visual cortex. In contrast, low NAPOR expression coincides with the reciprocal pattern of alternative splicing in the hindbrain. Complementary experiments demonstrate a tissue-specific distribution of NAPOR, CUG-BP, and other highly related proteins within the nervous system as assayed by probing forebrain and hindbrain nuclear extracts with monoclonal antibody, mAb 3B1. Thus, NAPOR may be one of a group of closely related proteins involved in splicing regulation within the brain. An intronic RNA element responsible for the silencing of exon 21 splicing is identified by mutational analysis and shown to bind directly to recombinant NAPOR protein, suggesting a model in which exon 21 selection is positively regulated by an antirepression mechanism of action.
T7 RNA polymerase-directed transcripts are processed in yeast and link 3′ end formation to mRNA nuclear export
- KEN DOWER, MICHAEL ROSBASH
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 686-697
-
- Article
- Export citation
-
We have characterized transcripts synthesized in vivo by bacteriophage T7 RNA polymerase to investigate yeast mRNA processing. T7 transcripts are not capped, consistent with capping being tightly coupled to RNA polymerase II (pol II) transcription. In contrast to higher eukaryotic non-pol II transcripts, yeast T7 transcripts are spliced as well as cleaved and polyadenylated. However, T7 and pol II transcripts are affected differently in cleavage and polyadenylation mutant strains, indicating that pol II may have a role in yeast 3′ end formation. T7 transcripts with 3′ ends directed by a polyadenylation signal are exported from the nucleus, and this export is dependent on the canonical cleavage and polyadenylation machinery. Importantly, transcripts with T7 terminator-directed 3′ ends are unadenylated and predominantly nuclear in wild-type cells. Our results suggest that transcription by pol II is required for neither the nuclear export of an in vivo-transcribed mRNA nor for the retention of transcripts with aberrant 3′ ends. Moreover, proper 3′ end formation may be necessary and sufficient to promote mRNA export in yeast.
METHOD
METHOD
Interaction of RNA with phage display selected peptides analyzed by capillary electrophoresis mobility shift assay
- PIOTR MUCHA, AGNIESZKA SZYK, PIOTR REKOWSKI, RICHARD GUENTHER, PAUL F. AGRIS
-
- Published online by Cambridge University Press:
- 24 April 2002, pp. 698-704
-
- Article
- Export citation
-
A sensitive capillary electrophoresis mobility shift assay (CEMSA) to analyze RNA/peptide interactions has been developed. Capillary electrophoresis (CE) has been adapted for investigating the interaction between variously methylated 17-nt analogs of the yeast tRNAPhe anticodon stem and loop domain (ASLPhe) and 15-amino-acid peptides selected from a random phage display library (RPL). A peptide-concentration-dependent formation of RNA/peptide complex was clearly visible during CEMSA. In the presence of peptide, the UV-monitored CE peak for ASLPhe with three of the five naturally occurring modifications (2′-O-methylcytidine (Cm32), 2′-O-methylguanine (Gm34) and 5-methylcytidine (m5C40) shifted from 18.16 to 20.90 min. The mobility shift was observed only for methylated RNA. The negative effects of diffusion, electroosmotic flow and adhesion of molecules to the capillary internal wall were suppressed by using a buffer containing a sieving polymer and a polyacrylamide-coated capillary. Under these conditions, well-shaped peaks and resolution of RNA free and bound to peptide were achieved. Peptide tF2, the most populated ligand in the RPL, specifically bound triply methylated ASLPhe in a methylated nucleoside-dependent manner. CE was found to be an efficient and sensitive method for the qualitative analysis of RNA–peptide interaction and should be generally applicable to the study of RNA–peptide (protein) interactions.