Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-12T22:53:38.886Z Has data issue: false hasContentIssue false

Relative mutation rates of each nucleotide for another estimated from allele frequency spectra at human gene loci

Published online by Cambridge University Press:  30 July 2009

LEEYOUNG PARK*
Affiliation:
Natural Science Research Institute, Yonsei University, 134 Shinchon-Dong, Seodaemun-Ku, Seoul 120-749, Korea
*
Tel: (82)2 2123 7615. Fax: (82)2 313 8892. e-mail: lypark@yonsei.ac.kr
Rights & Permissions [Opens in a new window]

Summary

This study aims to comprehensively examine the mutation rates of one base for another in human gene loci. In contrast to most previous efforts based on divergence data from untranscribed regions, the present study employs the basic theory of the reversible recurrent mutation model using large-scale, high-quality re-sequencing data from public databases of gene loci. Population mutation parameters (4Nν and 4Nμ) are obtained for each pair of base substitutions. The estimated parameters show good strand reversal symmetry, supporting the existence of mutation-drift equilibrium. Analysis of specific gene regions including mRNA, coding sequence (CDS), 5′-untranslated region (5′-UTRs), 3′-UTR and intron shows that there are clear differences in the mutation rates of each base for another depending on the location of the base in question. Results from analyses that take the adjacent bases into account exhibit excellent strand reversal symmetry, confirming that the identity of an adjacent base influences mutation rates. The CpG to TpG (or CpG to CpA) substitution is found at a rate approximately seven-fold higher than the reverse transition in intron regions due to cytosine deamination, but the effect is strongly reduced in mRNA regions and almost entirely lost in 5′-UTRs. However, from the overall increased transitions in sites other than CpGs and the proportion of CpGs in the total sequence, CpG methylation is not the main factor responsible for the increased rate of transitions as compared with transversions. In this report, after adjusting average mutation rates to the sequence compositions, no substitution bias is found between A+T and C+G, indicating base composition equilibrium in human gene loci. Population differences are also identified between groups of people of African and European descent, presumably due to past population histories. By applying the basic theory of population genetics to re-sequenced data, this study contributes new, detailed information regarding mutations in human gene regions.

Information

Type
Paper
Copyright
Copyright © Cambridge University Press 2009
Figure 0

Table 1. Estimates of θ (4Nν and 4Nμ) for the AD samples, which are scaled up by 103

Figure 1

Table 2. Estimates of θ (4Nν and 4Nμ) for the ED samples, which are scaled up by 103

Figure 2

Table 3. Test of mutation-drift equilibrium and the correlation between the rates in the AD and ED samples; Spearman rank correlation rho and Kendall's rank correlation tau are indicated with their P-values

Figure 3

Table 4. Average transition to transversion ratios and average A/T-to-G/C to G/C-to-A/T ratios for given target regions (Ts, average rate of transitions; Tv, average rate of transversions). The estimates of average θ for transition, transversion, A/T-to-G/C and G/C-to-A/T are scaled up by 103

Figure 4

Table 5. Estimates of θ (4Nν and 4Nμ) from the distribution of allele frequencies in the AD samples based on either the next or previous nucleotide. The estimates of θ are scaled up by 103

Figure 5

Table 6. Estimates of θ (4Nν and 4Nμ) from the distribution of allele frequencies in the ED samples based on either the next or previous nucleotide. The estimates of θ are scaled up by 103

Figure 6

Table 7. Spearman rank correlation rho and Kendall's rank correlation tau of effects from the next and previous nucleotides; all tests show strong significance with P-values of less than 1·0×10−5 by each test

Supplementary material: File

Park supplementary table 1

Park supplementary table 1

Download Park supplementary table 1(File)
File 342.5 KB
Supplementary material: File

Park supplementary table 2

Park supplementary table 2

Download Park supplementary table 2(File)
File 99.3 KB
Supplementary material: File

Park supplementary table 3

Park supplementary table 3

Download Park supplementary table 3(File)
File 98.3 KB