Hostname: page-component-848d4c4894-mwx4w Total loading time: 0 Render date: 2024-06-16T00:17:13.537Z Has data issue: false hasContentIssue false

Determination of probability distribution of diplotype configuration (diplotype distribution) for each subject from genotypic data using the EM algorithm

Published online by Cambridge University Press:  31 July 2002

Y. KITAMURA
Affiliation:
Division of Statistical Genetics, Institute of Rheumatology, Tokyo Women's Medical University, Tokyo, Japan Mitsubishi Research Institute, Inc., Tokyo, Japan
M. MORIGUCHI
Affiliation:
Institute of Rheumatology, Tokyo Women's Medical University, Tokyo, Japan
H. KANEKO
Affiliation:
Division of Statistical Genetics, Institute of Rheumatology, Tokyo Women's Medical University, Tokyo, Japan Institute of Rheumatology, Tokyo Women's Medical University, Tokyo, Japan
H. MORISAKI
Affiliation:
Department of Bioscience, National Cardiovascular Center Research Institute, Osaka, Japan
T. MORISAKI
Affiliation:
Department of Bioscience, National Cardiovascular Center Research Institute, Osaka, Japan
K. TOYAMA
Affiliation:
Department of Bioscience, National Cardiovascular Center Research Institute, Osaka, Japan
N. KAMATANI
Affiliation:
Division of Statistical Genetics, Institute of Rheumatology, Tokyo Women's Medical University, Tokyo, Japan Institute of Rheumatology, Tokyo Women's Medical University, Tokyo, Japan
Get access

Abstract

Haplotype analysis is important for mapping traits. Recently, methods for estimating haplotype frequencies from genotypes of unrelated individuals based on the expectation-maximization (EM) algorithm have been developed. Our program estimates haplotype frequencies in the population and determines the posterior probability distribution of diplotype configuration (diplotype distribution) for each subject based on the estimated haplotype frequencies. Samples from three ethnic groups for the smoothelin gene (SMTN) and those from three Japanese groups for serum amyloid A genes (SAA@) were analyzed. The estimated diplotype distribution for each individual was concentrated, in most cases, in a single diplotype configuration. The diplotype configuration thus determined was the same as that determined in in vitro experiments, with one exception. Thus, the diplotype configurations determined using the estimated haplotype frequencies from unrelated individuals are reliable. Using this method, the risk of a subject developing a phenotype may be estimated from the diplotype distribution when the phenotype is associated with diplotype configurations.

Type
Research Article
Copyright
University College London 2002

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)