Hostname: page-component-6766d58669-kl59c Total loading time: 0 Render date: 2026-05-19T07:17:18.576Z Has data issue: false hasContentIssue false

In trans variant calling reveals enrichment for compound heterozygous variants in genes involved in neuronal development and growth.

Published online by Cambridge University Press:  13 June 2019

Allison J. Cox
Affiliation:
Department of Pediatrics, The University of Iowa, Iowa City, IA, USA Interdisciplinary Graduate Program in Genetics, The University of Iowa, Iowa City, IA, USA
Fillan Grady
Affiliation:
Medical Scientist Training Program, University of Iowa, Iowa City, IA, USA
Gabriel Velez
Affiliation:
Medical Scientist Training Program, University of Iowa, Iowa City, IA, USA Omics Laboratory, Department of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, USA
Vinit B. Mahajan
Affiliation:
Omics Laboratory, Department of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, USA Palo Alto Veterans Administration, Palo Alto, CA, USA
Polly J. Ferguson
Affiliation:
Department of Pediatrics, The University of Iowa, Iowa City, IA, USA
Andrew Kitchen
Affiliation:
Department of Anthropology, The University of Iowa, Iowa City, IA, USA
Benjamin W. Darbro
Affiliation:
Department of Pediatrics, The University of Iowa, Iowa City, IA, USA
Alexander G. Bassuk*
Affiliation:
Department of Pediatrics, The University of Iowa, Iowa City, IA, USA Interdisciplinary Graduate Program in Genetics, The University of Iowa, Iowa City, IA, USA
*
Author for correspondence: Alexander Bassuk, E-mail: alexander-bassuk@uiowa.edu
Rights & Permissions [Opens in a new window]

Abstract

Compound heterozygotes occur when different variants at the same locus on both maternal and paternal chromosomes produce a recessive trait. Here we present the tool VarCount for the quantification of variants at the individual level. We used VarCount to characterize compound heterozygous coding variants in patients with epileptic encephalopathy and in the 1000 Genomes Project participants. The Epi4k data contains variants identified by whole exome sequencing in patients with either Lennox-Gastaut Syndrome (LGS) or infantile spasms (IS), as well as their parents. We queried the Epi4k dataset (264 trios) and the phased 1000 Genomes Project data (2504 participants) for recessive variants. To assess enrichment, transcript counts were compared between the Epi4k and 1000 Genomes Project participants using minor allele frequency (MAF) cutoffs of 0.5 and 1.0%, and including all ancestries or only probands of European ancestry. In the Epi4k participants, we found enrichment for rare, compound heterozygous variants in six genes, including three involved in neuronal growth and development – PRTG (p = 0.00086, 1% MAF, combined ancestries), TNC (p = 0.022, 1% MAF, combined ancestries) and MACF1 (p = 0.0245, 0.5% MAF, EU ancestry). Due to the total number of transcripts considered in these analyses, the enrichment detected was not significant after correction for multiple testing and higher powered or prospective studies are necessary to validate the candidacy of these genes. However, PRTG, TNC and MACF1 are potential novel recessive epilepsy genes and our results highlight that compound heterozygous variants should be considered in sporadic epilepsy.

Information

Type
Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © Cambridge University Press 2019
Figure 0

Fig. 1. Flow diagram for the processing and analysis of variant lists. Vcf files are annotated and filtered using SNPSift/SNPEff. Final vcf along with parameter and sample information files are input to VarCount. The input files are processed to recode minor and major alleles when the MAF >0.5 and to count the number of individuals with variants qualifying based on information in the parameter file. The final output lists for every transcript or gene, the number of individuals with qualified variants in that locus (counts.text), which individuals have the variant(s) (countfile.txt), and which variants are harboured by each individual (output.txt).

Figure 1

Table 1. Rare (<0.5 and 1.0% minor allele frequency) compound heterozygous variants in Epi4k participants.

Figure 2

Fig. 2. PRTG compound heterozygous mutations in Epi4k probands. (a) Theoretical model of the human PRTG structure spanning the plasma membrane indicating mutation locations in each child. The three pairs of in trans mutations, indicated in red, were found using a <1% MAF threshold. (b) Schematic representation of PRTG functional domains. Multiple sequence alignment of the PRTG Ig-1 domain. The E104 residue is 100% conserved across seven species. (c) Top: Electrostatic potential surface of PRTG calculated in APBS. Bottom: Close-up of the PRTG electrostatic potential surface at the site of mutation. The p.Glu104Gly mutation leads to a loss of negative charge, which may disrupt interactions with putative PRTG binding partners. The p.Glu104Asp mutation does not lead to a change in charge or electrostatic potential.

Figure 3

Table 2. De novo variants in Epi4k probands with compound heterozygous variants in PRTG, TNC or MACF1.

Supplementary material: File

Cox et al. supplementary material

Tables S1-S5

Download Cox et al. supplementary material(File)
File 528.9 KB