Skip to main content

A Novel Approach for Pathway Analysis of GWAS Data Highlights Role of BMP Signaling and Muscle Cell Differentiation in Colorectal Cancer Susceptibility

  • Aniket Mishra (a1) and Stuart MacGregor (a1)
  • Please note a correction has been issued for this article.

Genome-wide association studies (GWAS) have revolutionized the field of gene mapping. As the GWAS field matures, it is becoming clear that for many complex traits, a proportion of the missing heritability is attributable to common variants of individually small effect. Detecting these small effects individually can be difficult, and statistical power would be increased if relevant variants could be grouped together for testing. Here, we propose a VEGAS2Pathway approach that aggregates association strength of individual markers into pre-specified biological pathways. It accounts for gene size and linkage disequilibrium between markers using simulations from the multivariate normal distribution. Pathway size is taken into account via a resampling approach. Importantly, since the approach only requires summary data, the method can easily be applied in all GWASs, including meta-analysis, singleton-based, family-based, and DNA-pooling-based designs. This approach is implemented in a user-friendly web page and a command line tool. The web implementation uses gene-sets from the gene ontology (GO), curated gene-sets from MSigDB (containing canonical pathways and gene-sets from BIOCARTA, REACTOME, KEGG databases), PANTHER, and pathway commons databases, enabling analysis of a wide range of complex traits. We applied this method on a colorectal cancer GWAS meta-analysis data set (10,934 cases, 12,328 controls) from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO). We report statistically significant enrichment of association signal for the ‘BMP signaling’ and ‘muscle cell differentiation’ pathways, suggesting a possible role for these pathways onto the risk of colorectal cancer.

Corresponding author
address for correspondence: Aniket Mishra, Statistical Genetics, QIMR Berghofer Medical Research Institute, 300 Herston Rd, Brisbane QLD 4006, Australia. E-mail:
Hide All
Alarmo, E. L., Parssinen, J., Ketolainen, J. M., Savinainen, K., Karhu, R., & Kallioniemi, A. (2009). BMP7 influences proliferation, migration, and invasion of breast cancer cells. Cancer Letters, 275, 3543.
Brosens, L. A., van Hattem, A., Hylind, L. M., Iacobuzio-Donahue, C., Romans, K. E., Axilbund, J., . . . Giardiello, F. M. (2007). Risk of colorectal cancer in juvenile polyposis. Gut, 56, 965967.
Cerami, E. G., Gross, B. E., Demir, E., Rodchenkov, I., Babur, O., Anwar, N., . . . Sander, C. (2011). Pathway Commons, a web resource for biological pathway data. Nucleic Acids Research, 39, D685–690.
Chau, J. F., Jia, D., Wang, Z., Liu, Z., Hu, Y., Zhang, X., . . . Li, B. (2012). A crucial role for bone morphogenetic protein-Smad1 signalling in the DNA damage response. Nature Communications, 3, 836.
Cho, D. H., Jo, Y. K., Kim, S. C., Park, I. J., & Kim, J. C. (2012). Down-regulated expression of ATG5 in colorectal cancer. Anticancer Research, 32, 40914096.
Chu, H., Luo, H., Wang, H., Chen, X., Li, P., Bai, Y., . . . Zhang, G. (2014). Silencing BMP-2 expression inhibits A549 and H460 cell proliferation and migration. Diagnostic Pathology, 9, 123.
De Boeck, A., Pauwels, P., Hensen, K., Rummens, J. L., Westbroek, W., Hendrix, A., . . . De Wever, O. (2013). Bone marrow-derived mesenchymal stem cells promote colorectal cancer progression through paracrine neuregulin 1/HER3 signalling. Gut, 62, 550560.
Evangelou, M., Rendon, A., Ouwehand, W. H., Wernisch, L., & Dudbridge, F. (2012). Comparison of methods for competitive tests of pathway analysis. PLoS One, 7, e41018.
Fenouille, N., Grosso, S., Yunchao, S., Mary, D., Pontier-Bres, R., Imbert, V., . . . Lagadec, P. (2012). Calpain 2-dependent IkappaBalpha degradation mediates CPT-11 secondary resistance in colorectal cancer xenografts. Journal of Pathology, 227, 118129.
Gene Ontology Consortium. (2008). The gene ontology project in 2008. Nucleic Acids Research, 36, D440–444.
Guha, U., Gomes, W. A., Kobayashi, T., Pestell, R. G., & Kessler, J. A. (2002). In vivo evidence that BMP signaling is necessary for apoptosis in the mouse limb. Developmental Biology, 249, 108120.
Holmans, P., Green, E. K., Pahwa, J. S., Ferreira, M. A., Purcell, S. M., Sklar, P., . . . Craddock, N. (2009). Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. American Journal of Human Genetics, 85, 1324.
Howe, J. R., Mitros, F. A., & Summers, R. W. (1998). The risk of gastrointestinal carcinoma in familial juvenile polyposis. Annals of Surgical Oncology, 5, 751756.
Jeon, H. S., Dracheva, T., Yang, S. H., Meerzaman, D., Fukuoka, J., Shakoori, A., . . . Jen, J. (2008). SMAD6 contributes to patient survival in non-small cell lung cancer and its knockdown reestablishes TGF-beta homeostasis in lung cancer cells. Cancer Research, 68, 96869692.
Jiao, S., Peters, U., Berndt, S., Brenner, H., Butterbach, K., Caan, B. J., . . . Hsu, L. (2014). Estimating the heritability of colorectal cancer. Human Molecular Genetics, 23, 38983905.
Kobayashi, T., Lyons, K. M., McMahon, A. P., & Kronenberg, H. M. (2005). BMP signaling stimulates cellular differentiation at multiple steps during cartilage development. Proceedings of the National Academy of Sciences of the United States of America, 102, 1802318027.
Kodach, L. L., Wiercinska, E., de Miranda, N. F., Bleuming, S. A., Musler, A. R., Peppelenbosch, M. P., . . . Hardwick, J. C. (2008). The bone morphogenetic protein pathway is inactivated in the majority of sporadic colorectal cancers. Gastroenterology, 134, 13321341.
Lee, P. H., O'Dushlaine, C., Thomas, B., & Purcell, S. M. (2012). INRICH: interval-based enrichment analysis for genome-wide association studies. Bioinformatics, 28, 17971799.
Liberzon, A., Subramanian, A., Pinchback, R., Thorvaldsdottir, H., Tamayo, P., & Mesirov, J. P. (2011). Molecular signatures database (MSigDB) 3.0. Bioinformatics, 27, 17391740.
Liu, J. Z., McRae, A. F., Nyholt, D. R., Medland, S. E., Wray, N. R., Brown, K. M., . . . MacGregor, S. (2010). A versatile gene-based test for genome-wide association studies. American Journal of Human Genetics, 87, 139145.
Liu, Q., Tomei, S., Ascierto, M. L., De Giorgi, V., Bedognetti, D., Dai, C., . . . Marincola, F. M. (2014). Melanoma NOS1 expression promotes dysfunctional IFN signaling. Journal of Clinical Investigation, 124, 21472159.
Liu, Y., Chen, J., Yang, Y., Zhang, L., & Jiang, W. G. (2012). Muolecular impact of bone morphogenetic protein 7, on lung cancer cells and its clinical significance. International Journal of Molecular Medicine, 29, 10161024.
Mishra, A., & MacGregor, S. (2015). VEGAS2: Software for more flexible gene-based testing. Twin Research and Human Genetics, 18, 8691.
Nomura, S., Yoshitomi, H., Takano, S., Shida, T., Kobayashi, S., Ohtsuka, M., . . . Miyazaki, M. (2008). FGF10/FGFR2 signal induces cell migration and invasion in pancreatic cancer. British Journal of Cancer, 99, 305313.
Pers, T. H., Karjalainen, J. M., Chan, Y., Westra, H. J., Wood, A. R., Yang, J., . . . Franke, L. (2015). Biological interpretation of genome-wide association studies using predicted gene functions. Nature Communications, 6, 5890.
Peters, U., Bien, S., & Zubair, N. (2015). Genetic architecture of colorectal cancer. Gut, 64, 16231636
Peters, U., Jiao, S., Schumacher, F. R., Hutter, C. M., Aragaki, A. K., Baron, J. A., . . . Colon Cancer Family Registry and the Genetics and Epidemiology of Colorectal Cancer Consortium. (2013). Identification of genetic susceptibility loci for colorectal tumors in a genome-wide meta-analysis. Gastroenterology, 144, 799807.e24.
Robinson, M. R., Wray, N. R., & Visscher, P. M. (2014). Explaining additional genetic variation in complex traits. Trends in Genetics, 30, 124132.
Schaefer, C. F., Anthony, K., Krupa, S., Buchoff, J., Day, M., Hannay, T., & Buetow, K. H. (2009). PID: The pathway interaction database. Nucleic Acids Research, 37, D674–679.
Segre, A. V., DIAGRAM Consortium, MAGIC investigators Groop, L., Mootha, V. K., Daly, M. J., & Altshuler, D. (2010). Common inherited variation in mitochondrial genes is not enriched for associations with type 2 diabetes or related glycemic traits. PLoS Genet, 6, e1001058.
Study, C., Houlston, R. S., Webb, E., Broderick, P., Pittman, A. M., Di Bernardo, M. C., . . . Dunlop, M. G. (2008). Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer. Nature Genetics, 40, 14261435.
Thomas, P. D., Campbell, M. J., Kejariwal, A., Mi, H., Karlak, B., Daverman, R., . . . Narechania, A. (2003). PANTHER: a library of protein families and subfamilies indexed by function. Genome Research, 13, 21292141.
Thu, K. L., Radulovich, N., Becker-Santos, D. D., Pikor, L. A., Pusic, A., Lockwood, W. W., . . . Tsao, M. S. (2014). SOX15 is a candidate tumor suppressor in pancreatic cancer with a potential role in Wnt/beta-catenin signaling. Oncogene, 33, 279288.
Wang, K., Li, M., & Hakonarson, H. (2010). Analysing biological pathways in genome-wide association studies. Nature Reviews Genetics, 11, 843854.
Wang, K., Zhang, H., Kugathasan, S., Annese, V., Bradfield, J. P., Russell, R. K., . . . Hakonarson, H. (2009). Diverse genome-wide association studies associate the IL12/IL23 pathway with Crohn disease. American Journal of Human Genetics, 84, 399405.
Wang, L., Jia, P., Wolfinger, R. D., Chen, X., & Zhao, Z. (2011). Gene set analysis of genome-wide association studies: Methodological issues and perspectives. Genomics, 98, 18.
Wojcik, G. L., Kao, W. H., & Duggal, P. (2015). Relative performance of gene- and pathway-level methods as secondary analyses for genome-wide association studies. BMC Genetics, 16, 34.
Yamaguchi, J., Nagayama, S., Chino, A., Sakata, A., Yamamoto, N., Sato, Y., . . . Arai, M. (2014). Identification of coding exon 3 duplication in the BMPR1A gene in a patient with juvenile polyposis syndrome. Japanese Journal of Clinical Oncology, 44, 10041008.
Yang, J., Ferreira, T., Morris, A. P., Medland, S. E., Genetic Investigation of, A. T. C., Replication, D. I. G., . . . Visscher, P. M. (2012). Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nature Genetics, 44, 369375.
Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Twin Research and Human Genetics
  • ISSN: 1832-4274
  • EISSN: 1839-2628
  • URL: /core/journals/twin-research-and-human-genetics
Please enter your name
Please enter a valid email address
Who would you like to send this to? *


Type Description Title
Supplementary materials

Mishra and MacGregor supplementary material S3
Supplementary Table

 Unknown (57 KB)
57 KB
Supplementary materials

Mishra and MacGregor supplementary material S2
Supplementary Table

 Unknown (74 KB)
74 KB
Supplementary materials

Mishra and MacGregor supplementary material S1
Supplementary Table

 Unknown (49 KB)
49 KB
Supplementary materials

Mishra and MacGregor supplementary material S4
Mishra and MacGregor supplementary material

 Word (56 KB)
56 KB


Full text views

Total number of HTML views: 117
Total number of PDF views: 249 *
Loading metrics...

Abstract views

Total abstract views: 1325 *
Loading metrics...

* Views captured on Cambridge Core between 20th January 2017 - 19th August 2018. This data will be updated every 24 hours.

A correction has been issued for this article: