Skip to main content
×
×
Home

Optimizing phylogenetic supertrees using answer set programming

  • LAURA KOPONEN (a1), EMILIA OIKARINEN (a1), TOMI JANHUNEN (a1) and LAURA SÄILÄ (a2)
Abstract

The supertree construction problem is about combining several phylogenetic trees with possibly conflicting information into a single tree that has all the leaves of the source trees as its leaves and the relationships between the leaves are as consistent with the source trees as possible. This leads to an optimization problem that is computationally challenging and typically heuristic methods, such as matrix representation with parsimony (MRP), are used. In this paper we consider the use of answer set programming to solve the supertree construction problem in terms of two alternative encodings. The first is based on an existing encoding of trees using substructures known as quartets, while the other novel encoding captures the relationships present in trees through direct projections. We use these encodings to compute a genus-level supertree for the family of cats (Felidae). Furthermore, we compare our results to recent supertrees obtained by the MRP method.

Copyright
References
Hide All
Aho, A. V., Sagiv, Y., Szymanski, T. G. and Ullman, J. D. 1981. Inferring a tree from lowest common ancestors with an application to the optimization of relational expressions. SIAM Journal on Computing 10, 3, 405421.
Alviano, M., Dodaro, C., Leone, N. and Ricca, F. 2015. Advances in WASP. In Proceedings of the 13th International Conference on Logic Programming and Nonmonotonic Reasoning, LPNMR 2015. Lecture Notes in Computer Science, vol. 9345. Springer.
Baral, C. 2003. Knowledge Representation, Reasoning, and Declarative Problem Solving. Cambridge University Press, New York, NY, USA.
Baum, B. R. 1992. Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining gene trees. Taxon 41, 1, 310.
Bininda-Emonds, O. R. 2004. Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life. Computational Biology. Springer.
Bomanson, J., Gebser, M. and Janhunen, T. 2014. Improving the normalization of weight rules in answer set programs. In Proceedings of the 14th European Conference on Logics in Artificial Intelligence, JELIA 2014. Lecture Notes in Computer Science, vol. 8761. Springer, 166180.
Brooks, D. R., Erdem, E., Erdoğan, S. T., Minett, J. W. and Ringe, D. 2007. Inferring phylogenetic trees using answer set programming. Journal of Automated Reasoning 39, 4, 471511.
Bryant, D. 1997. Building trees, hunting for trees, and comparing trees. Ph.D. thesis, University of Canterbury.
Byrka, J., Guillemot, S. and Jansson, J. 2010. New results on optimizing rooted triplets consistency. Discrete Applied Mathematics 158, 11, 11361147.
Cavalcanti, M. J. 2007. A phylogenetic supertree of the hammerhead sharks (Carcharhiniformes, Sphyrnidae). Zoological Studies 46, 1, 611.
Chen, D., Diao, L., Eulenstein, O., Fernández-Baca, D. and Sanderson, M. 2003. Flipping: a supertree construction method. DIMACS series in discrete mathematics and theoretical computer science 61, 135162.
Chimani, M., Rahmann, S. and Böcker, S. 2010. Exact ILP solutions for phylogenetic minimum flip problems. In Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology, BCB 2010. ACM, 147153.
Day, W. H., Johnson, D. S. and Sankoff, D. 1986. The computational complexity of inferring rooted phylogenies by parsimony. Mathematical biosciences 81, 1, 3342.
Erdős, P. L., Steel, M. A., Székely, L. A. and Warnow, T. 1999. A few logs suffice to build (almost) all trees (i). Random Structures and Algorithms 14, 2, 153184.
Flynn, J. J., Finarelli, J. A., Zehr, S., Hsu, J. and Nedbal, M. A. 2005. Molecular phylogeny of the Carnivora (Mammalia): assessing the impact of increased sampling on resolving enigmatic relationships. Systematic Biology 54, 2, 317337.
Foulds, L. R. and Graham, R. L. 1982. The Steiner problem in phylogeny is NP-complete. Advances in Applied Mathematics 3, 1, 4349.
Fulton, T. L. and Strobeck, C. 2006. Molecular phylogeny of the Arctoidea (Carnivora): effect of missing data on supertree and supermatrix analyses of multiple gene data sets. Molecular phylogenetics and evolution 41, 1, 165181.
Gebser, M., Janhunen, T. and Rintanen, J. 2014. Answer set programming as SAT modulo acyclicity. In Proceedings of the 21st European Conference on Artificial Intelligence, ECAI 2014. IOS Press, 351356.
Gebser, M., Kaminski, R., Kaufmann, B. and Schaub, T. 2012. Answer Set Solving in Practice. Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan & Claypool Publishers.
Gebser, M., Kaminski, R., Ostrowski, M., Schaub, T., and Thiele, S. 2009. On the input language of ASP grounder Gringo. In Proceedings of the 10th International Conference on Logic Programming and Nonmonotonic Reasoning, LPNMR 2009. Lecture Notes in Computer Science, vol. 5753. Springer, 502508.
Gebser, M., Kaufmann, B., Kaminski, R., Ostrowski, M., Schaub, T. and Schneider, M. T. 2011. Potassco: The Potsdam answer set solving collection. AI Commun. 24, 2, 107124.
Gent, I. P., Prosser, P., Smith, B. M. and Wei, W. 2003. Supertree construction with constraint programming. In Proceedings of the 9th International Conference on Principles and Practice of Constraint Programming, CP 2003. Lecture Notes in Computer Science, vol. 2833. Springer, 837841.
Goloboff, P. A. and Pol, D. 2002. Semi-strict supertrees. Cladistics 18, 5, 514525.
Kavanagh, J., Mitchell, D. G., Ternovska, E., Manuch, J., Zhao, X. and Gupta, A. 2006. Constructing Camin-Sokal phylogenies via answer set programming. In Proceedings of the 13th International Conference on Logic for Programming, Artificial Intelligence, and Reasoning, LPAR 2006. Lecture Notes in Computer Science, vol. 4246. Springer, 452466.
Le, T., Nguyen, H., Pontelli, E. and Son, T. C. 2012. ASP at work: An ASP implementation of PhyloWS. In Technical Communications of the 28th International Conference on Logic Programming, ICLP 2012. LIPIcs, vol. 17. 359369.
Le Berre, D. and Parrain, A. 2010. The Sat4j library, release 2.2. Journal on Satisfiability, Boolean Modeling and Computation 7, 5964.
Martins, R., Manquinho, V. and Lynce, I. 2014. Open-WBO: a modular MaxSAT solver. In Theory and Applications of Satisfiability Testing, SAT 2014. Lecture Notes in Computer Science, vol. 8561. Springer, 438445.
Morgado, A. and Marques-Silva, J. 2010. Combinatorial optimization solutions for the maximum quartet consistency problem. Fundam. Inform. 102, 3–4, 363389.
Nixon, K. C. 1999. The parsimony ratchet, a new method for rapid parsimony analysis. Cladistics 15, 4, 407414.
Piaggio-Talice, R., Burleigh, J. G. and Eulenstein, O. 2004. Quartet supertrees. In Phylogenetic Supertrees. Springer, 173191.
Purvis, A. 1995. A modification to Baum and Ragan's method for combining phylogenetic trees. Systematic Biology 44, 2, 251255.
Ragan, M. A. 1992. Phylogenetic inference based on matrix representation of trees. Molecular phylogenetics and evolution 1, 1, 5358.
Säilä, L. K., Fortelius, M., Oikarinen, E., Werdelin, L. and Corfe, I. 2012. Fossil mammals, phylogenies and climate: the effects of phylogenetic relatedness on range sizes and replacement patterns in changing environments. In Proceedings of 60th Annual Symposium of Vertebrate Palaeontology and Comparative anatomy, SVPCA 2012. Poster.
Säilä, L. K., Fortelius, M., Oikarinen, E., Werdelin, L., Corfe, I. and Tuomola, A. 2011. Taxon replacement: Invasion or speciation? First results for a supertree of Neogene mammals. Journal of Vertebrate Paleontology 31, 3, suppl., 184A.
Semple, C. and Steel, M. 2000. A supertree method for rooted trees. Discrete Applied Mathematics 105, 1, 147158.
Snir, S. and Rao, S. 2012. Quartet MaxCut: a fast algorithm for amalgamating quartet trees. Molecular phylogenetics and evolution 62, 1, 18.
Sridhar, S., Lam, F., Blelloch, G. E., Ravi, R. and Schwartz, R. 2008. Mixed integer linear programming for maximum-parsimony phylogeny inference. IEEE/ACM Transactions on Computational Biology and Bioinformatics 5, 3, 323331.
Steel, M., Dress, A. W. and Bocker, S. 2000. Simple but fundamental limitations on supertree and consensus tree methods. Systematic Biology 49, 2, 363368.
Swenson, M. S., Suri, R., Linder, C. R. and Warnow, T. 2011. An experimental study of Quartets MaxCut and other supertree methods. Algorithms for Molecular Biology 6, 1, 7.
Wilkinson, M., Cotton, J. A., Creevey, C., Eulenstein, O., Harris, S. R., Lapointe, F.-J., Levasseur, C., Mcinerney, J. O., Pisani, D. and Thorley, J. L. 2005. The shape of supertrees to come: tree shape related properties of fourteen supertree methods. Systematic biology 54, 3, 419431.
Wilkinson, M., Pisani, D., Cotton, J. A. and Corfe, I. 2005. Measuring support and finding unsupported relationships in supertrees. Systematic Biology 54, 5, 823831.
Wu, G., You, J.-H. and Lin, G. 2007. Quartet-based phylogeny reconstruction with answer set programming. IEEE/ACM Transactions on Computational Biology and Bioinformatics 4, 1, 139152.
Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Theory and Practice of Logic Programming
  • ISSN: 1471-0684
  • EISSN: 1475-3081
  • URL: /core/journals/theory-and-practice-of-logic-programming
Please enter your name
Please enter a valid email address
Who would you like to send this to? *
×

Keywords

Type Description Title
PDF
Supplementary materials

Koponen supplementary material
Online Appendix

 PDF (190 KB)
190 KB

Metrics

Altmetric attention score

Full text views

Total number of HTML views: 0
Total number of PDF views: 21 *
Loading metrics...

Abstract views

Total abstract views: 325 *
Loading metrics...

* Views captured on Cambridge Core between September 2016 - 15th August 2018. This data will be updated every 24 hours.