A knowledge-based approach for selecting information sources*

THOMAS EITER; MICHAEL FINK; HANS TOMPITS

doi:10.1017/S1471068406002754

A knowledge-based approach for selecting information sources*

Published online by Cambridge University Press: 01 May 2007

THOMAS EITER ,

MICHAEL FINK and

HANS TOMPITS

Show author details

THOMAS EITER: Affiliation:
Institut für Informationssysteme, Technische Universität Wien, Favoritenstraβe 9-11, A-1040 Vienna, Austria e-mail: eiter@kr.tuwien.ac.at, michael@kr.tuwien.ac.at, tompits@kr.tuwien.ac.at
MICHAEL FINK: Affiliation:
Institut für Informationssysteme, Technische Universität Wien, Favoritenstraβe 9-11, A-1040 Vienna, Austria e-mail: eiter@kr.tuwien.ac.at, michael@kr.tuwien.ac.at, tompits@kr.tuwien.ac.at
HANS TOMPITS: Affiliation:
Institut für Informationssysteme, Technische Universität Wien, Favoritenstraβe 9-11, A-1040 Vienna, Austria e-mail: eiter@kr.tuwien.ac.at, michael@kr.tuwien.ac.at, tompits@kr.tuwien.ac.at

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Through the Internet and the World-Wide Web, a vast number of information sources has become available, which offer information on various subjects by different providers, often in heterogeneous formats. This calls for tools and methods for building an advanced information-processing infrastructure. One issue in this area is the selection of suitable information sources in query answering. In this paper, we present a knowledge-based approach to this problem, in the setting where one among a set of information sources (prototypically, data repositories) should be selected for evaluating a user query. We use extended logic programs (ELPs) to represent rich descriptions of the information sources, an underlying domain theory, and user queries in a formal query language (here, XML-QL, but other languages can be handled as well). Moreover, we use ELPs for declarative query analysis and generation of a query description. Central to our approach are declarative source-selection programs, for which we define syntax and semantics. Due to the structured nature of the considered data items, the semantics of such programs must carefully respect implicit context information in source-selection rules, and furthermore combine it with possible user preferences. A prototype implementation of our approach has been realized exploiting the DLV KR system and its PLP front-end for prioritized ELPs. We describe a representative example involving specific movie databases, and report about experimental results.

Keywords

knowledge representation nonmonotonic reasoning logic programming answer-set programming information-source selection data repositories preference handling

Information

Type: Regular Papers
Information: Theory and Practice of Logic Programming , Volume 7 , Issue 3 , May 2007 , pp. 249 - 300

DOI: https://doi.org/10.1017/S1471068406002754 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2007

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Abiteboul, S., Buneman, P. and Suciu, D. 2000. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann, Los Altos.Google Scholar

Alferes, J., Pereira, L., Przymusinska, H. and Przymusinski, T. 2002. LUPS – A Language for Updating Logic Programs. Artificial Intelligence 138, 1–2, 87–116.CrossRef Google Scholar

Apt, K., Blair, H. and Walker, A. 1988. Towards a Theory of Declarative Knowledge. See Minker (1988), 89–148.Google Scholar

Arens, Y., Chee, C., Hsu, C. and Knoblock, C. 1993. Retrieving and Integrating Data from Multiple Information Sources. International Journal of Cooperative Information Systems 2, 2, 127–158.CrossRef Google Scholar

Arens, Y. and Knoblock, C. 1992. Planning and Reformulating Queries for Semantically-Modeled Multidatabase Systems. Proceedings of the First International Conference on Information and Knowledge Managements. 92–101.Google Scholar

Arens, Y., Knoblock, C. and Shen, W. 1996. Query Reformulation for Dynamic Information Integration. Journal of Intelligent Information Systems 6, 2–3, 99–130.CrossRef Google Scholar

Baral, C. 2003. Knowledge Representation, Reasoning and Declarative Problem Solving with Answer Sets. Cambridge University Press.CrossRef Google Scholar

Bayardo, R., Bohrer, B., Brice, R., Cichocki, A., Fowler, J., Helal, A., Kashyap, V., Ksiezyk, T., Martin, G., Nodine, M., Rashid, M., Rusinkiewicz, M., Shea, R., Unnikrishnan, C., Unruh, A. and Woelk, D. 1997. InfoSleuth: Semantic Integration of Information in Open and Dynamic Environments (Experience Paper). Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '97). 195–206.CrossRef Google Scholar

Borgida, A., Brachman, R. J., McGuinness, D. L. and Resnick, L. A. 1989. CLASSIC: A Structural Data Model for Objects. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '89), J. Clifford, B. G. Lindsay, and D. Maier, Eds. ACM Press, 58–67.Google Scholar

Brewka, G. and Eiter, T. 1999. Preferred Answer Sets for Extended Logic Programs. Artificial Intelligence 109, 1–2, 297–356.CrossRef Google Scholar

Buccafurri, F., Leone, N. and Rullo, P. 1996. Stable Models and their Computation for Logic Programming with Inheritance and True Negation. Journal of Logic Programming 27, 1, 5–43.CrossRef Google Scholar

Buccafurri, F., Leone, N. and Rullo, P. 2000. Enhancing Disjunctive Datalog by Constraints. IEEE Transactions on Knowledge and Data Engineering 12, 5, 845–860.CrossRef Google Scholar

Burke, R., Hammond, K. and Kozlovsky, J. 1995. Knowledge-Based Information Retrieval from Semi-Structured Text. Working Notes of the AAAI '95 Fall Symposium, Series on AI Applications in Knowledge Navigation and Retrieval, Cambridge, MA. 19–24.Google Scholar

Chen, Y.-J. and Soo, V.-W. 2001. Ontology-Based Information Gathering Agents. Proceedings of the First Asia-Pacific Conference on Web Intelligence (WI 2001), N. Zhong et al., Ed. LNCS, subseries LNAI, vol. 2198. Springer, 423–427.Google Scholar

Collet, C., Huhns, M. and Shen, W.-M. 1991. Resource Integration using a Large Knowledge Base in Carnot. IEEE Computer 24, 12, 55–62.CrossRef Google Scholar

Decker, K., Sycara, K. and Williamson, M. 1997. Middle-Agents for the Internet. Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI '97). Vol. 1. Morgan Kaufmann, 578–583.Google Scholar

Delgrande, J. and Schaub, T. 1994. A General Approach to Specificity in Default Reasoning. Proceedings of the Fourth International Conference on Principles of Knowledge Representation and Reasoning (KR '94). 146–157.CrossRef Google Scholar

Delgrande, J., Schaub, T. and Tompits, H. 2001. plp: A Generic Compiler for Ordered Logic Programs. Proceedings of the Sixth International Conference on Logic Programming and Nonmonotonic Reasoning (LPNMR 2001), T. Eiter, W. Faber, and M. Truszczyński, Eds. LNCS, subseries LNAI, vol. 2173. Springer, 411–415.Google Scholar

Delgrande, J. P., Schaub, T. and Tompits, H. 2003. A Framework for Compiling Preferences in Logic Programs. Theory and Practice of Logic Programming 3, 2, 129–187.CrossRef Google Scholar

Deutsch, A., Fernandez, M., Florescu, D., Levy, A. and Suciu, D. 1999. A Query Language for XML. Computer Networks 31, 11–16, 1155–1169.CrossRef Google Scholar

Dimopoulos, Y. and Kakas, A. 2001. Information Integration and Computational Logic. Computational Logic, Special Issue on the Future Technological Roadmap of Compulog-Net, 105–135.Google Scholar

Eiter, T., Fink, M., Sabbatini, G. and Tompits, H. 2002a. On Properties of Update Sequences Based on Causal Rejection. Theory and Practice of Logic Programming 2, 6, 721–777.CrossRef Google Scholar

Eiter, T., Fink, M., Sabbatini, G. and Tompits, H. 2002b. Using Methods of Declarative Logic Programming for Intelligent Information Agents. Theory and Practice of Logic Programming 2, 6, 645–719.CrossRef Google Scholar

Eiter, T., Fink, M. and Tompits, H. 2003. A Knowledge-Based Approach for Selecting Information Sources. Tech. Rep. INFSYS RR-1843-03-14, 2003, Institut für Informations-systeme, Technische Universität Wien.Google Scholar

Eiter, T., Gottlob, G. and Mannila, H. 1997. Disjunctive Datalog. ACM Transactions on Database Systems 22, 3, 364–418.CrossRef Google Scholar

Eiter, T., Ianni, G., Schindlauer, R. and Tompits, H. 2005a. A Uniform Integration of Higher-Order Reasoning and External Evaluations in Answer-Set Programming. Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (IJCAI 2005). Morgan Kaufmann.Google Scholar

Eiter, T., Ianni, G., Schindlauer, R. and Tompits, H. 2005b. Nonmonotonic Description Logic Programs: Implementation and Experiments. Proceedings of the Twelfth International Conference on Logic for Programming, Artificial Intelligence and Reasoning (LPAR 2004), F. Baader and A. Voronkov, Eds. LNCS, vol. 3452. Springer, 511–517.Google Scholar

Eiter, T., Lukasiewicz, T., Schindlauer, R. and Tompits, H. 2004. Combining Answer-Set Programming with Description Logics for the Semantic Web. Proceedings of the Ninth International Conference on Principles of Knowledge Representation and Reasoning (KR 2004), D. Dubois, C. Welty, and M.-A. Williams, Eds. Morgan Kaufmann, 141–151.Google Scholar

Faber, W., Leone, N. and Pfeifer, G. 2004. Recursive Aggregates in Disjunctive Logic Programs: Semantics and Complexity. Proceedings of the Ninth European Conference on Logics in Artificial Intelligence (JELIA 2004), J. J. Alferes and J. A. Leite, Eds. LNCS, subseries LNAI, vol. 3229. Springer, 200–212.Google Scholar

Fellbaum, C. 1998. WordNet: An Electronic Lexical Database. MIT Press.CrossRef Google Scholar

Fink, M. 2002. Declarative Logic-Programming Components for Information Agents. Ph.D. thesis, Institut für Informationssysteme, Technische Universität Wien, Austria.Google Scholar

Fowler, J., Perry, B., Nodine, M. H. and Bargmeyer, B. 1999. Agent-Based Semantic Interoperability in InfoSleuth. SIGMOD Record 28, 1, 60–67.CrossRef Google Scholar

Fuhr, N. 1999. A Decision-Theoretic Approach to Database Selection in Networked IR. ACM Transactions on Information Systems 17, 3, 229–249.CrossRef Google Scholar

Garcia-Molina, H., Papakonstantinou, Y., Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J., Vassalos, V. and Widom, J. 1997. The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems 8, 2, 117–132.CrossRef Google Scholar

Geerts, P. and Vermeir, D. 1993. A Nonmonotonic Reasoning Formalism using Implicit Specificity Information. Proceedings of the Second International Workshop on Logic Programming and Nonmonotonic Reasoning (LPNMR '93), L.-M. Pereira and A. Nerode, Eds. LNCS, subseries LNAI. Springer, 380–396.Google Scholar

Geerts, P. and Vermeir, D. 1995. Specificity by Default. Proceedings of the European Conference on Symbolic and Quantitative Approaches to Reasoning and Uncertainty (ECSQARU '95). LNCS, subseries LNAI, vol. 946. Springer, 207–216.Google Scholar

Gelfond, M. and Lifschitz, V. 1991. Classical Negation in Logic Programs and Disjunctive Databases. New Generation Computing 9, 3–4, 365–386.CrossRef Google Scholar

Genesereth, M., Keller, A. and Duschka, O. 1997. Infomaster: An Information Integration System. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '97), J. Peckham, Ed. ACM Press, 539–542.Google Scholar

Goto, S., Ozono, T. and Shintani, T. 2001. A Method for Information Source Selection using Thesaurus for Distributed Information Retrieval. Proceedings of the Pacific Asian Conference on Intelligent Systems 2001 (PAIS 2001). 272–277.Google Scholar

Grosof, B. N., Horrocks, I., Volz, R. and Decker, S. 2003. Description Logic Programs: Combining Logic Programs with Description Logics. Proceedings of the Twelfth International World Wide Web Conference (WWW 2003). ACM Press, 48–57.Google Scholar

Huffman, S. B. and Steier, D. 1995. A Navigation Assistant for Data Source Selection and Integration. Working Notes of the AAAI '95 Fall Symposium Series on AI Applications in Knowledge Navigation and Retrieval, Cambridge, MA. AAAI Press, 72–77.Google Scholar

Huhns, M. and Singh, M. 1992. The Semantic Integration of Information Models. Proceedings of the AAAI Workshop on Cooperation among Heterogeneous Intelligent Agents.Google Scholar

Inoue, K. and Sakama, C. 2000. Prioritized Logic Programming and Its Applications to Commonsense Reasoning. Artificial Intelligence 123, 1–2, 185–222.Google Scholar

Kirk, T., Levy, A., Sagiv, Y., and Srivastava, D. 1995. The Information Manifold. Proceedings of the AAAI 2001 Spring Symposium on Information Gathering in Distributed Heterogeneous Environments. AAAI Press, 85–91.Google Scholar

Kowalski, R. A. and Sadri, F. 1990. Logic Programs with Exceptions. Proceedings of the Seventh International Conference on Logic Programming (ICLP '90). MIT Press, 598–616.Google Scholar

Krentel, M. 1988. The Complexity of Optimization Problems. Journal of Computer and System Sciences 36, 490–509.CrossRef Google Scholar

Laenens, E. and Vermeir, D. 1990. A Logical Basis for Object-Oriented Programming. Proceedings of the Second European Workshop on Logics in Artificial Intelligence (JELIA '90). LNCS, subseries LNAI. Springer, 317–332.Google Scholar

Lenat, D. B. and Guha, R. V. 1990. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley.Google Scholar

Leone, N., Pfeifer, G., Faber, W., Eiter, T., Gottlob, G., Perri, S. and Scarcello, F. 2006. The DLV System for Knowledge Representation and Reasoning. ACM Transactions on Computational Logic. To appear.CrossRef Google Scholar

Levy, A., Rajaraman, A. and Ordille, J. 1996. Querying Heterogeneous Information Sources using Source Descriptions. Proceedings of the Twentysecond International Conference on Very Large Data Bases (VLDB '96), T. Vijayaraman, A. Buchmann, C. Mohan, and N. Sarda, Eds. Morgan Kaufmann, 251–262.Google Scholar

Levy, A., Srivastava, D. and Kirk, T. 1995. Data Model and Query Evaluation in Global Information Systems. Journal of Intelligent Information Systems 5, 2, 121–143.CrossRef Google Scholar

Levy, A. and Weld, D. 2000. Intelligent Internet Systems. Artificial Intelligence 118, 1–2, 1–14.CrossRef Google Scholar

Lifschitz, V. and Turner, H. 1994. Splitting a Logic Program. Proceedings of the Eleventh International Conference on Logic Programming (ICLP '94). MIT Press, 23–38.Google Scholar

Luke, S., Spector, L., Rager, D. and Hendler, J. 1997. Ontology-Based Web Agents. Proceedings of the First International Conference on Autonomous Agents (Agents '97), W. L. Johnson, Ed. 59–66.Google Scholar

MacGregor, R. and Bates, R. 1987. The LOOM Knowledge Representation Language. Tech. Rep. RS-87-188, Information Sciences Institute, University of Southern California. Project Web page http://www.isi.edu/isd/LOOM/.Google Scholar

Minker, J., Ed. 1988. Foundations of Deductive Databases and Logic Programming. Morgan Kaufman, Washington DC.Google Scholar

Motik, B., Volz, R. and Maedche, A. 2003. Optimizing Query Answering in Description Logics using Disjunctive Deductive Databases. Proceedings of the Tenth International Workshop on Knowledge Representation meets Databases (KRDB 2003), F. Bry, C. Lutz, U. Sattler, and M. Schoop, Eds. CEUR Workshop Proceedings, vol. 79. RWTH Aachen University, 39–50. http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-79/.Google Scholar

Nodine, M., Ngu, A., Cassandra, A. and Bohrer, W. 2003. Scalable Semantic Brokering over Dynamic Heterogeneous Data Sources in InfoSleuth. IEEE Transactions on Knowledge and Data Engineering 15, 5, 1082–1098.CrossRef Google Scholar

Przymusinski, T. C. 1988. On the Declarative Semantics of Deductive Databases and Logic Programs. See Minker (1998), 193–216.Google Scholar

Sadri, F. and Toni, F. 2000. Computational Logic and Multi-Agent Systems: A Roadmap. Computational Logic, Special Issue on the Future Technological Roadmap of Compulog-Net, 1–31.Google Scholar

Schindlauer, R. 2002. Representation of SQL Queries for Declarative Query Analysis. M.S. thesis, Institut für Informationssysteme, Technische Universität Wien, Austria.Google Scholar

Sim, K. M. and Wong, P. T. 2001. Web-Based Information Retrieval using Agent and Ontology. In Proceedings of the First Asia-Pacific Conference on Web Intelligence (WI 2001), N. Zhong et al., Ed. LNCS, subseries LNAI, vol. 2198. Springer, 384–388.Google Scholar

Singh, M., Cannata, P., Huhns, M., Jacobs, N., Ksiezyk, T., Ong, K., Sheth, A., Tomlinson, C. and Woelk, D. 1997. The Carnot Heterogeneous Database Project: Implemented Applications. Distributed and Parallel Databases 5, 2, 207–225.CrossRef Google Scholar

Subrahmanian, V., Bonatti, P., Dix, J., Eiter, T., Kraus, S., Ozcan, F. and Ross, R. 2000. Heterogeneous Agent Systems: Theory and Implementation. MIT Press.CrossRef Google Scholar

Swift, T. 2004. Deduction in Ontologies via ASP. Proceedings of the Seventh International Conference on Logic Programming and Nonmonotonic Reasoning (LPNMR 2004), I. Niemelä and V. Lifschitz, Eds. LNCS, subseries LNAI, vol. 2923. Springer, 275–288.Google Scholar

Van Nieuwenborgh, D., and Vermeir, D. 2002. Preferred Answer Sets of Ordered Logic Programs. Proceedings of the Eighth European Conference on Logics in Artificial Intelligence (JELIA 2002), S. Flesca, S. Greco, G. Ianni, and N. Leone, Eds. LNCS, subseries LNAI, vol. 2424. 432–443.Google Scholar

Wendlandt, E. B. and Driscoll, J. R. 1991. Incorporating a Semantic Analysis into a Document Retrieval Strategy. Proceedings of the Fourteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, A. Bookstein, Y. Chiaramella, G. Salton, and V. V. Raghavan, Eds. ACM Press, 270–279.Google Scholar

Wiederhold, G. 1993. Intelligent Intration of Information. Proceedings of the ACM SIGMOD Conference on Management of Data (SIGMOD '93). 434–437.CrossRef Google Scholar

Article contents

A knowledge-based approach for selecting information sources*

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests