Skip to main content Accessibility help
×
×
Home

Natural language interfaces to databases

  • Ann Copestake (a1) and Karen Sparck Jones (a1)
Abstract

This paper reviews the current state of the art in natural language access to databases. This has been a long-standing area of work in natural language processing. But though some commercial systems are now available, providing front ends has proved much harder than was expected, and the necessary limitations on front ends have to be recognized. The paper discusses the issues, both general to language and task-specific, involved in front end design, and the way these have been addressed, concentrating on the work of the last decade. The focus is on the central process of translating a natural language question into a database query, but other supporting functions are also covered. The points are illustrated by the use of a single example application. The paper concludes with an evaluation of the current state, indicating that future progress will depend on the one hand on general advances in natural language processing, and on the other on expanding the capabilities of traditional databases.

Copyright
References
Hide All
Allen, J, 1987. Natural Language Processing. Menlo Park, CA: Benjamin/Cummings.
Alshawi, H et al. , 1989. Research Programme in Natural Language Processing, Final Report. Cambridge: SRI Cambridge Computer Science Research Centre.
Alshawi, H, 1989. “Analysing the dictionary definitions”. In: Boguraev, B and Briscoe, T (eds.), Computational Lexicography for Natural Language Processing, pp 153169. London: Longman.
Ballard, B and Stumberger, D, 1986. “Semantic acquisition in TELI”. Proceedings of the 24th Annual Meeting of the ACL, pp 2029. New York.
Bates, M, Moser, MG and Stallard, D, 1986. “The IRUS transportable natural language database interface. In: Kerschberg, L (ed.), Expert Database Systems, pp 617630. Menlo Park, CA: Benjamin/Cummings.
Bates, M and Weischedel, R, 1987. Tutorial: Evaluating Natural Language Interfaces. 25th Annual Meeting of the ACL, Stanford CA. Cambridge, MA: Bolt, Berank and Newman.
Binot, JL et al. , 1988. LOKI: A Logic Oriented Approach to Data and Knowledge Bases Supporting Natural Language Interaction. London: Scicon Ltd.
Bobrow, RJ, Resnik, P and Weischedel, RM, 1990. “Multiple underlying systems: translating user requests into programs to produce answers”. Proceedings of the 28th Annual Meeting of the ACL, pp 227234.
Boguraev, BK and Sparck Jones, K, 1982. “How to drive a database front end using general semantic information”. Proceedings of the Conference on Applied Natural Language Processing, pp 8188. Santa Monica CA.
Boguraev, BK and Sparck Jones, K, 1984. “A natural language front end to databases with evaluative feedback”. In: Gardarin, G and Gelenbe, E (eds.), New Applications of Databases, pp 159183. New York: Academic Press.
Brachman, RJ, Gilbert, VP and Levesque, HJ, 1985. “An essential hybrid reasoning system: knowledge and symbol level accounts of KRYPTON”. Proceedings of the 9th IJCAI, pp 532539. Los Angeles.
Brennan, SE, 1988. “The multi-media articulation of answers in a natural language database query system”. Proceedings of the 2nd Conference on Applied Natural Language Processing, pp 18. Austin.
Brodie, ML and Mylopolous, J, 1988. Readings in Artificial Intelligence and Databases. Los Altos, CA: Morgan Kaufmann.
Bronnenberg, WJHJ et al. , 1980. “The question answering system PHLIQAI”. In: Bolc, L (ed.), Natural Language Question Answering Systems, pp 217305. London: Macmillan.
Carbonell, JG, 1983. “Discourse pragmatics and ellipsis resolution intask-oriented natural language interfaces”. Proceedings of the 21st Annual Meeting of the ACL, pp 164168. Cambridge, MA: MIT.
Carroll, J and Grover, C, 1989. “The derivation of a large computational lexicon for English from LDOCE”. In: Boguraev, B and Briscoe, T (eds.), Computational Lexicography for Natural Language Processing, pp 117133. London: Longman.
Clifford, J, 1988. “Natural language querying of historical databases”. Computational Linguistics 14(4) 1034.
Codd, EF et al. , 1978. RENDEZVOUS Version 1: An Experimental English-Language Query Formulation System for Casual Users of Relational Databases. Research Report RJ2144. San José, CA: IBM Research Laboratory.
Copestake, AA and Sparck Jones, K, 1989. Inference in Natural Language Front Ends to Databases. Technical Report 163. University of Cambridge: Computer Laboratory.
Damerau, F, 1980. The Transformational Question Answering (TQA) System: Description, Operating Experience and Implications. Report RC8287. Yorktown Heights, NY: IBM Thomas J Watson Research Center.
Damerau, F, 1981. “Operating statistics for the transformational question answering system”. American Journal of Computational Linguistics 7 3042.
Date, CJ, 1983. An Introduction to Database Systems: Vol. 2. Reading, MA: Addison-Wesley.
Davidson, J and Kaplan, SJ, 1983. “Natural language access to databases: Interpreting update requests”. American Journal of Computational Linguistics 9 5768.
Frost, DP, 1989. “The design of a natural language interface for medical expert systems”. PhD thesis, University of London (University College and Middlesex School of Medicine.
Gazdar, G and Mellish, C, 1989. Natural Language Processing in LISP. Reading, MA: Addison-Wesley.
Ginsparg, J, 1983. “A robust portable natural language database interface”. Proceedings of the Conference on Applied Natural Language Processing, pp 2531. Santa Monica, CA.
Green, B et al. , 1961. “BASEBALL: An automatic question answerer”. Proceedings of the Western Joint Computer Conference 19 219224; reprinted in Grosz, Sparck Jones and Webber (1986) pp 545–549.
Grosz, B, Sparck Jones, K and Webber, BL (eds.), 1986. Readings in Natural Language Processing. Los Altos, CA: Morgan Kaufmann.
Grosz, B et al. , 1987. “TEAM: an experiment in the design of transportable natural-language interfaces”. Artificial Intelligence 12 173243.
Grover, C et al. , 1989. The Alvey Natural Language Tools Grammar (second release). Technical Report 162. University of Cambridge: Computer Laboratory.
Haas, N and Hendrix, GG, 1983. “Learning without being told: Acquiring knowledge for information management”. In: Michalski, RS, Carbonell, JG and Mitchell, TM (eds.), Machine Learning, pp 405427. Palo Alto, CA: Tioga Publishing.
Hafner, CD, 1985. “Semantics of temporal data and temporal queries”. Proceedings of the 23rd Annual Meeting of the ACL, pp 18. Chicage, IL.
Harris, LR, 1977. “User oriented database query with the ROBOT natural language query system”. International Journal of Man-Machine Studies 9 697713.
Harris, LR, 1984. “Experience with INTELLECT: Artificial intelligence technology transfer”. The AI Magazine 5(2) 4350.
Hendrix, GG, et al., 1978. “Developing a natural language interface to complex data”. ACM Transactions on Database Systems 3 105147; reprinted in Grosz, Sparck Jones and Webber (1986) pp 563–584.
Hirst, G, 1987. Semantic Interpretation and the Resolution of Ambiguity. Cambridge: Cambridge University Press.
Hobbs, JR and Martin, P, 1987. “Semantics of temporal data and temporal queries”. Proceedings of the 10th IJCAI, pp 520523. Karlsruhe, West Germany.
Jarke, M, 1986. “Control of search and knowledge acquisition in large-scale KBMS”. In: Brodie, ML and Mylopolous, J (eds.), On Knowledge Base Management Systems, pp 507522. New York: Springer Verlag.
Kalita, JK, Jones, ML and McCalla, GI, 1986. “Summarising natural language database responses”. Computational Linguistics 12 107124.
Kaplan, SJ, Mays, E and Joshi, AK, 1980. A Technique for Managing the Lexicon in a Natural Language Interface to a Changing Database. Technical Report MS-CIS-80–10. University of Pennsylvania: Department of Computer and Information Science.
Kaplan, SJ, 1982. “Cooperative responses from a portable natural language query system”. Artificial Intelligence 19 165187.
Kent, W, 1978. Data and Reality. Amsterdam: North-Holland.
King, JL, 1981. “QUIST: A system for semantic query optimization in relational databases”. Proceedings of the 7th International Conference on Very Large Databases, pp 510517. Cannes, France.
Konolige, K, 1979. A Framework for a Portable Natural-language Interface to Large Databases. Technical Note 197. SRI International.
Konolige, K, 1981. The Database as Model: A Metatheoretic Approach. Technical Note 255. SRI International.
Manferdelli, JL, 1989. “Natural languages”. Sun Technology, Summer 1989, 122129.
McKeown, KR, 1983. “Paraphrasing questions using given and new information”. American Journal of Computational Linguistics 9 110.
McKeown, KR, 1985. Text Generation. Cambridge: Cambridge University Press.
Means, LG, 1988. “Cn yur cmputr raed ths”. Proceedings of the 2nd Conference on Applied Natural Language Processing, pp 93100. Austin, Texas.
Mellish, CS, 1989. “Some chart-based techniques for parsing ill-formed input”. Proceedings of the 27th ACL, pp 102109. Vancouver.
Moore, RC et al. , 1982. “Panel: Natural language access to databases—theoretical/technical issues”. Proceedings of the 20th ACL, pp 4466, 169171. Toronto.
Mueckstein, E-MM, 1983. “Q-TRANS: Query translation into English”. Proceedings of the 8th IJCAI, pp 660662. Karlsruhe, West Germany.
Perrault, CR and Grosz, BJ, 1986. “Natural language interfaces”. Annual Review of Computer Science 1 4782.
Proctor, C and Young, S, 1987. “Dialogue control in conversational speech interfaces”. In: Taylor, MM, Néel, F and Bouwhuis, DG (eds.), The Structure of Multimodal Dialogue. Amsterdam: North-Holland.
Reiter, R et al. , 1983. “A panel on AI and databases”. Proceedings of the 8th IJCAI, pp 11991206. Karlsruhe, West Germany.
Salveter, S and Maier, D, 1982. “Natural language database updates”. Proceedings of the 20th Annual Meeting of the ACL, pp 6773. Toronto.
Scha, RJH, 1983. “Logical foundations for question answering”. PhD thesis.
Sondheimer, NK et al. , 1981. “Panel: Evaluation of natural language front ends to databases”. Proceedings of the 19th ACL, pp 2942. Stanford, CA.
Sparck Jones, K et al. , 1984. “Panel: Natural language and databases, again”. Proceedings of the COLING 84, 10th International Conference on Computational Linguistics, 22nd Annual Meeting of the ACL, pp 182193. Stanford, CA.
Sparck Jones, K and Tait, JI, 1984. “Linguistically motivated descriptive term selection”. Proceedings of the COLING 84, 10th International Conference on Computational Linguistics, 22nd Annual Meeting of the ACL, pp 287290. Stanford, CA.
Sparck Jones, K, 1988. A Note on Robustness in Front Ends. University of Cambridge: Computer Laboratory.
Stallard, DG, 1986. “A terminological simplification transformation for natural language question answering systems”. Proceedings of the 24th ACL, pp 241246. New York.
Stenton, SP, 1987. “Dialogue management for co-operative knowledge based systems”. The Knowledge Engineering Review 2 99122.
Storrs, G, du Boulay, B and Gray, PMD, 1985. A Metadata Advisor: Some Sample Queries. University of Aberdeen: Department of Computer Science.
Tennant, H, 1981. Natural Language Processing. New York: Petrocelli.
Tennant, H et al. , 1983. “Menu-based natural language understanding”. Proceedings of the 21st Annual Meeting of the ACL, pp 151158. Cambridge, MA: MIT.
Tomita, M, 1985. “An efficient context-free parsing algorithm for natural languages”. Proceedings of the 9th IJCAI, pp 756764. Los Angeles.
Wahlster, W, 1989. “User and discourse models for multimodal communication”. Proceedings of the Hewlett-Packard Laboratories 1989 European Scientific Symposium, pp 115131. Paris.
Waltz, DL, 1978. “An English language question answering system for a large relational database”. Communications of the ACM 21 526539.
Warren, D and Pereira, F, 1982. “An efficient easily adaptable system for interpreting natural language queries”. American Journal of Computational Linguistics 8 110122.
Webber, BL, 1986. “Questions, answers and responses: Interacting with knowledge base systems”. In: Brodie, ML and Mylopolous, J (eds.), On Knowledge Base Management Systems, pp 365402. New York: Springer Verlag.
Weischedel, RM, 1989. “A hybrid approach to representation in the JANUS natural language processor”. Proceedings of the 27th ACL, pp 193202. Vancouver, British Columbia.
Whittaker, S and Stenton, P, 1989. “User studies and the design of natural language systems”. Proceedings of the 4th EACL, pp 116123. Manchester.
Wilks, Y et al. , 1989. “A tractable machine dictionary as a resource for computational semantics”. In: Boguraev, B and Briscoe, T (eds.), Computational Lexicography for Natural Language Processing, pp 193228. London: Longman.
Woods, W, 1972. The Lunar Sciences Natural Language Information System. Final Report. Cambridge, MA: Bolt, Beranek and Newman.
Woods, W, 1978. “Semantics and quantification in natural language question answering”. In: Yovits, M (ed.), Advances in Computers, pp 187. New York: Academic Press.
Young, SJ, 1989. Final Report: Alvey/SERC Project MMI003, Voice Operated Database Inquiry Systems, Speech Input. University of Cambridge: Engineering Department.
Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

The Knowledge Engineering Review
  • ISSN: 0269-8889
  • EISSN: 1469-8005
  • URL: /core/journals/knowledge-engineering-review
Please enter your name
Please enter a valid email address
Who would you like to send this to? *
×

Metrics

Altmetric attention score

Full text views

Total number of HTML views: 0
Total number of PDF views: 0 *
Loading metrics...

Abstract views

Total abstract views: 0 *
Loading metrics...

* Views captured on Cambridge Core between <date>. This data will be updated every 24 hours.

Usage data cannot currently be displayed