To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
By
J. W. Brahan, Institute for Information Technology National Research Council Ottawa Canada K1A 0R6,
B. Farley, Institute for Information Technology National Research Council Ottawa Canada K1A 0R6,
R. A. Orchard, Institute for Information Technology National Research Council Ottawa Canada K1A 0R6,
A. Parent, Institute for Information Technology National Research Council Ottawa Canada K1A 0R6,
C. S. Phan, Institute for Information Technology National Research Council Ottawa Canada K1A 0R6
Most expert systems perform a task on behalf of the user. The task usually involves gathering and analyzing data, and recommending or initiating the appropriate action. However, expert systems can also play an important role in showing the user how to perform a task. In this role, the expert system provides support until it eventually becomes of decreasing importance as its knowledge base is transferred to the user. This category includes Help Systems, Coaching Systems, and Tutorial Systems. In this paper, we discuss the development of an Intelligent Advisor combining the three functions in a system to assist the user in acquiring and refining the knowledge required to carry out a design task. The combined system provides a means of introducing a training facility as an integral part of the work environment. The primary goal of our project is the creation of a system in which the generic advisor components are identified along with the methodology required to adapt them to specific applications. The conceptual modelling phase of database design was chosen as the application domain to develop the system and to demonstrate feasibility. An initial prototype has been implemented, which illustrates the operation of the system in each of the three modes as applied to database modelling. The technology is currently being extended to a second application domain.
Introduction
ERMA (Entity-Relationship Modelling Advisor) is a knowledge-based system that serves as a consultant to the user of a computer-based design tool, providing advice as required.
By
S. Craw, Department of Computing Science University of Aberdeen Aberdeen AB9 2UE,
D. Sleeman, Department of Computing Science University of Aberdeen Aberdeen AB9 2UE,
N. Graner, Department of Computing Science University of Aberdeen Aberdeen AB9 2UE,
M. Rissakis, Department of Computing Science University of Aberdeen Aberdeen AB9 2UE,
S. Sharma, Department of Computing Science University of Aberdeen Aberdeen AB9 2UE
The Machine Learning Toolbox (MLT), an Esprit project (P2154), provides an integrated toolbox of ten Machine Learning (ML) algorithms. One distinct component of the toolbox is Consultant, an advice-giving expert system, which assists a domain expert to choose and use a suitable algorithm for his learning problem. The University of Aberdeen has been responsible for the design and implementation of Consultant.
Consultant's knowledge and domain is unusual in several respects. Its knowledge represents the integrated expertise of ten algorithm developers, whose algorithms offer a range of ML techniques; but also some algorithms use fairly similar approaches. The lack of an agreed ML terminology was the initial impetus for an extensive, associated help system. From an MLT user's point of view, an ML beginner requires significant assistance with terminology and techniques, and can benefit from having access to previous, successful applications of ML to similar problems; but in contrast a more experienced user of ML does not wish constant supervision. This paper describes Consultant, discusses the methods used to achieve the required flexibility of use, and compares Consultant's similarities and distinguishing features with more standard expert system applications.
INTRODUCTION
The Machine Learning Toolbox (MLT), an Esprit project (P2154), provides an integrated toolbox of ten Machine Learning (ML) algorithms. One distinct component of the toolbox is Consultant, an advice-giving expert system. It provides domain experts with assistance and guidance on the selection and use of tools from the toolbox, but it is specifically aimed at experts who are not familiar with ML and its design has focused on their needs.
By
B. R. Gaines, Knowledge Science Institute, University of Calgary Calgary, Alberta, Canada T2N 1N4.,
M. L. G. Shaw, Knowledge Science Institute, University of Calgary Calgary, Alberta, Canada T2N 1N4.
An intelligent learning data base (ILDB) system is an integrated learning system which implements automatic knowledge acquisition from data bases by providing formalisms for 1) translation of standard data base information into a form suitable for use by its induction engines. 2) using induction techniques to produce knowledge from data bases, and 3) interpreting the knowledge produced efficiently to solve users' problems. Although a lot of work on knowledge acquisition from data bases has been done, the requirements for building practical learning systems to learn from conventional data bases are still far away for existing systems to reach. A crucial requirement is more efficient learning algorithms as realistic data bases are usually fairly large. Based on KEshell. dBASE3 and the low-order polynomial induction algorithm HCV. this paper presents a knowledge engineering shell. KEsheH2. which implements the 3 phases of automatic knowledge acquisition from data bases in an integral way.
INTRODUCTION
Over the past twenty years data base research has evolved technologies that are now widely used in almost every computing and scientific field. However, many new advanced applications including computer-aided design (CAD), computer-aided manufacturing (CAM), computer-aided software engineering (CASE), image processing, and office automation (OA) have revealed that traditional data base management systems (DBMSs) are inadequate, especially on the following cases [Wu 90b]:
Conventional data base technology has laid particular stress on dealing with large amounts of persistent and highly structured data efficiently and using transactions for concurrency control and recovery.
By
D. R. Pugh, Department of Computer Science University of Wales Aberystwyth Dyfed SY23 3DB United Kingdom,
C. J. Price, Department of Computer Science University of Wales Aberystwyth Dyfed SY23 3DB United Kingdom
This paper describes work on the construction of a configurable knowledge acquisition tool, Jigsaw. Unlike automated knowledge acquisition programs such as MORE [Kahn, 1988], MOLE [Eshelman, 1988], and OPAL [Musen, 1989], each of which automates elicitation for just one problem solving method, it is possible to alter Jigsaw's knowledge acquisition strategy to match different problem solving methods.
The work is based upon eliciting knowledge for problem solvers made up from different combinations of generic task (as denned in [Chandrasekaran, 1986] and [Chandrasekaran, 1988]). Each combination of generic tasks defines the functionality of a different problem solving method. However, the eventual aim of this work is that it will be possible to adapt it to a range of different KADS [Schreiber et. al., 1987] interpretation models and thus it will be part of a complete knowledge acquisition methodology.
The paper outlines the requirements for such a knowledge acquisition tool and details the distributed architecture which allows the tool, Jigsaw, to achieve the required flexibility to elicit knowledge for such problem solvers. An important part of this flexibility is the way in which Jigsaw can be configured to match different types of problem solver. This is described in some detail.
Jigsaw has been used to reproduce the MDX2 [Sticklen, 1987] knowledge base, which was initially constructed by using manual knowledge acquisition techniques. The paper gives a description of how Jigsaw elicited this knowledge.
Before turning to some selected issues in type theory, type-shifting and the semantics of noun phrases, which form the main topic of this chapter, I will briefly mention some other, very exciting topics of current research in formal semantics that are relevant to the theme of this book, but which do not receive the attention they deserve in what follows.
One is the centrality of context dependence, context change, and a more dynamic view of the basic semantic values. This view of meanings, in terms of functions from contexts to contexts, is important both for the way it helps us make better sense of such things as temporal anaphora, and also for the way it solves some of the crucial foundational questions about presuppositions and the principles governing presupposition projection. The work of Irene Heim [86] is central in this area.
Another crucial area that will not be covered is the recently emerging emphasis on the algebraic structure of certain model-theoretic domains, such as entities, events or eventualities. As well as the work of Godehard Link [137] and Jan-Tore Lønning [140] in this area, referred to by Fenstad [54], I would also mention the attempt by Emmon Bach [6] to generalize the issue into questions about natural language metaphysics. An example of such a question is whether languages that grammaticize various semantic distinctions (such as mass versus count nouns) differently have different ontologies, or whether Link's use of Boolean algebras, which may be either atomic or not necessarily atomic, provides a framework in which to differentiate those structures that are shared from additional ones that only some languages express through grammar.
In the linguistic study of syntax, various formats have been developed for measuring the combinatorial power of natural languages. In particular, there is Chomsky's well-known hierarchy of grammars which can be employed to calibrate syntactic complexity. No similar tradition exists in linguistic semantics, however. Existing powerful formalisms for stating truth conditions, such as Montague Semantics, are usually presented as monoliths like Set Theory or Intensional Type Theory, without any obvious way of expressive fine-tuning. The purpose of this chapter is to show how semantics has its fine-structure too, when viewed from a proper logical angle.
The framework used to demonstrate this claim is that of Categorial Grammar (see Buszkowski et al. [24] or Oehrle et al. [173] for details). In this linguistic paradigm, syntactic sensitivity resides in the landscape of logical calculi of implication, which manipulate functional types as conditional propositions. The landscape starts from a classical Ajdukiewicz system with modus ponens only and then ascends to intuitionistic conditional logic, which allows also subproofs with conditionalisation. A well-known intermediate system is the Lambek Calculus (Moortgat [156], van Benthem [232]), which takes special care in handling occurrences of propositions or types. These calculi represent various ‘engines’ for driving categorial parsing, which can be studied as to their formal properties by employing the tools of logical proof theory.
My role in the original workshop which gave rise to this book was to sum up what had been said during the week, trying to relate the different contributions one to another. In preparing for what threatened to be an arduous task, it occurred to me that a fundamental assumption underlying the organization of such a workshop needed to be taken out and examined overtly: calling a workshop ‘Computational Linguistics and Formal Semantics’ assumes that these two areas of academic endeavour have something to say to each other; may, even, be inextricably related.
Superficially, of course, this seems likely to be true: an investigation of language and its use could be seen as the core interest of both disciplines. But, on looking a little more closely, the intimate connection tends to evaporate. For most computational linguists, the task is to define and implement an adequate treatment of (some subset of) some natural language within the framework of a computer application. Even when they are primarily interested in demonstrating that a particular linguistic theory or a particular paradigm of computation is superior to its rivals, the argument will frequently be couched in application-oriented terms: such and such a theory is superior because it leads to clearer linguistic descriptions which are easier to modify and to debug, for example, or such and such a computational paradigm is superior because it allows linguistic programmers to compartmentalize their knowledge of a language, describing sub-parts independently without having to worry about all the possible intricate complexities of interactions between them.
This paper is a tutorial on property-theoretic semantics: our aim is to provide an accessible account of property theory and its application to the formal semantics of natural language. We develop a simple version of property theory and provide the semantics for a fragment of English in the theory. We shall say more about the particular form of the theory in the next section but to begin with we outline the main reasons why we believe property-theoretic semantics to be worthy of attention.
INTENSIONALITY
The main motivation for the development of a theory of propositions and properties stems from the desire to develop an adequate account of intensionality in natural language. In this section we briefly review some possible approaches to intensionality in order to motivate the approach we shall eventually adopt.
INTENSIONALITY VIA POSSIBLE WORLDS
Traditional approaches to intensionality employ the notion of possible world. Propositions are taken to be sets of possible worlds and properties to be functions from individuals to propositions. The modal and doxastic operators are then unpacked as functions from propositions to propositions. For example, the modal operator of necessity is analysed as that function which maps a proposition P to that proposition which consists of all those worlds whose accessible worlds are elements of P. Different choices of the relation of accessibility between worlds yield different modal and doxastic notions. Kripke [127] introduced the possible world analysis of necessity and possibility while Hintikka [88] extended the analysis to deal with belief and knowledge.
The workshop that inspired this book was the high point of a collaborative research project entitled ‘Incorporation of Semantics into Computational Linguistics’, supported by the EEC through the COST13 programme. The workshop was also the first official academic event to be organized by IDSIA, a newly inaugurated institute of the Dalle Molle Foundation situated in Lugano, Switzerland devoted to artificial intelligence research.
A few words about the project may help to place the contents of the book in its proper context. The underlying theme was that although human language is studied from the standpoint of many different disciplines, there is rather less constructive interaction between the disciplines than might be hoped for, and that for such interaction to take place, a common framework of some kind must be established. Computational linguistics (CL) and artificial intelligence (AI) were singled out as particular instances of such disciplines: each has focused on rather different aspects of the relationship between language and computation.
Thus, what we now call CL grew out of attempts to apply the concepts of computer science to generative linguistics (e.g. Zwicky and his colleagues [255], Petrick [181]). Given these historical origins, CL unwittingly inherited some of the goals for linguistic theory that we normally associate with Chomskyan linguistics. One such goal, for example, is to finitely describe linguistic competence, that is, the knowledge that underlies the human capacity for language by means of a set of universal principles that characterize the class of possible languages, and a set of descriptions for particular languages.
This paper is written from the point of view of one who works in artificial intelligence (AI): the attempt to reproduce interesting and distinctive aspects of human behaviour with a computer, which, in my own case, means an interest in human language use.
There may seem little of immediate relevance to cognition or epistemology in that activity. And yet it hardly needs demonstration that AI, as an aspiration and in practice, has always been of interest to philosophers, even to those who may not accept the view that AI is, essentially, the pursuit of metaphysical goals by non-traditional means.
As to cognition in particular, it is also a commonplace nowadays, and at the basis of cognitive science, that the structures underlying AI programs are a guide to psychologists in their empirical investigations of cognition. That does not mean that AI researchers are in the business of cognition, nor that there is any direct inference from how a machine does a task, say translating a sentence from English to Chinese, to how a human does it. It is, however, suggestive, and may be the best intellectual model we currently have of how the task is done. So far, so well known and much discussed in the various literatures that make up cognitive science.
Large practical computational-linguistic applications, such as machine translation systems, require a large number of knowledge and processing modules to be put together in a single architecture and control environment. Comprehensive practical systems must have knowledge about speech situations, goal-directed communicative actions, rules of semantic and pragmatic inference over symbolic representations of discourse meanings and knowledge of syntactic and phonological/graphological properties of particular languages. Heuristic methods, extensive descriptive work on building world models, lexicons and grammars as well as a sound computational architecture are crucial to the success of this paradigm. In this paper we discuss some paradigmatic issues in building computer programs that understand and generate natural language. We then illustrate some of the foundations of our approach to practical computational linguistics by describing a language for representing text meaning and an approach to developing an ontological model of an intelligent agent. This approach has been tested in the dionysus project at Carnegie Mellon University which involved designing and implementing a natural language understanding and generation system.
SEMANTICS AND APPLICATIONS
Natural language processing projects at the Center for Machine Translation of Carnegie Mellon University are geared toward designing and building large computational applications involving most crucial strata of language phenomena (syntactic, semantic, pragmatic, rhetorical, etc.) as well as major types of processing (morphological and syntactic parsing, semantic interpretation, text planning and generation, etc.). Our central application is machine translation which is in a sense the paradigmatic application of computational linguistics.
This paper stands in marked contrast to many of the others in this volume in that it is intended to be entirely tutorial in nature, presupposing little on the part of the reader but a user's knowledge of English, a modicum of good will, and the desire to learn something about the notion of unification as it has come to be used in theoretical and computational linguistics. Most of the ideas I shall be talking about were first introduced to the study of ordinary language by computational linguists and their adoption by a notable, if by no means representative, subset of theoretical linguists represents an important milestone in our field, for it is the first occasion on which the computationalists have had an important impact on linguistic theory. Before going further, there may be some value in pursuing briefly why this rapprochement between these two branches has taken so long to come about. After all, the flow of information in the other direction – from theoretician to computationalist – has continued steadily from the start.
PRODUCTIVITY
The aspects of ordinary language that have proved most fascinating to its students all have something to do with its productivity, that is, with the fact that there appears to be no limit to the different utterances that can be made and understood in any of the languages of the world. Certainly, speakers can make and understand utterances that they have never made or understood before, and it is presumably largely in this fact that the immense power and flexibility of human language resides.
The integration of syntactic and semantic processing has prompted a number of different architectures for natural language systems, such as rule-by-rule interpretation [221], semantic grammars [22] and cascaded ATNs [253]. The relationship between syntax and semantics has also been of central concern in theoretical linguistics, particularly following Richard Montague's work. Also, with the recent rapprochement between theoretical and computational linguistics, Montague's interpretation scheme has made its way into natural language systems. Variations on Montague's interpretation scheme have been adopted and implemented in several syntactic theories with a significant following in computational linguistic circles. The first steps in this direction were taken by Hobbs and Rosenschein [89]. A parser for LFG was augmented with a Montagovian semantics by Halvorsen [76]. GPSG has been similarly augmented by Gawron et al. [65], and Schubert and Pelletier [206] followed with a compositional interpretation scheme using a first order logic rather than Montague's computationally intractable higher-order intensional logic.
In parallel with these developments in the syntax/semantics interface, unification-based mechanisms for linguistic description had significant impact both on syntactic theory and syntactic description. But the Montagovian view of compositionality in semantics and the architectures for configuring the syntax/semantics interface were slower to achieve a similar revaluation in view of unification and the new possibilities for composition which it brought.
This chapter concerns experiments with Situation Schemata as a linguistic representation language, in the first instance for Machine Translation purposes but not exclusively so. The work reported is primarily concerned with the adaptation of the Situation Schemata language presented by Fenstad et al. [56] (but see also Fenstad et al. in this volume) to an implementation within grammars having interestingly broad linguistic coverage. We shall also consider the computational tools required for such an implementation and our underlying motivation. The chapter is therefore not solely concerned with the form of the representation language and its relation to any eventual interpretation language, but also with practical and methodological issues associated with the implementation of current theories in Computational Linguistics. A key theme is the interplay between linguistic information, representation and appropriate mechanisms for abstraction.
The rest of the chapter is divided into three further sections. Section 7.2 suggests that the problem of translation can shed light on a range of representational issues on the syntax/semantics border; it contains a brief introduction to Situation Semantics and the original version of Situation Schemata. Section 7.3 outlines the design of a computational toolkit for experimenting with unification-based descriptions, concentrating on details of the grammar formalism and its abstraction mechanisms. Section 7.4 considers the form of Situation Schemata that has been implemented within a grammar of German and some related representational issues that arise out of this work.