To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
“Theory of computation begins with alphabets and flows as poetry in mathematical rhythms.”
Introduction
When we study computability, we study problems in an abstract sense. For example,
a. Addition is the problem of returning a third number that is the sum of two given numbers.
b. Travelling Salesman problem (TSP) is one in which a list of distances between some number of cities are given and the person is asked to find the shortest route so that he visits each city once and returns to the start.
c. Halting Problem (HP) is one in which a program is given some appropriate input and it needs to be decided whether the program, when run on that input, loops forever or halts.
Problems (b) and (c) are the two problems of computer interest.
In both these problems, the statement of the problem does not give the actual values needed to provide the result but just tells what kind of objects they are. A set of actual values for a problem is called an instance of the problem (by this terminology, all homework problems done by school students are considered as instances of problems).
In all these problems an instance is required that is the input and the relationship between the input and the output is to be understood. In order to solve these problems, there are certain things one should know.
Can it be solved algorithmically – is there any definite procedure that solves any instance of the problem in a finite amount of time? In other words, is it computable? Not all problems are computable.
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
‘Grammar is the mathematics of a language and mathematics is the grammar of creation.’
Introduction
In any language such as English, Hindi or Sanskrit, words can be combined in several ways. Naturally, some combinations form valid sentences, while others do not. The validity of a sentence is determined by the grammar of a language, which comprises a set of rules. For instance, ‘The boy prepares tea quickly’, although meaningless, is a perfectly legal sentence. In other words, the sentences in a language may be nonsensical, but they must obey the rules of grammar. The discussion in this chapter deals with only the syntax of sentence (the way the words are combined) and not with the semantics of sentences (meaning).
In the previous chapters, two different (though equivalent) methods: finite automata and regular expressions were introduced for describing languages. These methods have their own limitations in the sense that some simple languages, such as {0n1n|n ≥ 0}, cannot be described by these methods. Formal languages and grammars are widely used in connection with programming languages. During programming, we proceed with an intuitive knowledge of the languages, which leads to errors. Therefore, a precise description of the language is needed, at almost every step, which helps to understand the syntax diagrams found in programming texts. Among the ways in which programming languages can be defined precisely, grammars or context-free grammars are most widely used. This method happens to be a very powerful method and such grammars can describe certain features which have a recursive structure. Context-free-grammars (CFG) were first used in the study of human languages.
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
A language is a system of signs used to communicate information to others. However, the language of computation is a combination of both english and mathematics. Fundamentally, a computer is a symbol manipulator, in the sense, that it takes sequences of symbols as inputs and manipulates them as per the program specifications. These symbols are precise and unambiguous, unlike the language of humans. The first step in communicating a problem to a machine is the design of a proper language of computation and this is the fundamental object of computability.
While discussing about languages, it is important to note two cases:
a. During the evaluation of an input expression (e.g.: calculator), the language of arithmetic expression handles both the input and the output communication.
b. In the case of web form, the language simply describes all the legitimate inputs to the field and the output is simply a binary value-Yes or No.
Thus, the problems belonging to case (a), have both input and an output language, while, those belonging to case (b), have only an input language. It is of interest to note that, from the perspective of theory of computation, any type of problem can be expressed in terms of a language recognition.
Since a language is a medium of communication, it should be given some meaning (i.e. its semantics). But much of the manipulation of symbols, strings and language could be done effectively without understanding their semantics and this is the subject of this section. In other words, the mathematical study of the theory of computation begins with the understanding of the mathematics of symbols and strings.
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
P. K. Srimani, Professor, Department of Computer Science, Bangalore University,S. F. B. Nasir, Professor, Department of Computer Science, Bangalore University
The first edition of Language and the Internet appeared in 2001, with the preface predicting rapid advances in the field, and thus the need for revision on a shorter time-scale than is usual in book publication. The prediction turned out to be accurate, but in unexpected ways. I had anticipated that the four domains of e-mail, chat, virtual worlds, and the Web would evolve in interesting directions; but I did not expect to have to add an additional domain to this list – and certainly not two. Yet that is what has happened. As will already be apparent from chapters 1 and 2, the activities generally referred to as blogging and instant messaging have developed a linguistic character sufficiently distinctive as to require separate treatment. And other activities, especially in Internet telephony, are becoming apparent just over the linguistic horizon.
Blogging
Blogs were already in existence when I was preparing the first edition of this book in 2000, but they had achieved little public presence, and I was not aware of them. Five years on, and the name is everywhere, along with a whole family of derived nomenclature. Those who blog, bloggers, carry out the activity of blogging, setting up a blogsite with a unique web address in order to do so. They may also locate their page within a bigger site, which takes blog feeds from many sources, and includes other (non-blog) material.
The Internet is an electronic, global, and interactive medium, and each of these properties has consequences for the kind of language found there. The most fundamental influence arises out of the electronic character of the channel. Most obviously, a user's communicative options are constrained by the nature of the hardware needed in order to gain Internet access. Thus, a set of characters on a keyboard determines productive linguistic capacity (the type of information that can be sent); and the size and configuration of the screen determines receptive linguistic capacity (the type of information that can be seen). Both sender and receiver are additionally constrained linguistically by the properties of the Internet software and hardware linking them. There are, accordingly, certain traditional linguistic activities that this medium can facilitate very well, and others that it cannot handle at all. There are also certain linguistic activities which an electronic medium allows that no other medium can achieve. How do users respond to these new pressures, and compensate linguistically?
It is important to know what the various limitations and facilitations are. A well-established axiom of communication states that users should know the strengths as well as the restrictions of their chosen medium, in relation to the uses they subject it to and the purposes they have in mind. People have strong expectations of the Internet, and established users evidently have strong feelings about how it should be used to achieve its purposes. However, it is not a straightforward relationship.
At one level, it is extremely easy to define the linguistic identity of e-mail as a variety of language; at another level, it is surprisingly difficult. The easy part lies in the fixed discourse structure of the message – a structure dictated by the mailer software which has become increasingly standardized over the past twenty years or so. Just in the same way as we can analyse the functionally distinct elements that constitute a newspaper article (in terms of headline, body copy, illustration, caption, etc.) or a scientific paper (in terms of title, authorship, abstract, introduction, methodology, etc.), so we can see in e-mails a fixed sequence of discourse elements. They will be so familiar to likely readers of this book that they need only the briefest of expositions. The difficult part, to which the bulk of this chapter relates, lies in the range of opinions about the purpose of e-mail, as a communicative medium, and about the kind of language which is the most appropriate and effective to achieve that purpose. With around 80 billion e-mails being sent every day (in 2005), a consensus seems unlikely, especially when age, sex, and cultural differences are taken into account. At the same time, it ought at least to be possible to identify what the parameters of disagreement are, to develop a sense of the range of linguistic features which any characterization of e-mail would have to include.
It seems to be a standard convention for books dealing with digital technology to begin or end by warning their readers that everything they contain is going to be soon out of date; and a linguistic perspective on the subject is no exception. Any attempt to characterize the language of the Internet, whether as a whole or with reference to one of its constituent situations, immediately runs up against the transience of the technology. The different arenas of communication described in earlier chapters will not remain as they are for long, given that the technological developments upon which they rely are constantly evolving, putting users under constant pressure to adapt their language to the demands of new contexts, and giving them fresh opportunities to interact in novel ways. The readiness with which people do adapt language to meet the needs of new situations, which is at the heart of linguistic evolution – and which the central chapters of this book clearly demonstrate – is going to be fully exploited in the next few decades, with the emergence of yet more sophisticated forms of digitally mediated communication. Nor is the population using it any more stable: it is unusual to see a disclaimer of the kind used on the imprints page of this book, for example, but there is simply no guarantee that any of the URLs [uniform resource locators] listed in my footnotes and bibliography will still exist by the time this book appears.
The uncertain linguistic identity of Netspeak, in its various Internet manifestations, is presumably why so many usage dictionaries, guides, and rule books have appeared in recent years. People seem to have begun to sense that they are dealing with something new, as far as their linguistic intuitions are concerned. They are realizing that their established knowledge, which has enabled them to survive and succeed in spoken and written linguistic encounters hitherto, is no longer enough to guarantee survival and success on the Internet. Perhaps they have encountered the ‘painful and awkward lessons’ in social interaction which Patricia Wallace talked about (p. 18). Perhaps they have been misunderstood, misperceived, or attacked (flamed) because they have failed to notice the differences between this new medium of communication and the old. David Porter sums it up this way:
There are words, but they often seem to be words stripped of context, words desperately burdened by the lack of the other familiar markers of identity in this strange, ethereal realm. It is no wonder that these digitalized words, flung about among strangers and strained beyond the limits of what written language in other contexts is called upon to do, are given to frequent misreading, or that they erupt as often they do into antagonistic ‘flames’. In a medium of disembodied voices and decontextualized points of view, a medium, furthermore, beholden to the fetishization of speed, the experience of ambiguity and misreading is bound to be less an exception than the norm.
‘The vision I have for the Web is about anything being potentially connected with anything.’ This observation by the Web's inventor, Tim Berners-Lee, on the first page of his biographical account, Weaving the Web (1999), provides a characterization of this element of the Internet which truly strains the notion of ‘situation’ and the accompanying concept of a ‘variety’ of Internet language. After all, language, and any language, in its entirety, is part of this ‘anything’. The Web in effect holds a mirror up to the graphic dimension of our linguistic nature. A significant amount of human visual linguistic life is already there, as well as a proportion of our vocal life. So can it be given a coherent linguistic identity?
‘Graphic’ here refers to all aspects of written (as opposed to spoken) language, including typewritten, handwritten (including calligraphic), and printed text. It includes much more than the direct visual impression of a piece of text, as presented in a particular typography and graphic design on the screen; it also includes all those features which enter into a language's orthographic system (chiefly its spelling, punctuation, and use of capital letters) as well as the distinctive features of grammar and vocabulary which identify a typically ‘written’ as opposed to a ‘spoken’ medium of communication. Most web text will inevitably be printed, given the technology generally in use.