The previous chapter (Chapter 11) explained how user requirements directed our development of meeting support technology, more specifically meeting browsers and assistants. Chapters 3 to 9 discussed the enabling components, i.e. the multimodal signal processing necessary to build meeting support technology. In the following, we will present an overview of the meeting browsers and assistants developed both in AMI and related projects, as well as outside this consortium.
Introduction
Face-to-face meetings are a key method by which organizations create and share knowledge, and the last 20 years have seen the development of new computational technology to support them.
Early research on meeting support technology focused on group decision support systems (Poole and DeSanctis, 1989), and on shared whiteboards and large displays to promote richer forms of collaboration (Mantei, 1988, Moran et al., 1998, Olson et al., 1992, Whittaker and Schwarz, 1995, Whittaker et al., 1999). There were also attempts at devising methods for evaluating these systems (Olson et al., 1992). Subsequent research was inspired by ubiquitous computing (Streitz et al., 1998, Yu et al., 2000), focusing on direct integration of collaborative computing into existing work practices and artifacts. While much of this prior work has addressed support for real-time collaboration by providing richer interaction resources, another important research area is interaction capture and retrieval.
Interaction capture and retrieval is motivated by the observation that much valuable information exchanged in workplace interactions is never recorded, leading people to forget key decisions or repeat prior discussions.
While the meeting setting creates many challenges just in recognizing words and who is speaking them, once we have the words there is still much to be done if the goal is to understand the conversation. To do this, we need to understand both the language itself and the structure of the language being used.
The structure of language is multilayered. At a fine-grained, detailed level, we can look at the structure of the spoken utterances themselves. Dialogue acts, which segment and label the utterances into units each carrying one core intention, are one type of structure at this level. Another way of understanding language at this level is to focus on the subjective language used to express internal mental states, such as opinions, (dis-)agreement, sentiments, and uncertainty.
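Dialogue act tagging of this kind can be illustrated with a toy rule-based tagger. The label set and cue phrases below are illustrative assumptions, not those of any actual annotation scheme; real systems use statistical classifiers trained on annotated corpora rather than substring rules.

```python
# Toy dialogue act tagger: assigns each utterance one core intention
# based on surface cues. Labels and cue phrases are illustrative only;
# substring matching is deliberately naive.

def tag_dialogue_act(utterance: str) -> str:
    u = utterance.lower().strip()
    if u.endswith("?"):
        return "question"
    if any(cue in u for cue in ("i agree", "exactly", "good point")):
        return "agreement"
    if any(cue in u for cue in ("i disagree", "i don't think")):
        return "disagreement"
    if u.startswith(("let's", "we should", "i suggest")):
        return "suggestion"
    return "statement"

dialogue = [
    "Shall we use a plastic case?",
    "I agree, plastic keeps the cost down.",
    "Let's move on to the interface design.",
]
for utt in dialogue:
    print(tag_dialogue_act(utt), "-", utt)
```

Even this crude sketch shows the core idea: segmenting a conversation into units and attaching one intention label to each, which downstream components can then index or summarize.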
At a coarser level, language can be structured by the topic of conversation. Finally, within a given topic, there is a structure to the language used to make decisions. For specific phenomena such as decisions, language understanding based on elaborate domain models is now sufficiently advanced to capture the content of the conversation, allowing meetings to be indexed and summarized at a high level of understanding.
Finally, the language of spoken conversation differs significantly from written language. Frequent types of speech disfluencies can be detected and removed with techniques similar to those used for understanding language structure as described above.
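As a rough illustration of disfluency removal, a minimal cleanup pass might delete filled pauses and collapse immediate word repetitions. The pause inventory and patterns below are assumptions for the sketch; production systems use trained sequence models rather than regular expressions.

```python
import re

# Minimal disfluency cleanup for speech transcripts: removes filled
# pauses and collapses immediate word repetitions. The pause list is
# illustrative, not a complete inventory.

FILLED_PAUSES = r"\b(?:uh|um|erm|you know|i mean)\b"

def clean_transcript(text: str) -> str:
    # Drop filled pauses such as "um" and "uh".
    text = re.sub(FILLED_PAUSES, "", text, flags=re.IGNORECASE)
    # Collapse immediate repetitions ("the the" -> "the").
    text = re.sub(r"\b(\w+)(\s+\1\b)+", r"\1", text, flags=re.IGNORECASE)
    # Normalize the whitespace left behind.
    return re.sub(r"\s{2,}", " ", text).strip()

print(clean_transcript("so um we we should uh start with the the budget"))
```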
An algorithm for the tele-operation of mobile-manipulator systems with a focus on ease of use for the operator is presented. The algorithm allows for unified, intuitive, and coordinated control of mobile manipulators. It consists of three states. In the first state, a single 6-degrees-of-freedom (DOF) joystick is used to control the manipulator's position and orientation. The second state occurs when the manipulator approaches a singular configuration, resulting in the mobile base moving in a manner so as to keep the end-effector travelling in its last direction of motion. This is done through the use of a constrained optimization routine. The third state is entered when the operator returns the joystick to the home position. Both the mobile base and manipulator move with respect to one another keeping the end-effector stationary and placing the manipulator into an ideal configuration. The algorithm has been implemented on an 8-DOF mobile manipulator and the test results show that it is effective at moving the system in an intuitive manner.
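The three-state switching logic described above can be sketched as follows. The state names, the use of a manipulability measure as a singularity proxy, and the threshold values are illustrative assumptions, not the authors' implementation.

```python
# Sketch of the three-state mode selection for mobile-manipulator
# tele-operation. Thresholds and the singularity measure are assumed
# for illustration.

MANIP_THRESHOLD = 0.05   # below this, the arm is near a singularity
HOME_DEADBAND = 1e-3     # joystick magnitude treated as "home position"

def select_state(joystick_mag: float, manipulability: float) -> str:
    if joystick_mag < HOME_DEADBAND:
        # State 3: joystick returned home; base and arm move relative to
        # one another, keeping the end-effector stationary while the arm
        # is driven toward an ideal configuration.
        return "reconfigure"
    if manipulability < MANIP_THRESHOLD:
        # State 2: arm near a singular configuration; the base moves so
        # the end-effector keeps travelling in its last direction of
        # motion (via a constrained optimization in the real system).
        return "base_assist"
    # State 1: direct 6-DOF joystick control of end-effector pose.
    return "direct_control"
```

A supervisory loop would call `select_state` each control cycle and dispatch to the corresponding velocity controller; the hysteresis and optimization details are omitted here.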
Segmenting multi-party conversations into homogeneous speaker regions is a fundamental step towards automatic understanding of meetings. This information serves multiple purposes: as adaptation data for speaker and speech recognition, as a meta-data extraction tool for navigating meetings, and as input for automatic interaction analysis.
This task is referred to as speaker diarization and aims at inferring “who spoke when” in an audio stream, which involves two simultaneous goals: (1) estimating the number of speakers in the audio stream and (2) associating each speech segment with a speaker.
Diarization algorithms have been developed extensively for broadcast data, characterized by regular speaker turns, prompted speech, and high-quality audio, while processing meeting recordings presents different needs and additional challenges. On the one hand, the conversational nature of the speech involves very short turns and large amounts of overlapping speech; on the other hand, the audio is acquired nonintrusively using far-field microphones and is thus corrupted by ambient noise and reverberation. Furthermore, real-time and online processing are often required in order to enable the use of many applications while the meeting is actually going on. The next section briefly reviews the state of the art in the field.
State of the art in speaker diarization
Conventional speaker diarization systems are composed of the following steps: a feature extraction module that extracts acoustic features such as mel-frequency cepstral coefficients (MFCCs) from the audio stream; a speech/non-speech detection module that retains only the speech regions, discarding silence; an optional speaker change detection module that divides the input stream into small homogeneous segments, each uttered by a single speaker; and an agglomerative hierarchical clustering step that groups speech segments from the same speaker into the same cluster.
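The final clustering step can be sketched on synthetic per-segment features. Real systems cluster statistical models of MFCC segments with a stopping criterion such as the Bayesian information criterion; here, as an assumption for illustration, two artificial "speakers" are separated by cutting an agglomerative tree at a fixed distance threshold, which implicitly estimates the number of speakers.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

# Agglomerative clustering of per-segment feature vectors, the last
# stage of a conventional diarization pipeline. Features are synthetic
# 13-dimensional MFCC-like segment means for two artificial speakers.

rng = np.random.default_rng(0)
speaker_a = rng.normal(0.0, 0.3, size=(5, 13))   # 5 segments, speaker A
speaker_b = rng.normal(3.0, 0.3, size=(5, 13))   # 5 segments, speaker B
segments = np.vstack([speaker_a, speaker_b])

# Build the hierarchical clustering tree and cut it at a distance
# threshold; the number of resulting clusters is the estimated number
# of speakers.
tree = linkage(segments, method="average", metric="euclidean")
labels = fcluster(tree, t=5.0, criterion="distance")

print("estimated speakers:", len(set(labels)))
print("segment labels:", labels.tolist())
```

The threshold plays the role of the stopping criterion: set it too low and one speaker splits into several clusters, too high and distinct speakers merge.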
The basic modeling problem begins with a set of observed data yn = {yt : t = 1, 2, …, n}, generated by some physical machinery, where the elements yt may be of any kind. Since, no matter what they are, they can be encoded as numbers, we take them as such: natural numbers (with or without order) if the data come from finite or countable sets, and real numbers otherwise. Often each number yt is observed together with others x1,t, x2,t, …, called explanatory data, written collectively as a K × n matrix X = {xi,j}; the data are then written as yn ∣ X. It is convenient to use the terminology “variables” for the sources of these data. Hence, we say that the data {yt} come from the variable Y, and that the explanatory data are generated by variables X1, X2, and so on.
In physics the explanatory data often determine the data yn of interest, in which case the relation is called a “law,” but not so in statistical problems. By taking sufficiently many explanatory data we may also fit a function to the given set of observed data, but this is not a “law”: if the same machinery were to generate additional data yn+1, x1,n+1, x2,n+1, …, the function would not give yn+1. This is why the objective is to learn the statistical properties of the data yn, possibly in the context of the explanatory data.
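This distinction between fitting and learning a statistical regularity can be made concrete with a small numerical sketch. The data-generating process below is synthetic and assumed for illustration: a polynomial with as many parameters as observations interpolates the noisy data exactly, yet fails on the next observation, whereas the underlying statistical regularity is simply linear.

```python
import numpy as np

# n noisy observations from a simple linear regularity (synthetic data,
# for illustration only).
rng = np.random.default_rng(1)
n = 8
x = np.arange(n, dtype=float)
y = 0.5 * x + rng.normal(0.0, 0.2, size=n)

# A degree n-1 polynomial has n parameters and passes through every
# training point: a perfect "fit", but not a law.
coeffs = np.polyfit(x, y, deg=n - 1)
train_error = np.max(np.abs(np.polyval(coeffs, x) - y))

# The interpolant does not give the next observation y_{n+1}.
y_next_true = 0.5 * n
y_next_pred = np.polyval(coeffs, float(n))

print("max training error:", train_error)
print("prediction error at x = n:", abs(y_next_pred - y_next_true))
```

The training error is essentially zero while the extrapolation error is not, which is exactly the sense in which a fitted function is not a “law.”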
All science is either physics or stamp collecting.
(Ernest Rutherford)
The 1918 flu pandemic, also referred to as the Spanish flu, was a devastating
infectious disease. It is estimated that 50 million people, about 3% of the
world's population at the time, died of the disease. About 500 million people
were infected. The causative agent was an influenza virus. In this chapter we
will learn more about these viruses, making use of important molecular
biology databases and bioinformatics tools. These are useful not only for
learning about influenza viruses but for exploring just about any topic in
biology.
Short history of sequence databases
A vast amount of information is collected by projects around the world designed
to characterize genomes, genes, and proteins. The development of DNA
sequencing is particularly remarkable. One important task in bioinformatics is
to store all of this information in databases and, importantly, to make it
available to the scientific community for downloading and analysis. Numerous
dedicated individuals working on database projects are the unsung heroes of
bioinformatics and molecular biology (see also the quotation on stamp collecting
above).
Many kinds of information technology can be used to make meetings more productive, some of which are related to what happens before and after meetings, while others are intended to be used during a meeting. Document repositories, presentation software, and even intelligent lighting can all play their part. However, the following discussion of user requirements will be restricted to systems that draw on the multimodal signal processing techniques described in the earlier chapters of this book to capture and analyze meetings. Such systems might help people understand something about a past meeting that has been stored in an archive, or they might aid meeting participants in some way during the meeting itself. For instance, they might help users understand what has been said at a meeting, or even convey an idea of who was present, who spoke, and what the interaction was like. We will refer to all such systems, regardless of their purpose or when they are used, as “meeting support technology.”
This chapter reviews the main methods and studies that elicited and analyzed user needs for meeting support technology in the past decade. The chapter starts by arguing that what is required is an iterative software process that through interaction between developers and potential users gradually narrows and refines sets of requirements for individual applications. Then, it both illustrates the approach and lays out specific user requirements by discussing the major user studies that have been conducted for meeting support technology.
Professor Miomir Vukobratović passed away on March 11, 2012 at the age of 81. However, his achievements in robotics will remain with us always.
Miomir Vukobratović was born in 1931. He graduated in 1957 from the Faculty of Mechanical Engineering, University of Belgrade, where he also obtained his first PhD in 1964. In January 1958 he joined the Aeronautical Institute in Belgrade. At the beginning of 1965 he moved to the Mihajlo Pupin Institute in Belgrade, where he became director of the Robotics Laboratory. In 1980, he became professor at the Production Engineering Department of the Faculty of Mechanical Engineering, University of Belgrade.
Meetings are a rich resource of information that, in practice, is mostly untouched by any form of information processing. Even now it is rare that meetings are recorded, and fewer still are annotated for access purposes. Examples of the latter include only meetings held in parliaments, courts, hospitals, banks, and similar settings, where a record is required for decision tracking or legal obligations. In these cases a labor-intensive manual transcription of the spoken words is produced. Giving much wider access to this rich content is the main aim of the AMI consortium projects, and there are now many examples of interest in that access – through the release of commercial hardware and software services. Especially with the advent of high-quality telephone and videoconferencing systems, the opportunity to record, process, recognize, and categorize the interactions in meetings is recognized even by skeptics of speech and language processing technology.
Of course meetings are by nature an audio-visual experience, and humans make extensive use of visual and other sensory information. Illustrating this rich landscape of information is the purpose of this book, and many applications can be implemented without even looking at the spoken word. However, it is still verbal communication that forms the backbone of most meetings and accounts for the bulk of the information transferred between participants. Hence automatic speech recognition (ASR) is key to accessing the information exchanged and is the most important component required for most higher-level processing.
SMS language exhibits special phenomena and important deviations from natural language. Every day an impressive number of chat messages, SMS messages, and e-mails are sent all over the world. This widespread use makes it important to develop systems that normalize SMS language into natural language. However, typical machine translation approaches are difficult to adapt to SMS language because of the many irregularities this kind of language exhibits. This paper presents a new approach to SMS normalization that combines lexical and phonological translation techniques with disambiguation algorithms at two levels: lexical and semantic. The proposed method does not depend on a large annotated corpus, which is difficult to build, and is applied in two different domains, showing its ease of adaptation across languages and domains. The results obtained by the system outperform some existing methods of SMS normalization, despite the fact that the Spanish language and the corpus created have features that complicate the normalization task.
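The lexical layer of such a normalization system can be sketched as a dictionary lookup with a fallback to the original token. The lexicon entries below are illustrative Spanish SMS forms assumed for the example; the paper's actual system additionally combines phonological translation with lexical and semantic disambiguation, none of which is modeled here.

```python
# Toy lexical normalization: map known SMS tokens to standard forms,
# leaving unknown tokens untouched. Lexicon entries are illustrative.

LEXICON = {
    "q": "que",
    "xq": "porque",
    "tb": "también",
    "mñn": "mañana",
}

def normalize_sms(message: str) -> str:
    return " ".join(LEXICON.get(tok.lower(), tok) for tok in message.split())

print(normalize_sms("nos vemos mñn xq hoy no puedo"))
```

A pure lookup cannot resolve ambiguous abbreviations (one SMS form mapping to several standard words), which is precisely where the disambiguation layers come in.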