Search

3 - Common Representations of Multimedia Features
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 99-142
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Most features can be represented in the form of one (or more) of the four common base models: vectors, strings, graphs/trees, and fuzzy/probabilistic logic-based representations.
Many features, such as colors, textures, and shapes, are commonly represented in the form of histograms that quantify the contribution of each individual property (or feature instance) to themedia object.Given n different properties of interest, the vector model associates an n-dimensional feature vector space, where the ith dimension corresponds to the ith property. Thus, each vector describes the composition of a given multimedia data object in terms of its quantifiable properties.
Strings, on the other hand, are commonly used for representing media of sequential (or temporal) nature, when the ordinal relationships between events are more important than the quantitative differences between their occurrences. As we have seen in Section 2.3.6.4, because of their simplicity, string-based models are also used as less complex representations for more complex features, such as the spatial distributions of points of interest.
Graphs and trees are used for representing complex media, composed of other smaller objects/events that cannot be ordered to form sequences. Such media include hierarchical data, such as taxonomies and X3D worlds (which are easily represented as trees), and directed/undirected networks, such as hypermedia and social networks (where the edges of the graph represent explicit or implicit relationships between media objects or individuals).
When vectors, strings, trees, or graphs are not sufficient to represent the underlying imprecision of the data, fuzzy or probabilistic models can be used to deal with this complexity.

8 - Clustering Techniques
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 271-296
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In the previous chapter, we described mechanisms for indexing multimedia data for quick access. The indexing process, in general, is based on establishing some order between the data objects so that queries can be routed toward the ones that are likely to be matches for the query and prune away those objects that are non-matches. Thus, most indexing techniques are based on some form of data clustering. In fact, hierarchical multidimensional index structures are sometimes referred to as self-clustering techniques.
Naturally, establishing an order on the given set of data objects requires an understanding of the fundamental characteristics and features of the media and the use of data structures appropriate for these features. We have seen that the index structures applicable for different media (e.g., sequences, graphs, trees, and vectors) are based on different principles and operate differently from each other.
In many multimedia databases, however, we may not have prior knowledge about the explicit features of data. This is the case, for example, when we have “black-box programs” that can compare two objects or when the similarity of the pair is simply evaluated subjectively by users. In both cases, we can obtain information about distances and/or similarities between pairs of objects, but there are no explicit features that one can use as a basis for an index structure. As we have seen in Section 4.3, one possible solution in these cases is to map or embed the objects into a multidimensional space (using MDS or FastMap) using the available distance values.

Data Management for Multimedia Retrieval

K. Selçuk Candan, Maria Luisa Sapino
Published online:

05 July 2014

Print publication:

31 May 2010
- Book
- - Get access
    
    Buy a print copy
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Multimedia data require specialised management techniques because the representations of colour, time, semantic concepts, and other underlying information can be drastically different from one another. This textbook on multimedia data management techniques gives a unified perspective on retrieval efficiency and effectiveness. It provides a comprehensive treatment, from basic to advanced concepts, that will be useful to readers of different levels, from advanced undergraduate and graduate students to researchers and to professionals. After introducing models for multimedia data (images, video, audio, text, and web) and for their features, such as colour, texture, shape, and time, the book presents data structures and algorithms that help store, index, cluster, classify, and access common data representations. The authors also introduce techniques, such as relevance feedback and collaborative filtering, for bridging the 'semantic gap' and present the applications of these to emerging topics, including web and social networking.

10 - Ranked Retrieval
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 327-379
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Ranked query processing is important in many application domains, including information retrieval and multimedia, where results presented to the user need to be ordered based on their scores of matching.
As discussed in earlier chapters, fuzziness is inherent in multimedia retrieval for many reasons, including similarity of features, imperfections in the feature extraction algorithms, imperfections in the query formulation methods, partial match requirements, and imperfections in the available index structures. Data (whether captured in real time through sensory measurements or processed, materialized, and stored for later use) are many times accurate only within a margin of error. Also, in many cases the importance of a feature depends on how dominant it is in a particular data object and how discriminatory/rare the feature is in the entire data collection. The popular term frequency/inverse document frequency (TF-IDF) keyword weights (Section 4.2) used in text retrieval rely on this principle. The importance of the feature can also reflect the retrieval context. For example, a keyword, say, “entropy,” may carry different meanings and relevance and imply different semantic similarity relationships when used within a computer science context versus within its physics context. Thus, in many applications, the utility of a data element to a particular retrieval task depends on the user's query and the usage context. Consequently, users are usually not interested in obtaining all possible matches to a query, but only the k best results, where k is application specific or provided by the user.

2 - Models for Multimedia Data
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 20-98
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

A database is a collection of data objects that are organized in a way that supports effective search and manipulation. Under this definition, your personal collection of digital photos can be considered a database (more specifically an image database) if you feel that the software you are using to organize your images provides you with mechanisms that help you locate the images you are looking for easily and effectively.
Effective access, of course, depends on the data and the application. For example, in general, you may be satisfied if the images in your collection are organized in terms of a timeline or put into folders according to where they were taken, but for an advertising agency which is looking for an image that conveys a certain feeling or for a medical research center which is trying to locate images that contain a particular pattern, such a metadata-based organization (i.e., an organization not based on the content of the image, but on aspects of the media object external to the visual content) may not be acceptable. Thus, when creating a database, it is important to choose the right organization model.
A data model is a formalism that helps specify the aspects of the data relevant for their organization. For example, a content-based model would describe what type of content (e.g., colors or shape) is relevant for the organization of the data in the database, whereas a metadata-based model may help specify the metadata (e.g., date or place) relevant for the organization.

Preface
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp ix-x
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Database and multimedia systems emerged to address the needs of very different application domains. New applications (such as digital libraries, increasingly dynamic and complex web content, and scientific data management), on the other hand, necessitate a common understanding of both of these disciplines. Consequently, as these domains matured over the years, their respective scientific disciplines moved closer. On the media management side, researchers have been concentrating on media-content description and indexing issues as part of the MPEG7 and other standards. On the data management side, commercial database management systems, which once primarily targeted traditional business applications, today focus on media and heterogeneous-data intensive applications, such as digital libraries, integrated database/information-retrieval systems, sensor networks, bioinformatics, e-business applications, and of course the web.
There are three reasons for the heterogeneity inherent in multimedia applications and information management systems. First, the semantics of the information captured in different forms can be drastically different from each other. Second, resource and processing requirements of various media differ substantially. Third, the user and context have significant impacts on what information is relevant and how it should be processed and presented. A key observation, on the other hand, is that rather than being independent, the challenges associated with the semantic, resource, and context-related heterogeneities are highly related and require a common understanding and unified treatment within a multimedia data management system (MDMS). Consequently, internally a multimedia database management system looks and functions differently than a traditional (relational, object-oriented, or even XML) DBMS.

11 - Evaluation of Retrieval
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 380-397
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In the previous chapters, we have covered various feature extraction, indexing, clustering, and classification techniques, all of which transform the raw data collected through various capture devices into models and data structures that support efficient and effective matching and retrieval. Many of these techniques are, however, lossy in nature:
▪ Feature extraction algorithms need to map a potentially infinite, continuous feature space into a finite feature model that can be represented using a finite data structure.
▪ Feature selection (to avoid the dimensionality curse) for indexing and query processing usually involves some transformation of the data to highlight important features and to eliminate others that are not as important from consideration.
▪ Indexing, clustering, and classification algorithms often trade efficiency against effectiveness. Therefore, they can introduce both false hits and misses.
As we briefly discussed in Section 4.2.1, all forms of information loss may not have the same impact on the retrieval effectiveness. For example, false hits (which can be eliminated through postprocessing) are often acceptable, whereas misses (which cannot be eliminated) are not. On the other hand, in many other applications (especially in those cases where user queries are not precise and, thus, there are large numbers of matches), completeness of the result set is less important than the precise ranking of the few initial results: a ranking that can help the user pick a promising result from the first few is better than a ranking that is complete but puts the most promising results in the bottom of a long list.

7 - Indexing, Search, and Retrieval of Vectors
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 235-270
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

As we have seen in the previous chapters, it is common to map the relevant features of the objects in a database onto the dimensions of a vector space and perform nearest neighbor or range search queries in this space (Figure 7.1). The nearest neighbor query returns a predetermined number of database objects that are closest to the query object in the feature space. The range query, on the other hand, identifies and returns those objects whose distance from the query object is less than a provided threshold.
A naive way of executing these queries is to have a lookup file containing the vector representations of all the objects in the database and scan this file for the required matches, pruning those objects that do not satisfy the search condition. Although this approach might be feasible for small databases where all objects fit into the main memory, for large databases, a full scan of the database quickly becomes infeasible. Instead, multimedia database systems use specialized indexing techniques to help speed up search by pruning the irrelevant portions of the space and focusing on the parts that are likely to satisfy the search predicate (Figure 7.2).
Index structures that support range or nearest neighbor searches in general lay the data out on disk in sorted order (Figure 7.3(a)). Given a pointer to a data element on disk, this enables constraining further reads on the disk to only those disk pages that are in immediate neighborhood of this data element (Figure 7.3(b)).

Index
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 473-489
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Plate section
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp -
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

9 - Classification
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 297-326
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Bibliography
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 427-472
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

5 - Indexing, Search, and Retrieval of Sequences
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 181-207
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Sequences, such as text documents or DNA sequences, can be indexed for searching and analysis in different ways depending on whether patterns that the user may want to search for (such as words in a document) are known in advance and on whether exact or approximate matches are needed.
When the sequence data and queries are composed of words (i.e., nonoverlapping subsequences that come from a fixed vocabulary), inverted files built using B+-trees or tries (Section 5.4.1) or signature files (Section 5.2) are often used for indexing. When, on the other hand, the sequence data do not have easily identifiable word boundaries, other index structures, such as suffix trees (Section 5.4.2), or filtering schemes, such as ρ-grams (Section 5.5.4), may be more applicable.
In this section, we first discuss inverted files and signature files that are commonly used for text document retrieval. We then discuss data structures and algorithms for more general exact and approximate sequence matching.
INVERTED FILES
An inverted file index [Harman et al., 1992] is a search structure containing all the distinct words (subpatterns) that one can use for searching. Figure 5.1(a) shows the outline of the inverted file index structure:
▪ A word (or term) directory keeps track of the words that occur in the database. For each term, a pointer to the corresponding inverted list is maintained. In addition, the directory records the length of the corresponding inverted list. This length is the number of documents containing the term.

4 - Feature Quality and Independence: Why and How?
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 143-180
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

For most media types, there are multiple features that one can use for indexing and retrieval. For example, an image can be retrieved based on its color histogram, texture content, or edge distribution, or on the shapes of its segments and their spatial relationships. In fact, even when one considers a single feature type, such as a color histogram, one may be able to choose from multiple alternative sets of base colors to represent images in a given database.
Although it might be argued that storing more features might be better in terms of enabling more ways of accessing the data, in practice indexing more features (or having more feature dimensions to represent the data) is not always an effective way of managing a database:
▪ Naturally, more features extracted mean more storage space, more feature extraction time, and higher cost of index management. In fact, as we see in Chapter 7, some of the index structures require exponential storage space in terms of the features that are used for indexing. Having a large number of features also implies that pairwise object similarity/distance computations will be more expensive.
Although these are valid concerns (for example, storage space and communication bandwidth concerns motivate media compression algorithms), they are not the primary reasons why multimedia databases tend to carefully select the features to be used for indexing and retrieval.

1 - Introduction: Multimedia Applications and Data Management Requirements
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 1-19
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Among countless others, applications of multimedia databases include personal and public photo/media collections, personal information management systems, digital libraries, online and print advertising, digital entertainment, communications, long-distance collaborative systems, surveillance, security and alert detection, military, environmental monitoring, ambient and ubiquitous systems that provide real-time personalized services to humans, accessibility services to blind and elderly people, rehabilitation of patients through visual and haptic feedback, and interactive performing arts. This diverse spectrum of media-rich applications imposes stringent requirements on the underlying media data management layer. Although most of the existing work in multimedia data management focuses on content-based and object-based query processing, future directions in multimedia querying will also involve understanding how media objects affect users and how they fit into users’ experiences in the real world. These require better understanding of underlying perceptive and cognitive processes in human media processing. Ambient media-rich systems that collect diverse media from environmentally embedded sensors necessitate novel methods for continuous and distributed media processing and fusion schemes. Intelligent schemes for choosing the right objects to process at the right time are needed to allow media processing workflows to be scaled to the immense influx of real-time media data. In a similar manner, collaborative-filtering-based query processing schemes that can help overcome the semantic gap between media and users’ experiences will help the multimedia databases scale to Internet-scale media indexing and querying.
HETEROGENEITY
Most media-intensive applications, such as digital libraries, sensor networks, bioinformatics, and e-business applications, require effective and efficient data management systems.

12 - User Relevance Feedback and Collaborative Filtering
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 398-426
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

As we discussed in Section 1.2, retrieval in multimedia databases is inherently imprecise and subjective. Consequently, multimedia query processing usually involves answering ill-posed questions: there may be multiple ways to interpret the query and data, and the appropriate query processing strategy may be user- and use context dependent.
Imprecisions in retrieval can be due to many factors, including feature extraction algorithms that are imprecise, partial matching requirements in the query, and the imperfections in the underlying indexing, clustering, and classification algorithms. Moreover, in the absence of precise knowledge about the objects in the database, users’ initial queries may be too vague. The set of results provided by the system in response to such imprecisely formulated queries, however, may contain hints to help users make their (initially vague) specifications iteratively more precise. Especially when users are not sufficiently informed about the data (or sometimes of their interests) to formulate a precise initial query, feedback-based data exploration plays a critical role in helping users find the relevant information.
Given a query (say, an image example provided for similarity search), which features of the query object are relevant (and how much so) for the user's query may not be known in advance. Consequently, it is almost impossible to expect that a multimedia database will be able to provide perfect answers to a user's query in its first attempt. Furthermore, most of the (large number of) candidate matches are only marginally relevant to the user's query and must be eliminated from consideration.

6 - Indexing, Search, and Retrieval of Graphs and Trees
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp 208-234
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In Chapter 2, we have seen that most high-level multimedia data models (especially those that involve representation of spatiotemporal information, object hierarchies – such as X3D – or links – such as the Web) require tree or graph-based modeling. Therefore, similarity-based retrieval and classification commonly involve matching trees and graphs.
In this chapter, we discuss tree and graph matching. We see that, unlike the case with sequences, computing edit distance for finding matches may be extremely complex (NP-hard) when dealing with graphs and trees. Therefore, filtering techniques that can help prune the set of candidates are especially important when dealing with tree and graph data.
GRAPH MATCHING
Although, as we discussed in Section 3.3.2, graph matching through edit distance computation is an expensive task, there are various heuristics that have been developed to perform this operation efficiently. In the rest of this section, we consider three heuristics, GraphGrep, graph histograms, and graph probes, for matching graphs.
6.1.1 GraphGrep
Because the graph-matching problem is generally very expensive, there are various heuristics that have been developed for efficient matching and indexing of graphs. GraphGrep [Giugno and Shasha, 2002] is one such technique, relying on a path-based representation of graphs.
GraphGrep takes an undirected, node-labeled graph and, for each node in the graph, finds all paths that start at this node and have length up to a given, small upper bound, lp.

Contents
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp v-viii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Frontmatter
K. Selçuk Candan, Arizona State University, Maria Luisa Sapino, Università degli Studi di Torino, Italy
Book:

Data Management for Multimedia Retrieval

Published online:

05 July 2014

Print publication:

31 May 2010, pp i-iv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Search Results

Refine search

Refine search

Actions for selected content:

19 results

3 - Common Representations of Multimedia Features

Summary

8 - Clustering Techniques

Summary

Data Management for Multimedia Retrieval

10 - Ranked Retrieval

Summary

2 - Models for Multimedia Data

Summary

Preface

Summary

11 - Evaluation of Retrieval

Summary

7 - Indexing, Search, and Retrieval of Vectors

Summary

Index

Plate section

9 - Classification

Bibliography

5 - Indexing, Search, and Retrieval of Sequences

Summary

4 - Feature Quality and Independence: Why and How?

Summary

1 - Introduction: Multimedia Applications and Data Management Requirements

Summary

12 - User Relevance Feedback and Collaborative Filtering

Summary

6 - Indexing, Search, and Retrieval of Graphs and Trees

Summary

Contents

Frontmatter

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

19 results

Summary

Summary

Data Management for Multimedia Retrieval

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary