
Algebraic data integration*

Published online by Cambridge University Press: 02 November 2017

PATRICK SCHULTZ
Affiliation:
Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA, USA (e-mail: schultzp@mit.edu)
RYAN WISNESKY
Affiliation:
Categorical Informatics, Inc., Cambridge, MA, USA (e-mail: ryan@catinf.com)

Abstract


In this paper, we develop an algebraic approach to data integration by combining techniques from functional programming, category theory, and database theory. In our formalism, database schemas and instances are algebraic (multi-sorted equational) theories of a certain form. Schemas denote categories, and instances denote their initial (term) algebras. The instances on a schema S form a category, S-Inst, and a morphism of schemas F : S → T induces three adjoint data migration functors: Σ_F : S-Inst → T-Inst, defined by substitution along F, which has a right adjoint Δ_F : T-Inst → S-Inst, which in turn has a right adjoint Π_F : S-Inst → T-Inst. We present a query language based on for/where/return syntax in which each query denotes a sequence of data migration functors, give a pushout-based design pattern for performing data integration using our formalism, and describe the implementation of our formalism in a tool we call AQL (Algebraic Query Language).
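As an informal illustration of the functor Δ_F, the following Python sketch treats a schema as a finite graph with no equations and an instance as a family of finite sets and functions, which is a simplification of the algebraic theories used in the paper. Under that assumption, Δ_F sends a T-instance J to the S-instance obtained by composing with F. All names here (`follow`, `delta`, and the Employee/Department/City example) are hypothetical and are not taken from AQL.

```python
# A minimal sketch, assuming schemas are finite graphs without equations
# and instances are finite sets and functions (not the paper's algebraic
# theories). All names are illustrative, not part of AQL.

def follow(path, funcs, x):
    """Apply the functions along a path of T-edges to the element x."""
    for edge in path:
        x = funcs[edge][x]
    return x

def delta(S_edges, F_nodes, F_edges, J_sets, J_funcs):
    """Pull a T-instance (J_sets, J_funcs) back along F to an S-instance.

    S_edges: S-edge name -> (source node, target node)
    F_nodes: S-node -> T-node
    F_edges: S-edge name -> list of T-edge names (a path in T)
    """
    # Each S-node s receives the rows of the T-node F(s).
    I_sets = {s: set(J_sets[t]) for s, t in F_nodes.items()}
    # Each S-edge e acts by following the T-path F(e).
    I_funcs = {
        e: {x: follow(F_edges[e], J_funcs, x) for x in I_sets[src]}
        for e, (src, _tgt) in S_edges.items()
    }
    return I_sets, I_funcs

# Target schema T: Employee --worksIn--> Department --locatedIn--> City
J_sets = {
    "Employee": {"alice", "bob"},
    "Department": {"math", "cs"},
    "City": {"Cambridge", "Boston"},
}
J_funcs = {
    "worksIn": {"alice": "math", "bob": "cs"},
    "locatedIn": {"math": "Cambridge", "cs": "Boston"},
}

# Source schema S: Person --city--> City
S_edges = {"city": ("Person", "City")}

# Schema morphism F : S -> T, sending the edge `city` to a two-step path.
F_nodes = {"Person": "Employee", "City": "City"}
F_edges = {"city": ["worksIn", "locatedIn"]}

I_sets, I_funcs = delta(S_edges, F_nodes, F_edges, J_sets, J_funcs)
```

In this toy example the resulting S-instance has the employees as its Person rows, and `city` maps each person to the city of their department. The adjoints Σ_F and Π_F require more machinery (colimits and limits, respectively) and are not sketched here.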

Information

Type
Research Article
Copyright
Copyright © Cambridge University Press 2017 