MONDRIAN : annotating and querying databases through colors and blocks
Faculty of Sciences. Mathematics and Computer Science
Los Alamitos, Calif. : IEEE Computer Society, 2006
Proceedings of the 22nd International Conference on Data Engineering (ICDE 2006), Atlanta, Ga, USA, 3-8 April 2006 / Liu, Ling [edit.]; et al.
Annotations play a central role in the curation of scientific databases. Despite their importance, data formats and schemas are not designed to manage the increasing variety of annotations. Moreover, DBMSs often lack support for storing and querying annotations. Furthermore, annotations and data are only loosely coupled. This paper introduces an annotation-oriented data model for the manipulation and querying of both data and annotations. In particular, the model allows for the specification of annotations on sets of values and for effectively querying the information on their association. We use the concept of a block to represent an annotated set of values. Different colors applied to the blocks represent different annotations. We introduce a color query language for our model and prove it to be both complete (it can express all possible queries over the class of annotated databases) and minimal (all the algebra operators are primitive). We present MONDRIAN, a prototype implementation of our annotation mechanism, and we conduct experiments that investigate the set of parameters which influence the evaluation cost for color queries.
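The block-and-color idea in the abstract can be illustrated with a small sketch. It is not MONDRIAN's implementation or its color algebra; it assumes, for illustration only, that a block is a subset of the attribute values of a single tuple and that a color is a plain string label. The names `Block`, `AnnotatedRelation`, `annotate`, `colors_of`, and `values_with_color` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical block: a set of attributes of one tuple sharing a color."""
    row: int          # index of the tuple the block belongs to
    attrs: frozenset  # attribute names covered by the block
    color: str        # annotation label (assumed to be a plain string)


class AnnotatedRelation:
    """A relation plus a list of colored blocks (illustrative sketch only)."""

    def __init__(self, schema, rows):
        self.schema = tuple(schema)
        self.rows = [tuple(r) for r in rows]
        self.blocks = []

    def annotate(self, row, attrs, color):
        """Attach one annotation (color) to a set of values in a tuple."""
        attrs = frozenset(attrs)
        unknown = attrs - set(self.schema)
        if unknown:
            raise KeyError(f"unknown attributes: {sorted(unknown)}")
        if not 0 <= row < len(self.rows):
            raise IndexError(f"no tuple at index {row}")
        self.blocks.append(Block(row, attrs, color))

    def colors_of(self, row, attr):
        """Query from data to annotations: colors covering one value."""
        return {b.color for b in self.blocks
                if b.row == row and attr in b.attrs}

    def values_with_color(self, color):
        """Query from annotations to data: values of each block of a color."""
        result = []
        for b in self.blocks:
            if b.color == color:
                tup = self.rows[b.row]
                result.append({a: tup[self.schema.index(a)]
                               for a in sorted(b.attrs)})
        return result


# Example data is invented for illustration.
genes = AnnotatedRelation(
    schema=("gene_id", "name", "function"),
    rows=[("g1", "abcA", "transport"), ("g2", "xyzB", "unknown")],
)
genes.annotate(0, {"name", "function"}, "verified by curator")
genes.annotate(0, {"function"}, "predicted by homology")
genes.annotate(1, {"function"}, "predicted by homology")
```

Keeping one color on a whole set of values, rather than one annotation per cell, is what lets a query ask which values were annotated together, for example `genes.values_with_color("verified by curator")`.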