Publication
Title
Semandaq : a data quality system based on conditional functional dependencies
Author
Abstract
We present SEMANDAQ, a prototype system for improving the quality of relational data. Based on the recently proposed conditional functional dependencies (CFDs), it detects and repairs errors and inconsistencies that emerge as violations of these constraints. We demonstrate the following functionalities supported by SEMANDAQ: (a) an interface for specifying CFDs; (b) a visual tool for automated detection of CFD violations in relational data, leveraging efficient SQL-based techniques; (c) extensive visual data exploration capabilities that provide the user with various measures of the quality of the data; (d) repair (cleaning) functionality that does not require excessive human interaction, built upon CFD-based cleaning algorithms. We also show how SEMANDAQ allows for a natural exploration of the quality of the obtained repairs. SEMANDAQ is a promising tool that provides easy access to user-friendly data quality facilities for any relational database system.
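To make the idea of SQL-based CFD violation detection concrete, the following is a minimal sketch and not SEMANDAQ's actual implementation. The `cust` table, its columns, the sample rows, and the two CFDs are all hypothetical and chosen only for illustration: a constant CFD stating that country code '01' with area code '908' implies city 'MH', and a variable CFD stating that, for country code '44', the zip code determines the street.

```python
import sqlite3

# Hypothetical sample rows: (id, cc, ac, city, zip, street).
ROWS = [
    (1, "44", "131", "EDI", "EH4 1DT", "Mayfield"),
    (2, "44", "131", "EDI", "EH4 1DT", "Crichton"),  # same zip, different street
    (3, "01", "908", "MH", "07974", "Mtn Ave"),
    (4, "01", "908", "NYC", "07974", "Mtn Ave"),     # wrong city for cc/ac pattern
    (5, "01", "212", "NYC", "10012", "Main St"),
    (6, "01", "212", "NYC", "10012", "Elm St"),      # allowed: the CFD only covers cc = '44'
]


def find_cfd_violations(conn):
    """Return (ids violating the constant CFD, zips violating the variable CFD)."""
    # Constant CFD (assumed): cc = '01' AND ac = '908' implies city = 'MH'.
    # A single tuple can violate it, so a plain selection is enough.
    constant = conn.execute(
        "SELECT id FROM cust "
        "WHERE cc = '01' AND ac = '908' AND city <> 'MH' "
        "ORDER BY id"
    ).fetchall()
    # Variable CFD (assumed): among tuples with cc = '44', zip determines street.
    # Violations involve groups of tuples, so group by zip and count streets.
    variable = conn.execute(
        "SELECT zip FROM cust WHERE cc = '44' "
        "GROUP BY zip HAVING COUNT(DISTINCT street) > 1 "
        "ORDER BY zip"
    ).fetchall()
    return [r[0] for r in constant], [r[0] for r in variable]


def demo():
    """Build an in-memory table with the sample rows and run both checks."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE cust (id INTEGER PRIMARY KEY, cc TEXT, ac TEXT, "
        "city TEXT, zip TEXT, street TEXT)"
    )
    conn.executemany("INSERT INTO cust VALUES (?, ?, ?, ?, ?, ?)", ROWS)
    try:
        return find_cfd_violations(conn)
    finally:
        conn.close()
```

In this sketch the condition of each CFD becomes a `WHERE` filter, which is what distinguishes a CFD from an ordinary functional dependency: rows 5 and 6 share a zip code with different streets but are not reported, because the dependency is only asserted for country code '44'.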
Language
English
Source (journal)
Proceedings of the VLDB Endowment
Publication
2008
ISSN
2150-8097
Volume/pages
1:2(2008), p. 1460-1463
Creation 14.01.2014
Last edited 21.11.2016